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Visualization  of  phosphorylation  of  tyrosine  residues  in  the  nicotinic  acetylcholine  receptor  at 
cholinergic  synapses  in  the  electric  organ  (electroplax)  of  the  electric  ray  Torpedo.  The  electro- 
plax  were  immunofluorescently  labeled  with  rhodamine- coupled  a-bungarotoxin  to  localize  the 
receptor  (red)  and  with  antibodies  to  phosphotyrosine  to  localize  phosphotyrosine  residues 
(green).  The  two  fluorescent  signals  are  superimposable;  however,  they  have  been  artificially 
separated  to  allow  a  side-by-side  comparison  of  the  labeling  patterns.  Tyrosine  phosphorylation 
of  the  nicotinic  receptor  in  the  postsynaptic  cell  is  regulated  by  the  innervating  neuron  and  plays 
a  role  in  regulating  the  sensitivity  of  the  receptor  to  its  neurotransmitter  and  in  clustering  the 
receptor  beneath  the  nerve  terminal. 
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Foreword 


HE  Howard  Hughes  Medical  Institute  (HHMI)  is  pleased  to  present  the  sixth  volume  in  the 
series  Research  in  Progress.  From  its  origin  in  1986  as  a  means  of  providing  a  brief  but 
informative  overview  of  the  research  being  carried  out  by  the  Institute's  investigators,  this 
publication  has  become  a  valuable  source  of  information  for  not  only  the  entire  HHMI  commu- 
nity but  also  many  scientists  and  nonscientists  who  are  interested  in  current  progress  in  biomed- 
ical research  in  general.  One  measure  of  the  usefulness  of  these  reports  is  that  last  year  approxi- 
mately 2,000  copies  were  distributed,  many  to  individuals  requesting  it  for  the  first  time. 

New  readers  may  be  interested  in  other  HHMI  publications,  some  of  which  describe  the  pro- 
grams of  the  Institute  or  deal  with  general  topics  in  scientific  research  and  education.  The  Annual 
Scientific  Report  is  the  official  archival  record  of  the  Institute  as  a  Medical  Research  Organization 
(MRO).  It  contains  a  description  of  the  research  by  each  HHMI  investigator,  including  a  bibliogra- 
phy, and  highlights  other  activities  of  the  Institute.  A  general  Annual  Report  of  the  Howard 
Hughes  Medical  Institute  describes  for  a  lay  audience  the  various  HHMI  programs  and  gives  a 
summary  of  Institute  financial  data. 

A  series  of  reports  for  a  general  audience  on  subjects  of  current  scientific  interest  is  also  published. 
The  first  report,  produced  last  year  and  entitled  Finding  the  Critical  Shapes,  describes  exciting 
advances  in  structural  biology  and  how  the  determination  of  the  structure  of  biologically  important 
molecules  and  the  correlation  of  structure  with  function  are  leading  to  understanding  of  life  processes 
and  new  approaches  to  the  prevention  and  treatment  of  diseases.  This  report  was  widely  acclaimed  by 
a  broad  spectrum  of  readers,  especially  teachers  who  found  it  useful  in  their  classrooms.  Over  33,000 
copies  have  been  distributed.  A  second  report,  on  genetics  and  human  diseases,  entitled  Blazing  a 
Genetic  Trail,  has  recently  been  released;  it  is  clear  that  it  also  will  be  highly  successful  and  in  great 
demand,  as  the  first  printing  of  45,000  copies  has  almost  been  exhausted  in  a  few  weeks  in  response  to 
many  requests,  including  again  teachers  all  across  the  country. 

HHMI  has  a  large  grants  program  that  complements  its  research  activities  as  an  MRO.  It  is  antici- 
pated that  this  year  grant  awards  will  total  approximately  $47  million.  The  grants  program  supports 
education  in  the  biomedical  and  related  sciences  at  the  precollege,  college  undergraduate,  graduate, 
and  postdoctoral  levels.  These  initiatives  are  described  in  detail  in  an  annual  publication  entitled 
Grants  for  Science  Education.  In  1990  a  report  on  Support  for  Science  Education  and  Research, 
summarizing  the  level  of  funding  by  foundations  and  voluntary  health  associations,  was  also 
published. 

In  addition  to  providing  an  up-to-date  report  on  the  rapid  progress  in  biomedical  research  as 
illustrated  by  the  work  being  done  in  HHMI  laboratories,  this  issue  of  Research  in  Progress  reflects 
the  continued  expansion  of  the  Institute's  activities.  As  of  July  1 ,  1 99 1 ,  there  were  223  HHMI  investi- 
gators based  at  52  institutions  across  the  nation.  Their  research  is  the  core  of  the  Institute's  activities, 
and  we  invite  you  to  share,  in  these  pages,  in  the  excitement  of  discovery  and  the  advances  in  the 
knowledge  of  biological  processes  and  disease  mechanisms  that  hold  great  promise  for  the  betterment 
of  the  human  condition. 
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Introduction 


THE  annual  publication  of  Research 
in  Progress  has  become  one  of  the 
highlights  of  the  Howard  Hughes  Medi- 
cal Institute's  research  activities.  By  pro- 
viding a  synoptic  overview  of  the  re- 
search being  conducted  by  the  Institute's 
investigators,  in  essentially  nontechnical 
language,  it  has  come  to  be  greatly  ap- 
preciated not  only  by  those  interested  in 
the  Institute's  activities  as  a  Medical  Re- 
search Organization,  but  by  many  others 
who  view  it  as  an  accessible  introduction 
to  current  biomedical  research. 
The  present  volume  in  this  series  follows  fairly 
closely  the  pattern  that  has  evolved  over  the  past 
three  or  four  years:  it  provides  a  snapshot  of  the 
status  of  the  Institute's  research  in  early  1991 .  As 
in  the  previous  volume,  this  issue  includes  a  se- 
ries of  introductory  essays  that  are  intended  to 
assist  the  reader  who  has  had  little  or  no  back- 
ground in  biology  or  medicine.  These  essays  are 
not  intended  to  be  exhaustive  nor  simply  to  repli- 
cate material  readily  available  in  most  standard 
biology  or  medical  texts;  rather  they  are  intended 
to  serve  as  an  expanded  glossary  of  terms,  defin- 
ing in  a  general  way  many  of  the  terms  used  in  the 
individual  reports.  To  emphasize  this  feature  of 
the  essays,  these  technical  terms  are  shown  in 
bold  type.  The  essays  are  based  on  an  initial  set  of 
drafts  prepared  for  each  of  the  five  programmatic 
areas  in  which  the  Institute  conducts  research: 
cell  biology  and  regulation;  genetics;  immunol- 
ogy; neuroscience;  and  structural  biology.  The 
original  drafts  were  kindly  prepared  by  Drs. 
Richard  O.  Hynes,  Philip  Leder,  Charles  A.  Jane- 
way,  Eric  R.  Kandel,  and  Stephen  C.  Harrison,  re- 
spectively. We  are  grateful  to  them  and  to  a  num- 
ber of  their  colleagues  within  the  Institute  who 
commented  on  the  essays  or  provided  additional  in- 
formation and,  in  some  cases,  clarified  ambiguities 


or  corrected  errors  in  the  original  text.  In  this  year's 
volume  the  text  of  the  essays  has  been  supple- 
mented by  a  number  of  illustrations,  derived  for  the 
most  part  from  general  texts  or  scientific  papers  that 
are  identified  in  the  accompanying  legends.  We 
hope  that  these  diagrams  and  photographs  will  make 
the  essays  even  more  useful. 

For  readers  who  wish  to  know  about  specific 
research  being  conducted  by  HHMI  investigators, 
we  have  provided  once  again  a  detailed  index. 
The  index  lists  almost  every  topic  or  term,  from 
ahd-A  (a  gene  that  directs  the  segmental  develop- 
ment of  the  abdomen — first  identified  in  the  fruit 
fly  Drosophila  melanogaster)  to  the  zeppelin 
gene  and,  in  between,  the  less  arcane  subjects  of 
AIDS,  cancer,  cystic  fibrosis,  diabetes,  hemo- 
philia, muscular  dystrophy,  and  obesity.  Several 
readers  have  remarked  how  useful  the  addition  of 
an  index  has  been,  and  we  are  grateful  once  more 
to  Diana  Witt  for  preparation  of  the  index. 

As  in  previous  years,  the  greater  part  of  the  vol- 
ume consists  of  a  series  of  short  reports  by  the 
investigators  associated  with  the  Institute  during 
the  past  year.  The  fact  that  this  volume  is  appre- 
ciably larger  than  its  predecessor  simply  reflects 
the  increased  number  of  investigators  whose 
work  is  represented,  since  each  is  limited  to  no 
more  than  two  printed  pages.  The  reports  submit- 
ted by  each  investigator  have  been  collated  and 
edited  by  Dr.  Claire  H.  Winestock,  Senior  Re- 
search Program  Administrator;  Elizabeth  Cowley, 
Copy  Editor;  William  T.  Carrigan,  Editor/Writer; 
Gail  Markley,  Manager  of  Publications;  and  Kim- 
berly  A.  Cornejo,  Permissions  Editor.  We  are 
grateful  to  them  and  to  the  many  investigators 
who  have  provided  us  with  suitable  illustrations 
taken  from  their  ongoing  research.  Not  only  do 
the  illustrations  help  to  clarify  the  written  text, 
but  they  also  greatly  enhance  the  aesthetic  appeal 
of  the  volume. 


Cell  Biology  and  Regulation  Program 

The  Cell  Biology  and  Regulation  Program  is  the 
oldest  of  the  Institute's  research  programs.  Origi- 
nally referred  to  as  Metabolic  Regulation,  which 
reflected  the  program's  roots  in  clinical  studies 
of  metabolic  and  endocrine  disorders,  the  title  of 
this  program  was  changed  five  years  ago  to  reflect 
more  accurately  its  principal  theme — the  biol- 
ogy of  individual  cells,  the  factors  that  regulate 
their  normal  growth  and  distinctive  functions, 
and  the  ways  in  which  cells  interact  with  each 


other.  In  this  sense  the  program  has  come  to  oc- 
cupy a  critical  position  in  the  Institute's  research 
endeavors,  and  it  lies  firmly  within  the  main- 
stream of  contemporary  biological  research. 

The  human  body  contains  several  million  mil- 
lion cells  of  a  thousand  or  more  distinct  types. 
Research  in  the  field  of  cell  biology  seeks  to  un- 
derstand how  these  various  cells  are  constructed 
and  organized,  how  they  diff^er  from  one  another, 
how  they  sense  and  respond  to  outside  influ- 
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ences,  how  they  interact  with  their  neighbors  to 
form  more  complex  tissues  and  organs,  and,  in 
general,  how  the  cells  of  the  body  are  integrated 
to  produce  an  appropriately  functioning  organ- 
ism. Equally  important,  research  in  this  area  is 
aimed  at  understanding  how  these  cellular  func- 
tions are  perturbed  by  disease.  To  this  extent  the 
problems  addressed  by  investigators  in  the  Cell 
Biology  and  Regulation  Program  inevitably  im- 
pinge on  related  work  in  genetics,  development, 
neuroscience,  and  immunology.  For  a  Medical 
Research  Organization  it  is  especially  gratifying 
to  see  how  many  of  the  insights  gleaned  from 
these  studies  are  already  beginning  to  throw  light 
on  such  medically  important  problems  as  dia- 
betes, heart  disease,  cancer,  muscular  dystrophy, 
cystic  fibrosis,  and  a  number  of  other  genetic 
disorders. 

The  strikingly  rapid  advances  that  have  oc- 
curred in  cell  biology  in  recent  years  have  been 
due,  in  large  part,  to  earlier  progress  in  biochem- 
istry and  cellular  physiology,  but  especially  to 
the  dramatic  developments  that  have  occurred  in 
molecular  biology  since  the  early  1960s.  The 
techniques  developed  in  these  fields  have  been 
invaluable  to  cell  biology,  which  has  always  been 
quick  to  apply  different  and  newly  emerging  ap- 
proaches to  the  solution  of  the  many  problems  of 
cell  structure  and  function.  To  understand  the 
types  of  research  being  conducted  in  contempo- 
rary cell  biology,  it  may  be  helpful  to  begin  with  a 
general  account  of  a  "typical"  animal  cell  (Fig- 
ure 1). 

Near  the  center  of  each  cell  is  the  nucleus, 
which  contains  the  genes  (Figure  2)  that  are 
made  of  DNA  and  encode  the  information  neces- 
sary to  construct  an  entire  organism  and  maintain 
its  day-to-day  activities.  The  entire  complement 
of  genes  is  the  genome,  which  comprises  a  set  of 
instructions  encoded  in  the  sequences  of  the  DNA 
molecules  (as  described  more  fully  in  the  section 
on  genetics).  The  human  genome  consists  of  46 
chromosomes — 22  pairs  of  autosomes  and  2  sex 
chromosomes.  The  two  copies  of  each  autosome 
are  inherited  from  the  mother  and  the  father,  re- 
spectively; in  females  there  are  two  X  chromo- 
somes (one  from  each  parent) ,  while  in  males  the 
Y  chromosome  is  always  inherited  from  the  father 
and  the  X  chromosome  (Figure  3)  is  always  in- 
herited from  the  mother.  The  46  chromosomes 
comprise  a  total  of  about  3  billion  pairs  of  nu- 
cleotides. Estimates  vary,  but  it  is  thought  that 
there  may  be  as  many  as  100,000  genes  in  the 
human  genome.  These  genes  vary  in  length  from 


around  1,000  to  about  2  million  nucleotides. 
Each  gene  encodes  the  information  for  a  particu- 
lar cellular  structure  or  function.  This  informa- 
tion, which  can  be  likened  to  a  computer  lan- 
guage, is  first  read  (transcribed)  into  RNA,  and 
the  message  contained  within  the  nucleotide  se- 
quence of  the  RNA  molecule  is  then  decoded 
(translated)  by  the  machinery  of  the  cell  into  a 
different  language,  or  chemical  structure.  While 
the  RNA  transcript  mirrors  exactly  the  DNA  se- 
quence of  the  gene,  the  messenger  RNA  (mRNA) 
that  is  translated  into  the  amino  acid  sequence  of 
the  encoded  protein  is  a  highly  edited  message 
(Figure  4).  Generally  several  intervening  (or 
noncoding)  sequences  called  introns  are  selec- 
tively removed  from  the  transcript.  Introns  ac- 
count for  a  considerable  proportion  of  the  DNA  in 
all  higher  organisms  and,  together  with  several 
other  noncoding  stretches  of  DNA  (spacer  DNA, 
satellite  DNA,  and  other  repetitive  DNA  se- 
quences) ,  are  sometimes  referred  to  as  selfish  or 
junk  DNA.  In  humans  it  is  estimated  that  more 
than  90  percent  of  the  DNA  in  the  genome  is  of 
this  kind.  The  possible  functions  (if  any)  of  this 
noncoding  DNA  are  not  known,  but  the  mecha- 
nisms whereby  the  coding  sequences  or  exons 
are  spliced  out  and  joined  together  (sometimes  in 
different  order — a  process  known  as  alternative 
splicing)  is  a  subject  of  considerable  interest  at 
present.  The  chemical  language  of  the  cell  has  20 
different  characters  or  units,  known  as  amino 
acids,  which  are  linked  together,  again  in  linear 
arrays,  to  make  proteins. 

Whereas  DNA  is  the  blueprint  directing  the 
cell's  development  and  function,  proteins  are  the 
molecules  from  which  cells  are  built  and  which 
carry  out  most  cellular  functions.  Most  genes  en- 
code proteins,  and  each  cell  contains  about 
10,000  different  types  of  protein.  That  is,  each 
cell  uses  only  about  1 0  percent  of  the  total  set  of 
genes  at  any  one  time.  This  raises  two  of  the  cen- 
tral questions  in  cell  biology  today:  1 )  How  are 
genes  turned  on  and  off  so  that  each  cell  type 
expresses  only  its  appropriate  set  of  genes  and 
contains  only  its  correct  complement  of  proteins? 
2)  How  are  the  genes  in  a  given  cell  regulated,  so 
that  the  cell  can  respond  appropriately  to  outside 
influences  by  changing  either  the  pattern  of 
genes  it  uses  (and  thus  the  kinds  of  proteins  it 
produces)  or  the  amounts  of  each  protein  it 
makes  (Figure  5) .  In  the  next  section  we  describe 
the  considerable  progress  that  has  been  made 
recently  in  deciphering  the  DNA  sequence 
elements  that  determine  whether  a  gene  is 
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Figure  1.  A  small  section  of  a  typical  mammalian  cell  as  seen  in  the  electron  microscope. 
Part  of  the  nucleus  (N)  with  its  surrounding  membrane  and  nucleolus  (Nu )  are  shown  in 
the  upper  right;  the  cytoplasm  that  occupies  the  rest  of  the  electron  micrograph  contains 
several  different  organelles,  including  several  mitochondria  (M),  parts  of  the  Golgi  com- 
plex (GC),  and  the  rough  endoplasmic  reticulum  (RER).  Magnification  approximately 
50,000  X. 

Micrograph  provided  by  David  D.  Sabatini. 


xxvii 


Introduction 


Figure  2.  From  genes  and  chromosomes  to  cells,  organs,  and  entire  bodies. 

Adapted  from  an  illustration  by  Warren  Isensee  for  The  Chronicle  of  Higher  Educa- 
tion, September  3,  1986,  with  permission  from  the  publisher. 
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Figure  3-  A  schematic  representation  of  the  X  chro- 
mosome showing  the  location  of  the  characteristic 
banding  patterns  and  the  distribution  of  several  of 
the  genes  that  have  been  localized  to  this 
chromosome. 

Excerpt  from  Stephens,  J.C.,  Mador,  M  L.,  Cava- 
naugh,  M.L.,  Gradie,  M.I.,  and  Kidd,  K.K.  The  Human 
Genome  Map.  Science  250:  October  12,  1990.  Copy- 
right 1990  by  the  AAAS. 
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Figure  4.  The  primary  RNA  transcript  of  a  gene  is  very  much  larger  than  the  message  that  is 
actually  translated  to  form  a  protein.  A  variable  number  of  noncoding  sequences  called  introns 
are  edited  out,  and  the  remaining  exons  are  spliced  together  by  a  complex  mechanism  shown  in 
the  drawings  to  the  left. 

From  Dugaiczyk,  A.,  Woo,  S.L.C.,  Colbert,  D.A.,  Lai,  B.C.,  Mace,  M.L.,  Jr.,  and  O'Malley,  B. 
1979.  Proc  Natl  Acad  Sci  USA  76:2255-2257. 
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Figure  5.  Transcription  of  a  gene  by  the  enzyme  RNA  polymerase  and  the  subsequent  translation  of  this  messenger  RNA  ( mRNA ) 
by  ribosomes  to  form  a  protein. 

From  Raven,  P.H.,  and  Johnson,  G.B.  1988.  Understanding  Biology.  St.  Louis,  MO:  Times  Mirror/Mosby  College  Publishing, 
p.  267. 
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Figure  6.  The  synthesis  and  intracellular 
trafficking  of  proteins  from  the  endoplas- 
mic reticulum  through  the  Golgi  apparatus 
to  the  cell  membrane. 

From  Alberts,  B.,  Bray,  D.,  Lewis,  J.,  Raff, 
M.,  Roberts,  K.,  and  Watson,  J.D.  1989.  Mo- 
lecular Biology  of  the  Cell,  2nd  edition.  New 
York:  Garland,  p.  455. 
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transcribed.  Proteins,  commonly  known  as  tran- 
scriptional factors,  bind  to  these  DNA  se- 
quence elements  and  determine  when,  and  how 
frequently,  a  gene  is  transcribed.  Since  this  often 
influences  how  much  of  the  encoded  protein  is 
produced  by  a  cell,  such  information  provides 
one  level  of  insight  about  how  cells  differ  from 
one  another. 

Another  large  issue  concerns  the  way  in  which 
proteins  are  deployed  in  the  cell.  A  cell  is  not  just 
a  bag  of  randomly  distributed  molecules;  it  has  a 
highly  organized  internal  structure.  The  DNA  that 
makes  up  the  genes  and  the  machinery  for  gene 
transcription  are  packaged  in  the  nucleus,  which 
is  surrounded  by  a  membrane  that  separates  it 
from  the  cytoplasm,  which  comprises  the  rest  of 
the  cell.  Nearly  all  cellular  functions  are  com- 
partmentalized in  other  cellular  structures  collec- 
tively referred  to  as  organelles.  Among  the  more 
prominent  organelles  in  animal  cells  are  the  mi- 
tochondria, which  are  the  cells'  principal  en- 
ergy source;  lysosomes,  which  are  concerned 
largely  with  the  degradation  of  foreign  materials 
and  of  cellular  proteins  whose  functions  have 
been  fulfilled;  the  rough  endoplasmic  reticu- 
lum, a  complex,  ribosome-studded  system  of 
membranes  responsible  for  the  synthesis  of  pro- 
teins secreted  by  the  cells;  and  the  Golgi  appara- 
tus, which  both  modifies  proteins  (by  adding 
other  chemical  groups  such  as  sugars)  and  pack- 
ages them  for  transport  to  their  appropriate  loca- 
tions, such  as  the  cell  membrane.  Most  of  these 
organelles  are  themselves  surrounded  by  mem- 
branes that  separate  their  functions  from  those  of 
the  rest  of  the  cell.  It  is  easy  to  see  how  such  a 
compartmentalized  structure  allows  the  cell  to 
organize  its  different  processes  efficiently,  but  it 
poses  an  organizational  problem  that  is  of  consid- 
erable current  interest  in  cell  biology:  How  are 
particular  proteins  routed  to  the  correct  organ- 
elles? As  we  began  to  learn  about  the  structures  of 
individual  proteins,  it  was  discovered  that  there 
are  specific  "signals"  built  into  proteins  that  tar- 
get them  to  particular  organelles  or  particular  lo- 
cations within  the  cell  and  that  there  are  distinc- 
tive cellular  machineries  that  "detect"  these 
signals  and  steer  the  proteins  in  particular  direc- 
tions (Figure  6).  Thus  certain  proteins  are  di- 
rected to  the  nucleus,  while  others  are  targeted 
for  insertion  into  the  surface  membrane  of  the 
cell,  and  yet  others  are  destined  for  export  out  of 
the  cell  as  secretory  products. 

At  another  level  of  organization,  it  has  become 
evident  that  the  organelles  within  a  cell  are  also 


not  distributed  randomly.  In  many  cells  one  can 
identify  a  distinct  "top"  and  "bottom."  Other 
cells,  while  not  polarized  in  this  manner,  have 
asymmetric  structures  arranged  in  such  a  way  that 
given  organelles  are  distributed  in  different,  but 
quite  reproducible,  patterns.  Still  other  cells,  es- 
pecially nerve  cells,  have  unusual  extensions  or 
processes  that  may  be  many  hundred  times  as 
long  as  the  body  of  the  cell.  In  each  case  organ- 
elles have  to  be  transported  to  particular  loca- 
tions and  maintained  there;  they  do  not  drift 
about  haphazardly  inside  the  cell. 

The  asymmetric  shapes  of  cells  and  the  loca- 
tions of  their  organelles  both  rely  on  cellu- 
lar structures  known  collectively  as  the  cyto- 
skeleton.  The  cytoskeleton  consists  of  several 
types  of  elongated  threads  or  filaments  (micro- 
filaments, microtubules,  intermediate  fila- 
ments) ,  each  made  of  specific  proteins  that  are 
so  designed  as  to  assemble  spontaneously  into  fil- 
aments. These  cytoskeletal  elements  serve  as  a 
form  of  internal  scaffolding  to  maintain  the  shape 
of  the  cell,  and  as  a  system  of  tracks  along  which 
organelles  can  be  transported.  Recent  research 
has  disclosed  a  variety  of  proteins  that  function  as 
molecular  motors  that  can  move  proteins  and  or- 
ganelles that  attach  to  them  along  particular  cyto- 
skeletal filaments  to  various  locations  within  the 
cell.  We  also  know  that  the  appropriate  organiza- 
tion of  cytoskeletal  filaments  and  motor  proteins 
can,  in  some  cases,  contribute  to  cell  motility, 
that  is,  the  movement  of  the  entire  cell  from  one 
location  to  another.  Such  cellular  movements  are 
especially  important  in  development  but  con- 
tinue to  play  an  integral  role  in  the  life  of  certain 
cells  even  in  adult  life. 

All  these  processes — gene  transcription,  pro- 
tein targeting,  organelle  movement,  and  cell  mo- 
tility— must  be  carefully  regulated  so  that  cells 
respond  appropriately  to  different  situations.  The 
same  is  true  of  many  other  cellular  processes.  For 
example,  the  proliferation  of  cells  that  takes 
place  by  cell  division  involves  copying  or  repli- 
cating the  genes,  the  breakdown  of  the  nuclear 
membrane,  the  separation  of  the  duplicated 
chromosomes  into  two  equivalent  sets,  division 
of  the  cell  into  two  daughter  cells,  the  re-forma- 
tion of  a  nucleus  in  each  of  the  daughter  cells, 
and  finally  the  resumption  of  normal  function  in 
both  cells.  This  whole  process,  which  is  known  as 
the  cell  cycle,  takes  place  whenever  cells  divide 
and  remains  an  important  part  of  the  life  of  all  but 
a  few  cell  types.  Many  types  of  cells — for  exam- 
ple, the  cells  in  the  blood  and  skin — are  contin- 
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ually  being  formed  and  replaced.  Other  cells  pro- 
liferate rarely,  and  some  divide  only  during  early 
development;  the  nerve  cells  that  make  up  the 
brain,  for  example,  proliferate  rapidly  during  de- 
velopment, but  no  further  cell  division  occurs 
until  death  some  70  or  more  years  later.  Obvi- 
ously, cell  proliferation  must  be  tightly  con- 
trolled. Recent  advances  have  shown  that  the  cell 
cycle  is  controlled  by  a  set  of  proteins  whose  role 
is  to  modify  other  proteins  selectively  and  thus 
regulate  their  functions.  One  common  way  in 
which  proteins  are  modified  (but  certainly  not 
the  only  one)  is  to  attach  a  small  chemical  group, 
such  as  a  phosphate  group,  to  a  protein.  Proteins 
that  attach  phosphate  groups  to  other  proteins 
are  called  protein  kinases.  Control  of  many 
aspects  of  the  cell  cycle  and,  indeed,  of  many 
other  cellular  functions  relies  on  complex  con- 
trol networks  of  protein  kinases  acting  on  key 
proteins  at  pivotal  stages  in  the  life  of  the  cell. 

Thus  far  we  have  considered  only  processes  oc- 
curring within  a  cell.  An  important  related  set  of 
issues  concerns  how  cells  interact  with  each 
other  and  how  they  respond  to  the  external  envi- 
ronment. Each  cell  is  surrounded  by  a  surface 
(or  plasma)  membrane,  which  serves  as  a  se- 
lective barrier  separating  the  inside  of  the  cell 
from  the  world  outside  itself.  Embedded  in  this 
membrane  are  several  types  of  proteins.  One  es- 
sential class  are  transporters,  specialized  for 
the  ordered  movement  in  and  out  of  the  cell  of 
nutrients,  ions  and  other  small  molecules  that  are 
essential  for  normal  cell  function. 

A  second  group  of  cell  surface  proteins  are  re- 
ceptors, which  bind  other  types  of  molecules 
that  interact  with  the  cell.  As  the  name  suggests, 
receptors  serve  to  receive  input  from  the  cell's 
external  environment.  They  are  of  many  different 
types.  The  largest  group  binds  peptide  hormones 
or  diffusible  factors  produced  locally  or  at  a  dis- 
tance by  other  cells,  but  another  important  group 
serves  to  transport  materials  like  cholesterol  from 
outside  to  the  interior  of  the  cell.  Typically  these 
receptors  have  three  parts:  an  external  part  or  li- 
gand-binding  domain  that  can  bind  the  hormone 
or  diffusible  factor,  a  transmembrane  domain  that 
spans  the  cell  membrane,  and  an  intracellular 
part  that  can  interact  with  internal  components  of 
the  cell.  The  binding  of  a  hormone  or  other  diffu- 
sible factor  to  such  a  receptor  triggers  in  some 
way,  as  yet  undetermined,  a  signal  inside  the  cell. 
These  signals  are  of  many  types.  Some  receptors 
are  protein  kinases  that  are  selectively  activated 
by  binding  the  appropriate  external  factor; 


others,  when  activated,  lead  to  the  release  of  dif- 
fusible, small  molecules,  such  as  calcium  ions  or 
cyclic  nucleotides.  These  diffusible  second  mes- 
senger molecules  in  turn  activate  other  control 
mechanisms  inside  the  cell,  including  protein  ki- 
nases and  other  regulatory  molecules.  In  this  way 
the  triggering  of  a  receptor  from  outside  cells  can 
result  in  a  cascade  of  events  that  ultimately  con- 
trols the  various  intracellular  processes  discussed 
earlier.  One  of  the  current  "hot  topics"  in  cell 
biology  research  concerns  the  nature  and  mecha- 
nisms of  cell  surface  receptor  signaling  and  the 
control  circuits  inside  cells  that  link  receptor  ac- 
tivity to  other  control  mechanisms,  including 
those  that  regulate  gene  function  and  the  control 
of  the  cell  cycle. 

A  special  subset  of  this  class  of  receptors  are 
those  that  respond  to  the  release  of  chemical  sig- 
nals (transmitters)  at  the  specialized  endings  of 
nerve  cell  processes  (Figure  7) .  The  released  neu- 
rotransmitters bind  to  the  external  part  of  the  cell 
surface  receptor  and  in  doing  so  may  open  an  ion 
channel  or  trigger  the  activation  of  a  second 
intracellular  message.  Since  the  majority  of 
nerve  signals  are  transmitted  from  cell  to  cell 
in  this  way,  the  analysis  of  this  class  of  recep- 
tors is  (as  we  shall  see  in  the  section  on  neuro- 
science)  one  of  the  central  issues  in  contem- 
porary neuroscience. 

Another  class  of  cell  surface  receptors  is  in- 
volved in  the  adhesion  of  cells,  either  to  their 
neighbors  or  to  the  extracellular  matrix,  a 
complex  group  of  secreted  proteins  and  polysac- 
charides that  assemble  into  an  organized  mesh- 
work  on  the  cell  surface.  Depending  upon  the 
cell  type  and  environment,  the  extracellular  ma- 
trix performs  various  functions  (Figure  8).  In  a 
petri  dish,  for  example,  the  extracellular  matrix 
provides  a  cushion  on  which  the  cell  sits.  In  the 
epidermis,  the  extracellular  matrix  helps  to  form 
the  basement  membrane,  which  anchors  the 
epidermis  to  the  rest  of  the  skin.  In  connective 
tissues,  the  extracellular  matrix  completely 
surrounds  most  cells  and  is  often  more  extensive 
in  its  distribution  than  the  cells  themselves.  In 
this  case,  the  extracellular  matrix  helps  to  pro- 
vide the  body's  architectural  framework. 

Cellular  adhesion,  which  plays  a  crucial  role  in 
cell,  tissue,  and  organ  structure  and  in  cell  move- 
ments, depends  on  sp)ecialized  cell  adhesion  recep- 
tors that  are  conneaed  to  the  intracellular  cytoskele- 
ton.  It  is  also  likely  that  cells  can  signal  to  one 
another  via  cell  adhesion  receptors.  Decisions  as  to 
whether  a  cell  remains  stationary,  or  where  and 


tXtXmXH 


Introduction 


postsynaptic  cell 


Action  potential 
triggers  entry  of 
calcium  ions  (Ca^*) 
into  the  presynaptic 
terminal 


©  ©  @  ©J 


Synaptic  vesicles 
fuse  with  the  presynaptic 
membrane,  releasing 
transmitter. 


The  transmitter  binds  to  proteins 
in  the  postsynaptic  membrane, 
changing  their  conformation  and 
either  opening  an  ion  channel  or 
activating  a  second  messenger 
system  within  the  postsynaptic  cell 


,©  ©  ®  ©  ^  ®  . 


--<WWUV^  ^-rWWWV^ 


The  transmitter  is  removed 
from  the  synaptic  cleft,  and 
the  postsynaptic  receptors 
revert  to  their  original 
conformation. 


Figure  7.  A  summary  of  the  essential  events  at  a  chemical  synapse  following  the  arrival  of  an  action  potential  in  the  axon 
terminal. 

Adapted  from  Alberts,  B.,  Bray,  D.,  Lewis,  J.,  Raff,  M.,  Roberts,  K.,  and  Watson,  J. D.  1989.  Molecular  Biology  of  the  Cell,  2nd 
edition.  New  York:  Garland,  p.  1077. 


Figure  8.  A:  A  section  of  human  skin  stained  with  an  anti- 
body against  the  protein  keratin.  Note  how  the  cells  are 
packed  together  to  form  the  distinctive  structure  of  skin.  The 
dark  structures  within  each  cell  are  the  nuclei  that  do  not 
contain  keratin  and  so  are  not  labeled. 

B:  A  fluorescence  micrograph  showing  the  presence  of  kera- 
tin filaments  in  cultured  skin  cells.  The  keratin  filaments 
have  been  revealed  by  a  fluorescently  labeled  antibody 
against  the  keratin  protein. 

Courtesy  of  Elaine  V.  Fuchs. 
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when  it  should  move,  and  what  shape  it  should  take 
up,  are  all  affected  by  cell  adhesion  events.  And 
these,  in  turn,  are  largely  dependent  on  the  pattern 
and  functions  of  cell  adhesion  receptors  deployed 
on  the  surfaces  of  the  cells. 

Eluf  idating  the  normal  functions  of  cells  is  an 
important  first  step  in  understanding  how  these 
processes  go  awry  in  a  number  of  human  diseases. 
For  example,  it  is  now  known  that  alterations  in 
certain  normal  genes  (called  oncogenes)  can 
contribute  to  cancer.  It  is  also  clear  that  many 
oncogenes  encode  proteins  that  are  involved  in 
the  regulation  of  specific  cellular  functions: 
some  oncogene-encoded  proteins  are  growth  fac- 
tors; others  are  cell  surface  receptors;  yet  others 
are  signaling  molecules,  protein  kinases,  and 
transcription  factors.  Many  other  human  diseases 
are  known  to  be  due  to  defects  of  one  or  another 
of  the  cellular  processes  reviewed  above.  Thus 
disturbances  in  insulin  production  lead  to  dia- 
betes mellitus,  defects  in  the  extracellular  matrix 


can  cause  osteogenesis  imperfecta,  and  abnormal- 
ities in  cell  adhesion  receptors  may  result  in 
various  bleeding  disorders.  Indeed,  one  of  the 
major  insights  in  pathology  and  medicine  is  that 
all  disease  processes  are  ultimately  attributable 
to  the  breakdown  of  one  or  more  cellular  func- 
tions. In  the  twenty-first  century,  we  can  be  fairly 
certain  that  medicine  will  be  concerned  largely 
with  the  identification  and  treatment  of  specific 
disordered  cell  functions.  We  already  know  of 
many  disorders  that  can  be  attributed  to  distur- 
bances in  particular  organelles.  What  is  particu- 
larly encouraging  to  researchers  in  cell  biology 
today  is  that  new  avenues  are  beginning  to  be 
perceived  for  therapy,  as  the  molecular  bases  of 
various  disordered  functions  become  known.  In- 
deed, one  of  the  especially  appealing  aspects  of 
modern  cell  biological  research  is  the  immediacy 
with  which  fundamental  research  advances  are 
having  an  impact  on  medically  important 
problems. 
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Genetics  Program 

The  emergence  of  what  is  sometimes  referred 
to  as  the  "new  genetics"  has  contributed  more  to 
our  fundamental  understanding  of  biology  and 
medicine  in  the  past  two  decades  than  any  other 
advance  in  biomedical  science.  Not  surprisingly, 
this  development  has  come  to  assume  a  central 
place  in  all  biological  research,  making  it  possi- 
ble to  examine  biological  processes  at  a  level  of 
resolution  that  was  considered  quite  impossible 
just  20  years  ago.  "Gene  cloning,"  "recombinant 
DNA  technology,"  "genetic  engineering,"  and 
the  "Human  Genome  Initiative"  are  phrases  that 
have  entered  everyday  language,  but  the  possibili- 
ties they  offer  for  major  advances  in  biology  and 
medicine  have  yet  to  be  fully  appreciated.  Given 
the  central  role  of  genetics  in  modern  biology 
and  the  current  sense  of  excitement  that  the  new 
genetics  has  generated,  it  is  appropriate  that  the 
Genetics  Program  is  by  far  the  largest  research 
program  within  HHMI. 

Historically,  one  of  the  first  applications  of 
Mendel's  classic  laws  of  inheritance  was  to  the 
analysis  of  certain  human  diseases.  In  the  early 
years  of  this  century,  Archibald  Garrod,  an  En- 
glish physician,  noted  that  a  number  of  relatively 
rare  diseases  tended  to  occur  in  families,  often  in 
families  with  consanguineous  marriages.  The  pat- 
tern of  occurrence  of  these  rare  diseases  followed 
Mendel's  laws,  discovered  almost  50  years  ear- 
lier. Mendel's  work  on  inherited  characteristics 
in  plants  was  neglected  for  many  years,  but  after  it 
was  rediscovered,  around  the  turn  of  the  century, 
it  was  quickly  established  that  the  laws  of  genet- 
ics are  universal  and  govern  inheritance  in  organ- 
isms as  disparate  as  peas  and  worms,  mice  and 
fruit  flies,  bacteria  and  human  beings.  Beginning 
with  Garrod,  these  genetic  laws  were  applied  to  a 
host  of  inherited  diseases,  as  it  was  realized  that 
the  genetic  makeup  of  an  individual  can  have  a 
profound  effect  on  his  or  her  health  and  well- 
being.  However,  an  understanding  of  what  genes 
are  and  how  they  function  had  to  await  the  discov- 
ery, in  1944,  that  the  genetic  material  is  DNA, 
and,  in  1953  and  1961,  of  the  double  helix  and 
the  genetic  code,  respectively. 

Much  of  our  early  understanding  of  the  action 
of  genes  came  from  experiments  that  took  advan- 
tage of  the  universality  of  gene  action  by  using 
simple  organisms,  especially  bacteria  and  their 
viruses,  as  model  systems.  The  cardinal  discovery 
was  the  identification  of  deoxyribonucleic  acid 
(DNA)  as  the  fundamental  chemical  in  which  ge- 
netic information  is  encoded.  But  the  finding  that 


DNA  has  a  double-stranded,  mirror  image  struc- 
ture provided  the  first  clear  insight  as  to  how  this 
information  could  be  replicated  and  passed  on 
from  one  generation  of  organisms  to  the  next  (Fig- 
ures 9  and  1 0) .  Understanding  how  the  chemical 
language  of  DNA  could  be  used  to  direct  the  syn- 
thesis of  other  cellular  constituents,  especially 
proteins,  came  with  the  discovery  of  the  nature  of 
the  genetic  code.  These  great  advances  will  al- 
ways be  viewed  as  the  high  watermark  of  the  early 
molecular  stage  of  modern  genetics. 

Notwithstanding  these  dramatic  developments 
— arguably  the  most  important  in  biology  since 
the  publication  of  Darwin's  On  the  Origin  of  Spe- 
cies in  1859 — the  molecular  details  of  the  genes 
of  higher  organisms  remained  hidden  from  view 
by  the  enormous  complexity  of  their  genomes 
(Figure  11).  Fortunately,  the  genes  of  simple  or- 
ganisms were  accessible,  because  they  are  rela- 
tively few  in  number  (involving  in  many  cases  an 
assemblage  of  as  few  as  3,000  base  pairs,  as  the 
building  blocks  of  DNA  are  called)  compared  to 
the  human  genome,  which  probably  contains 
about  3  billion  base  pairs. 

The  problem  of  genetic  complexity  has  been 
finally  overcome  in  the  past  1 0  years  by  the  devel- 
opment of  the  powerful  new  genetic  methods 
known  collectively  as  recombinant  DNA  tech- 
nology. This  technology  allows  researchers  to 
isolate  specific  genes  from  complex  mixtures,  to 
prepare  them  in  sufficiently  large  amounts  that 
their  entire  molecular  structure  can  be  deter- 
mined, and  to  move  them  from  one  group  of  cells 
or  from  one  organism  to  another,  so  that  their 
functional  properties  can  be  identified  and  their 
products  produced  in  abundance.  In  a  number  of 
cases,  medically  valuable  products  such  as  hu- 
man insulin,  growth  hormone,  antihemophilic 
factor,  and  TPA  (tissue  plasminogen  activator) 
have  been  commercially  manufactured  in  this 
way  and  are  now  being  used  therapeutically. 

Most  of  these  accomplishments  depend  on  the 
technique  commonly  referred  to  as  gene  clon- 
ing. In  this  process  a  gene  (for  example,  the  hu- 
man insulin  gene,  which  is  a  few  thousand  base 
pairs  in  length)  is  isolated  from  a  complex  mix- 
ture of  human  DNA  fragments  and  transferred 
into  the  genetic  apparatus  of  a  much  simpler  or- 
ganism such  as  a  bacterium.  Since  bacteria  multi- 
ply very  rapidly,  the  inserted  human  insulin  gene 
is  amplified  (cloned)  along  with  the  genes  of  its 
simple  host.  It  is  then  a  relatively  straightfor- 
ward process  to  purify  the  cloned  gene  and  to 
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Figure  9.  The  famous  DMA  dou- 
ble helix. 

From  Alberts,  B.,  Bray,  D., 
Lewis,  J.,  Raff,  M.,  Roberts,  K., 
and  Watson,  J. D.  1989.  Molecu- 
lar Biology  of  the  Cell,  2nd  edi- 
tion. New  York:  Garland,  p.  99. 
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Figure  10.  An  illustration  of  how  DNA  is  repli- 
cated so  as  to  form  two  new  strands  that  are 
exactly  complementary  to  the  original  DNA 
sequences  from  which  they  derive  their 
sequences. 

From  Raven,  P.H.,  and  Johnson,  G.B. 
1988.  Understanding  Biology.  St.  Louis,  MO: 
Times  Mirror/Mosby  College  Publishing, 
p.  277. 
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chromosome  of  150,000,000  nucleotide  pairs,  containing  about  3000  genes 


Figure  11.  The  organization  of  genes  on  a  typical  vertebrate  chromosome.  Proteins 
that  bind  to  the  DNA  in  regulatory  regions  determine  whether  a  gene  is  transcribed; 
although  often  located  on  the  5'  side  of  a  gene,  as  shown  here,  regulatory  regions  can 
also  be  located  in  introns,  in  exons,  or  on  the  3'  side  of  a  gene.  The  intron  sequences 
are  removed  from  the  primary  RNA  transcripts  that  encode  protein  molecules  to 
produce  a  messenger  RNA  (mRNA )  molecule.  The  figure  given  here  for  the  number  of 
genes  per  chromosome  is  only  a  minimal  estimate. 

From  Alberts,  B.,  Bray,  D.,  Lewis,  J.,  Raff,  M.,  Roberts,  K.,  and  Watson,  J. D.  1989. 
Molecular  Biology  of  the  Cell,  2nd  edition.  New  York:  Garland,  p.  487. 
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Figure  12.  A  drawing  of  the  hemoglo- 
bin molecule  and  the  single-amino 
acid  change  in  the  fi-chain  of  the  mole- 
cule that  causes  sickle  cell  anemia. 

From  Raven,  P.H.,  and  Johnson, 
G.B.  1988.  Understanding  Biology.  St. 
Louis,  MO:  Times  Mirror/Mosby  Col- 
lege Publishing,  p.  255. 


Figure  13-  We  can  determine  where  particular  genes 
are  being  expressed  by  using  a  radioactively  labeled 
probe  that  selectively  binds  to  the  RNA  transcript  of 
the  gene  and  examining  the  distribution  of  the  radio- 
activity in  a  tissue  autograph.  This  technique,  known 
as  in  vitro  hybridization,  was  used  in  this  case  to  show 
the  expression  of  one  of  the  keratin  genes  in  a  hair 
follicle.  The  micrograph  on  the  left  is  viewed  under 
darkfield  optics  ( the  exposed  silver  grains  in  the  auto- 
radiograph  appear  as  white  dots);  that  on  the  right  is 
seen  with  conventional  optics,  and  the  radioactivity  is 
marked  by  black  dots. 

Courtesy  of  Elaine  V.  Fuchs. 
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determine  the  sequence  of  nucleotides  that  en- 
codes the  relevant  genetic  instructions. 

One  of  the  most  important  developments  to 
emerge  from  gene  cloning  and  sequencing  is 
that  they  permit  geneticists  to  determine  exactly 
how  a  gene  may  have  been  altered  (genetic  alter- 
ations of  this  kind  are  called  mutations)  to  pro- 
duce an  inherited  disease.  While  a  majority  of  the 
investigators  in  the  HHMI  Genetics  Program  are 
directing  their  efforts  toward  understanding  the 
principles  that  govern  the  action  of  genes,  a  large 
and  growing  number  are  trying  to  identify  the 
mutations  responsible  for  some  of  the  esti- 
mated 4,000  human  genetic  defects.  Among  the 
genes  being  studied  are  those  responsible  for 
muscular  dystrophy,  cystic  fibrosis,  several  forms 
of  hemoglobinopathy,  chronic  granulomatous 
disease,  phenylketonuria,  polyposis  coli,  neu- 
rofibromatosis, osteogenesis  imperfecta,  and  he- 
mophilia. Most  of  the  genes  responsible  for  these 
diseases  have  already  been  cloned  and  their 
structures  determined;  in  this  way  the  precise  mo- 
lecular effects  of  the  mutations  that  cause  the  dis- 
orders are  now  being  determined.  Information 
that  has  been  derived  in  this  way  is  being  used  in 
several  ways.  For  example,  in  some  cases  it  is  be- 
ing used  to  counsel  affected  families  about  the 
risks  they  face  in  having  additional  children;  in 
others  it  is  being  used  to  develop  tests  that  are 
critical  for  prenatal  diagnosis,  and,  in  the  case  of 
hemophilia  and  chronic  granulomatous  disease, 
it  is  already  being  used  to  provide  appropriate 
therapy.  Information  about  some  of  these  genes 
and  their  protein  products  has  also  told  us  a  great 
deal  about  the  role  of  the  relevant  proteins  in 
development  (e.g.,  the  protein  dystrophin, 
which  is  normally  found  in  muscle  but  is  missing 
in  most  cases  of  muscular  dystrophy)  or  about  the 
mechanisms  required  for  normal  function  (e.g., 
in  the  lungs  and  pancreas  in  cystic  fibrosis,  or  for 
the  mechanisms  of  blood  clotting  in  hemo- 
philia). It  is  the  continuous  interplay  between 
the  development  of  new  basic  knowledge  and  its 
application  to  the  understanding  of  human  dis- 
ease that  is  so  dramatically  informing  modern 
medicine. 

The  execution  of  a  genetic  program  in  a  cell 
obviously  cannot  be  left  to  chance.  To  bring 
about  an  ordered  series  of  changes  in  a  develop- 
mental or  metabolic  process,  each  genetic  in- 
struction must  be  activated  at  a  specific  time,  and 
the  product  of  the  gene  must  be  produced  in  a 
specific  amount  commensurate  with  the  needs  of 
the  organism.  As  was  pointed  out  in  the  section 


on  cell  biology,  different  sets  of  genes  are  ex- 
pressed in  each  cell  type  so,  for  example,  those 
expressed  in  muscle  cells  are  likely  to  be  quite 
different  from  those  expressed  in  skin  cells  or  in 
bone.  Since  with  few  exceptions  (one  of  which 
will  be  discussed  in  the  section  on  immunology) 
each  cell  contains  the  same  genetic  information, 
there  must  be  some  mechanism  or  mechanisms 
that  insure  that  the  appropriate  genes  are  being 
expressed  in  each  cell  type.  The  processes  that 
govern  these  crucial  steps  are  generally  referred 
to  as  gene  regulation. 

Once  again,  the  study  of  inherited  human  dis- 
eases has  revealed  mutations  that  can  disrupt 
those  regulatory  programs.  For  example,  it  is  now 
known  that  certain  inherited  anemias  are  brought 
about  by  specific  mutations  that  affect  the  regula- 
tory apparatus  of  the  genes  that  encode  the  red 
blood  cell  protein,  hemoglobin  (Figure  12).  An 
even  larger  number  of  examples  could  be  cited 
from  work  on  simpler  organisms  to  show  how 
powerful  this  mutational  analysis  has  been  for 
our  understanding  of  genetic  regulation.  How- 
ever, it  will  suffice  to  say  here  that  the  work  of  a 
considerable  number  of  HHMI  investigators  has 
this  as  its  primary  theme. 

As  discussed  in  the  section  on  cell  biology  and 
regulation,  genetic  information  is  usually  con- 
veyed through  a  series  of  steps  from  the  nucleus 
of  the  cell  to  the  surrounding  cytoplasm,  where  it 
is  decoded  or  translated  to  form  the  protein  prod- 
ucts that  execute  the  relevant  genetic  program. 
This  process  involves  making  many  copies  of  the 
gene,  in  the  closely  related  chemical,  RNA.  The 
process  of  copying  a  gene  from  DNA  into  RNA  is 
called  transcription.  This  is  one  of  the  key 
points  at  which  the  flow  of  genetic  information 
can  be  regulated.  Genes  that  are  not  scheduled 
for  expression  in  a  particular  cell  type  are  not 
transcribed;  while  still  present,  they  remain  si- 
lent and  ineffective. 

Since  it  plays  such  a  central  role  in  regulating 
the  expression  of  genetic  information,  it  is  not 
surprising  that  the  study  of  transcription  is  one  of 
the  most  active  areas  in  modern  genetic  research 
(Figure  13).  Investigators  have  taken  two  success- 
ful approaches  in  an  effort  to  understand  this  pro- 
cess. One  involves  the  study  of  mutations  that 
disrupt  regulation  in  a  variety  of  organisms.  The 
other  is  biochemical  and  involves  isolating  spe- 
cific proteins  that  are  parts  of  the  transcription 
machinery  and  then  determining  their  mode  of 
action  at  a  molecular  level.  The  ultimate  goals  of 
both  approaches  are  to  provide  a  complete  ac- 
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count,  in  chemical  terms,  of  the  processing  of 
genetic  information  and  to  provide  a  sound  basis 
for  our  understanding  of  the  types  of  disturbances 
in  this  process  that  can  lead  to  various  genetic 
defects  and  many  diseases  in  adult  life. 

It  is  perhaps  worth  commenting  on  the  broad 
spectrum  of  organisms  that  are  currently  used  as 
models  in  genetic  investigations.  Yeast,  worms, 
fruit  flies,  bacteria,  viruses,  mice,  and  human  be- 
ings all  provide  instructive  paradigms.  For  exam- 
ple, because  the  structure  of  the  nervous  system 
is  relatively  well  understood  in  the  primitive 
worm,  Caenorhabditis  elegans,  it  has  proved  to 
be  particularly  useful  for  elucidating  the  genetic 
program  that  directs  the  formation  of  the  nervous 
system.  The  fruit  fly  Drosophila  melanogaster, 
which  has  an  uncommonly  rich  background  of 
genetic  variation,  has  proved  to  be  the  most  valu- 
able organism  for  studying  the  genes  involved  in 
early  embryonic  development.  Similarly,  the 
mouse,  with  a  generation  time  of  just  a  few 
months,  is  a  convenient  stand-in  for  genetic  ex- 
periments that  cannot  be  contemplated  in  most 
mammals  or  human  beings.  Thus  it  has  become 
an  everyday  technique  to  introduce  new  genes 
into  the  genetic  makeup  of  mice.  Many  of  the 
resulting  animals,  called  transgenic  mice,  have 
proved  to  be  powerful  models  for  certain  human 
diseases.  Transgenic  mice  that  are  genetically 
cancer-prone  or  that  have  sickle  cell  anemia  have 
been  generated  recently  and  are  proving  useful 
not  only  for  understanding  the  molecular  bases  of 
these  disorders  but  also  for  exploring  possible 
means  for  their  treatment.  Complementing  this 
approach  has  been  the  development  of  tech- 
niques that  allow  one  to  knock  out  specific  genes 
selectively  and  to  generate  mice  of  essentially  any 
specified  genotype.  The  power  of  this  new 
methodology  for  understanding  mammalian  de- 
velopment, for  studying  the  function  of  the  ner- 
vous system,  and  for  modeling  known  human 
diseases  has  already  captured  the  attention  of 
many  geneticists,  including  several  in  the 
HHMI  Genetics  Program.  At  the  other  end  of  the 
spectrum,  the  simple  baker's  yeast,  Saccharo- 
myces  cerevisiae,  is  proving  to  be  especially 
useful  for  cloning  large  segments  of  mamma- 
lian chromosomes.  Special  carrier  elements 
called  YACs  (yeast  artificial  chromosomes) 
have  been  developed  that  can  accommodate  up 
to  500,000  base  pairs  of  DNA.  Since  this  is  more 
than  20  times  as  large  as  the  fragments  that  have 
traditionally  been  cloned  in  bacteria  and  vi- 
ruses, this  approach  and  the  associated  method 


for  separating  large  DNA  fragments  are  begin- 
ning to  play  an  important  part  in  the  interna- 
tional efi'ort  to  map  the  human  genome. 

The  development  of  a  complete  map  of  the 
human  genome  is  one  of  the  great  challenges 
engaging  the  attention  of  geneticists  worldwide, 
including  several  HHMI  investigators.  Such  a  map 
will  permit  the  identification  of  genes  that  are 
close  to,  or  responsible  for,  a  large  number  of  as 
yet  uncharacterized  genetic  diseases.  Further- 
more, knowledge  of  the  map  will  allow  the  devel- 
opment of  easily  identifiable  genetic  markers  for 
specific  diseases.  These  genetic  markers,  called 
restriction  fragment  length  polymorphisms 
(RFLPs) ,  can  be  used  to  detect  carriers  of  many 
genetic  disorders,  to  determine  paternity,  and  to 
identify  individuals  for  forensic  purposes  (a  pro- 
cess commonly  referred  to  in  the  press  as  DNA 
fingerprinting) . 

Four  or  five  years  ago  an  extremely  sensitive 
technique  for  detecting  these  DNA  marker  frag- 
ments was  developed.  This  technique  (the  poly- 
merase chain  reaction  [PCR])  greatly  amplifies 
the  genetic  signal  and  has  made  these  studies 
both  simpler  and  faster.  With  this  and  other 
emerging  technologies,  the  structures  of  the  ge- 
nomes of  several  simple  bacteria  should  be 
known  in  the  near  future,  and  the  way  is  already 
clear  to  begin  the  systematic  structural  analysis  of 
the  genomes  of  a  number  of  more  complex  organ- 
isms, such  as  the  nematode  worm  C.  elegans  and 
the  fruit  fly  Drosophila.  It  is  confidently  pre- 
dicted that  with  further  improvements  in  the 
technology  for  sequencing  genes,  the  complete 
structure  of  the  human  genome  should  be  known 
within  15  or  20  years. 

While  the  new  genetics  has  opened  the  door  to 
an  understanding  of  an  important  range  of  funda- 
mental problems  such  as  the  mechanism  of  chro- 
mosome replication,  the  genetic  basis  of  embry- 
onic development,  and  the  control  of  gene 
regulation,  it  also  holds  promise  for  an  assault  on 
the  even  more  complex  problems  that  confront 
those  interested  in  human  disease.  Cancer  has  a 
clear  genetic  basis.  Specific  genes,  generally 
those  concerned  with  cell  growth  and  gene  regu- 
lation, have  been  shown  to  be  involved  in  many 
different  malignancies.  Other  disorders,  such  as 
heart  disease  and  hypertension,  have  clear  ge- 
netic components.  Even  distressing  behavioral 
disorders  like  manic  depression  and  schizophre- 
nia have  a  genetic  basis  that  may  offer  important 
clues  to  their  nature  and  perhaps,  in  time,  to  their 
treatment. 
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The  real  power  of  the  new  genetics  is  that  it 
allows  investigators  to  approach  these  complex 
problems  using  a  reductionist  approach.  If,  for 
instance,  a  gene  contributes  to  the  development 
of  a  behavioral  disorder,  it  is  reasonable  to  as- 
sume that  it  also  plays  an  important  role  in  pro- 
gramming normal  behavior.  By  identifying  such 
genes  and  understanding  the  chemistry  of  their 
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protein  products,  we  should  obtain  important  in- 
sights about  the  biological  basis  of  behavior. 
Thus  the  ultimate  promise  of  the  new  genetics  is 
that  it  will  help  us  to  understand  diseases  that  are 
not  commonly  thought  of  as  being  primarily  ge- 
netic and,  in  doing  so,  it  will  tell  us  much  about 
those  biological  processes  that  lie  at  the  very  core 
of  our  humanity. 
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Immunology  Program 

One  of  the  most  important  developments  that 
occurred  during  the  evolution  of  vertebrates  was 
the  emergence  of  protective  mechanisms  that  en- 
able animals  to  defend  themselves  against  inva- 
sion by  foreign  organisms,  especially  viruses,  bac- 
teria, and  a  number  of  disease-causing  parasites. 


These  defenses  employ  two  distinct  but  related 
strategies:  the  first  detects,  with  exquisite  sensi- 
tivity, the  presence  of  foreign  organisms,  cells,  or 
proteins;  the  second  involves  a  series  of  mecha- 
nisms that  act  cooperatively  to  rid  the  body  of  the 
invading  organisms  or,  at  the  least,  to  neutralize 
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their  potentially  harmful  effects.  The  task  of  ef- 
fecting both  strategies  falls  to  the  immune 
system. 

Recognizing  the  strategic  importance  of  the 
immune  system  in  both  health  and  disease,  the 
Institute  selected  Immunology  to  be  one  of  its 
earliest  research  programs.  The  wisdom  of  that 
decision  has  been  amply  borne  out  by  the  truly 
remarkable  progress  that  has  been  made  in  immu- 
nology in  the  past  two  decades.  With  the  notable 
exception  of  molecular  genetics,  no  field  of  bio- 
medical research  has  witnessed  such  an  astonish- 
ing series  of  successes  at  almost  every  level,  from 
understanding  the  immune  system's  unique  rec- 
ognition mechanisms  to  the  elucidation  of  the 
cellular  and  chemical  means  used  to  destroy  or 
neutralize  invading  organisms. 

The  body's  initial  line  of  defense  against  inva- 
sion by  foreign  organisms  is  the  continuously  pa- 
trolling system  of  macrophages  and  other  types 
of  blood-borne  phagocytic  cells  that  act  both  as 
an  early  warning  system  and  as  a  "first-strike"  de- 
fense. These  cells  respond  by  ingesting  and 
breaking  up  the  invading  organisms  and  by  re- 
leasing soluble  signaling  molecules  like  inter- 
leukin-1  that  serve,  among  other  things,  to  mobi- 
lize the  next  line  of  defense,  the  immune 
response  (Figure  14).  This  response  involves 
two  classes  of  lymphocytes,  called  T  and  B  cells, 
reflecting  their  origin  from  the  thymus  and  bone 
marrow,  respectively. 

The  first  step  in  the  immune  response  is  the 
activation  of  a  special  subclass  of  T  lympho- 
cytes called  helper  T  cells.  Macrophages  pres- 
ent fragments  of  foreign  proteins,  or  antigens, 
on  their  surfaces.  Recognition  of  these  antigens 
by  specialized  receptors  found  on  helper  T  cells 
then  initiates  the  two  responses:  a  cell-mediated 
immune  response  and  a  humoral  immune  re- 
sponse. The  cell-mediated  response  involves 
principally  the  stimulation  of  another  subclass  of 
T  lymphocytes  called  cytotoxic  T  cells  that  rec- 
ognize and  destroy  infected  cells.  The  humoral 
response,  on  the  other  hand,  involves  the  activa- 
tion of  the  second  major  class  of  lymphocytes,  the 
B  cells,  to  produce  circulating  antibodies.  Anti- 
bodies recognize  and  neutralize  soluble  antigens 
and  mark  cells  or  viruses  bearing  antigens  for  de- 
struction by  phagocytic  cells. 

One  of  the  central  problems  in  immunology 
concerns  the  way  in  which  lymphocytes  recog- 
nize antigens.  The  complexity  of  this  problem 
may  be  gauged  from  the  observation  that  humans 
and  other  higher  vertebrates  are  capable  of  form- 


ing antibodies  against  virtually  any  molecule  or 
part  of  a  molecule  (epitope),  including  even 
those  that  do  not  occur  naturally  but  are  chemi- 
cally synthesized  in  a  laboratory.  How  does  this 
occur?  And  how  does  the  immune  system  distin- 
guish foreign  molecules  from  those  produced  by 
its  own  cells?  In  a  word,  how  do  lymphocytes 
distinguish  self  from  non-self? 

The  key  to  the  first  issue,  as  we  now  know,  is  to 
be  found  in  the  almost  unlimited  variety  of  re- 
ceptors on  the  surfaces  of  lymphocytes.  The  dis- 
covery of  how  just  a  few  hundred  genes  are  capa- 
ble of  producing  such  extraordinary  receptor 
diversity  is  one  of  the  great  success  stories  of  mod- 
ern immunology.  The  essential  features  of  the  im- 
mune system's  capacity  for  generating  molecular 
diversity  can  be  summarized  briefly  by  stating 
that  lymphocyte  receptors,  like  antibodies,  are 
formed  by  pairs  of  protein  chains  that  are  chemi- 
cally linked  to  form  a  complex  receptor  struc- 
ture. Each  chain  of  the  pair  has  a  constant  domain 
and  a  variable  domain.  The  variable  domain  of 
the  two  chains  is  responsible  for  antigen  recogni- 
tion and  the  discrimination  between  self  and 
non-self.  The  constant  (invariant)  domain  is 
physically  linked  to  other  membrane  proteins  of 
the  receptor  complex  that  activates  the  lympho- 
cyte's internal  signaling  and  effector  mecha- 
nisms. T  and  B  cells  triggered  via  their  antigen 
receptors  respond  to  auxiliary  signaling  mole- 
cules by  proliferating  and  differentiating  to  a  ma- 
ture effector  stage.  In  the  case  of  B  cells,  the  matu- 
ration process  results  in  the  generation  of 
plasma  cells  that  produce  large  amounts  of  anti- 
body for  secretion  into  bodily  fluids,  chiefly  the 
bloodstream. 

The  complex  structure  of  the  variable  parts  of 
the  receptors  is  due  to  several  processes.  First, 
and  most  important,  the  genes  responsible  for 
this  portion  of  the  receptor  are  assembled  from  a 
large  number  of  different  gene  segments  (Figure 
15).  Each  gene  segment  exists  in  several — and  in 
some  cases  hundreds — of  different  copies.  These 
segments  randomly  recombine  to  form  new  genes 
that  encode  the  virtually  limitless  repertoire  of 
recognition  elements.  To  take  just  one  example, 
T  cells  form  their  receptors  by  combining  a  num- 
ber of  different  gene  sequences:  V  (variable),  D 
(diversity),  and  J  (joining)  segments.  From  this 
array  any  given  T  cell  derives  1  from  about  100 
possible  V  segments,  1  from  about  6  D  segments, 
and  1  from  about  50  J  segments  to  form  its  so- 
called  a-  or  heavy  chain,  and  about  1  in  20  V, 
1  in  2  D,  and  1  in  12  J  segments  to  form  its 
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Pathogens  Humoral  immunity 


Figure  14.  When  a  pathogen  invades  the  body,  the  immune  system  responds  with  three  types  of  reaction.  The  cells  of  the 
humoral  immune  system  ( B  cells )  secrete  antibodies  that  can  bind  to  the  pathogen.  Cells  of  the  cellular  system  (T  cells )  carry 
out  two  major  types  of  functions.  One  type  of  T  cell  (the  cytotoxic  T  cells,  CTL )  develops  the  ability  to  kill  pathogen-infected 
cells.  Helper  T  cells,  on  the  other  hand,  secrete  factors  (lymphokines)  that  stimulate  the  body's  overall  response. 

From  Molecular  Cell  Biology,  2nd  edition.  James  Darnell,  Harvey  Lodish,  and  David  Baltimore,  p.  1005.  Copyright  ®  1990 
by  Scientific  American  Books,  Inc.  Reprinted  with  permission  by  W.H.  Freeman  and  Company. 
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Figure  15.  The  specificity  of  antibodies  and  the  receptors  on  T  cells  is  brought  about  by  a  series  of  complex 
rearrangements  of  various  gene  segments  like  those  shown  in  this  figure. 

From  Molecular  Cell  Biology,  2nd  edition.  James  Darnell,  Harvey  Lodish,  and  David  Baltimore,  p.  1027. 
Copyright  ®  1990  by  Scientific  American  Books,  Inc.  Reprinted  with  permission  by  W.H.  Freeman  and 
Company. 
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Figure  16.  A:  A  molecular  model  of  an  antibody  mol- 
ecule. Each  amino  acid  is  represented  by  a  small 
sphere.  The  heavy  chains  are  colored  blue,  the  light 
chains  red.  The  four  chains  wind  about  one  another 
to  form  a  Y  shape,  with  two  identical  antigen-bind- 
ing sites  at  the  arms  of  the  Y  and  a  tail  region  that 
serves  to  direct  the  antibody  to  a  particular  portion 
of  the  immune  response. 

B:  A  schematic  drawing  of  an  antibody  molecule. 
Each  molecule  is  composed  of  two  identical  light  (L ) 
chains  and  two  identical  heavy  (H)  chains.  Carbo- 
hydrate is  sometimes  associated  with  the  H  chain. 
While  the  antigen-binding  sites  are  formed  by  a  com- 
plex of  both  H  and  L  chains,  the  tail  region  is  formed 
by  H  chains  alone. 

From  Raven,  P.H.,  and  Johnson,  G.B.  1988.  Under- 
standing Biology.  St.  Louis,  MO:  Times  Mirror/Mosby 
College  Publishing,  p.  692. 
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Figure  17.  Contemporary  model  of  immune  system  development  in  mammals.  T  and  B  cells,  which  form  the  major 
recognition  and  effector  limbs  of  the  immune  system,  are  derived  from  multipotent  stem  cells  that  also  give  rise  to  the 
other  types  of  blood  cells.  Definition  of  the  T  cell  receptors  and  accessory  cell  surface  molecules  has  allowed  the 
identification  of  two  major  sublineages  of  the  thymus- derived  Tcells,  each  of  which  expresses  a  different  type  of  T  cell 
receptor  and  exhibits  specialized  function.  The  B  lineage  cells,  generated  in  hemopoietic  tissues,  initially  express  one 
class  of  antibody  receptors  called  immunoglobulin  M  (IgM),  but  may  then  switch  to  the  expression  of  other  antibody 
classes,  each  of  which  has  special  biological  advantages.  Note  the  third  lineage  of  lymphocytes,  the  natural  killer 
cells.  They  are  the  most  recently  recognized  members  of  the  lymphocyte  family,  but  may  well  prove  to  be  the  most 
phylogenetically  ancient  elements  in  this  constellation  of  cells  involved  in  host  defense. 
Courtesy  of  Max  D.  Cooper. 
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Figure  18.  A  schematic  representation  of 
how  the  thymus  gland  selects  T cells  that  are 
able  to  recognize  foreign  but  not  self 
proteins. 

Prom  Molecular  Cell  Biology,  2nd  edition. 
James  Darnell,  Harvey  Lodish,  and  David 
Baltimore,  p.  1037.  Copyright  ®  1990  by 
Scientific  American  Books,  Inc.  Reprinted 
with  permission  by  W.H.  Freeman  and 
Company. 
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light  chain.  The  random  recombination  of  V,  D, 
and  J  segments  in  the  two  chains  can  thus  code  for 
literally  millions  of  different  possible  receptor 
structures.  The  receptors  on  B  cells  are  formed  in 
basically  the  same  way,  although  the  numbers 
of  V,  D,  and  J  segments  available  for  selection 
and  recombination  differ  somewhat.  Antibodies 
(which  are  secreted  by  plasma  cells)  are  gener- 
ated in  much  the  same  way  as  their  receptors,  and 
have  the  same  almost  unlimited  capacity  for  di- 
versity (Figure  16). 

At  one  time  it  was  thought  that  antigens  were 
capable  of  shaping  the  structure  of  lymphocyte 
receptors  and  antibodies  so  that  the  binding  sites 
of  receptors  and  antibodies  would  mold  them- 
selves in  some  way  to  fit  the  shape  of  the  antigen, 
much  as  a  rubber  glove  molds  itself  to  fit  one's 
hand.  We  now  know  that  this  "instructional"  hy- 
pothesis is  wrong.  Rather,  as  indicated  above,  the 
immune  system  produces  very  large  numbers  of 
different  types  of  receptors  and  antibodies,  and 
collectively  these  can  "fit"  essentially  every  pos- 
sible antigen  (Figure  1 7) .  Each  T  or  B  cell  bears 
only  one  type  of  receptor  on  its  surface  (although 
there  are  thousands  of  receptor  molecules  of  that 
given  structure  on  each  cell).  In  the  same  way, 
each  B  cell  secretes  antibodies  of  only  a  single, 
defined  structure.  Thus  the  capacity  of  the  body 
to  respond  to  an  enormous  variety  of  different 
antigens  is  due  to  the  existence  of  an  enormous 
number  of  different  T  or  B  cells,  each  able  to  rec- 
ognize a  single  antigen  (or  more  commonly,  a 
part  of  a  complex  antigen  known  as  an  antigenic 
determinant).  And  when  the  cell  recognizes 
and  binds  to  an  antigen,  it  responds  by  proliferat- 
ing to  form  a  large  number  of  cells  of  the  same 
type.  Such  a  population  of  cells,  all  derived  from 
a  single  progenitor,  is  known  as  a  clone,  and  the 
hypothesis  put  forward  to  account  for  the  selec- 
tive proliferation  of  lymphocytes  of  particular  re- 
ceptor type  in  response  to  a  specific  antigen  is 
known  as  the  clonal  selection  hypothesis.  This 
theory,  first  advanced  by  Sir  F.  Macfarlane  Burnet, 
has  withstood  every  test  and  is  rightly  viewed  as 
one  of  the  cornerstones  of  modern  immunology. 

The  intriguing  question  as  to  how  the  cells  of 
the  immune  system  distinguish  foreign  mole- 
cules from  those  on  the  surface  of  the  cells  of 
their  own  host  was  also  addressed  by  Burnet.  He 
suggested — and  there  is  now  a  large  body  of  evi- 
dence to  support  this  view — that  lymphocytes 
that  recognize  the  body's  own  tissues  (so-called 
self  antigens)  are  selectively  eliminated  during 
early  development — a  process  known  as  clonal 


deletion.  The  mechanisms  responsible  for  clonal 
deletion  of  T  and  B  cells  are  still  under  investiga- 
tion, but  at  least  for  the  T  cells  it  appears  that 
during  development  the  thymus  may  actively  se- 
lect, for  export  to  the  rest  of  the  immune  system, 
only  those  T  cells  that  are  capable  of  functioning 
in  the  host  (Figure  18).  These  "useful"  T  cells 
are  allowed  to  survive  and  mature,  while  the  po- 
tentially harmful  cells  die  and  are  removed.  This 
process  results  in  the  death  of  about  90  percent 
or  more  of  the  T  cells  that  are  initially  formed. 
The  bone  marrow  selection  of  B  cells  for  survival 
may  be  equally  stringent.  Although  this  seems  an 
astonishingly  wasteful  process,  comparable  cell 
deaths  are  known  to  be  a  rather  common  feature 
in  the  development  of  virtually  all  organs  and  of 
all  multicellular  organisms. 

For  reasons  that  remain  to  be  elucidated,  in 
some  conditions — commonly  referred  to  as  au- 
toimmune disorders — the  immune  system  may 
mistakenly  mount  an  attack  on  components  of  the 
host  organism's  own  cells.  For  example,  the  neu- 
rological condition  myasthenia  gravis  involves 
the  production  of  circulating  antibodies  directed 
against  the  receptor  molecules  on  the  surfaces  of 
muscle  cells  that  normally  enable  them  to  re- 
spond to  the  release  of  the  neural  transmitter  ace- 
tylcholine from  the  motor  nerves.  When  the  re- 
ceptor molecules  are  damaged  or  destroyed, 
there  is  a  progressive  loss  of  neuromuscular  con- 
trol, and  if  the  respiratory  muscles  are  involved 
the  condition  may  be  fatal.  Similarly,  type  I  or 
juvenile-onset  diabetes  is  now  known  to  be  due 
to  the  combined  attack  of  T  cells  and  antibodies 
directed  against  the  jS-cells  of  the  pancreas  that 
normally  produce  insulin,  the  hormone  that  regu- 
lates sugar  metabolism. 

Another  topic  of  considerable  current  interest 
in  immunology  concerns  antigen  presenta- 
tion. We  now  know  that  this  involves  a  complex 
set  of  genes  called  the  major  histocompatibil- 
ity complex  (MHC).  The  membrane  proteins 
encoded  by  these  genes  (of  which  there  are  two 
types  called  class  I  and  class  II)  are  able  to  selec- 
tively bind  short  segments  of  partially  digested 
protein  antigens,  termed  peptides.  These  pep- 
tides, arising  from  protein  breakdown  inside  the 
cell,  reach  the  surface  of  the  cell  together  with 
the  relevant  MHC  molecule.  Recent  x-ray  crystal- 
lographic  studies  indicate  that  the  peptide  anti- 
gen is  lodged  within  a  distinctive  groove  on  the 
outer  surface  of  the  MHC  molecule,  where  it  can 
be  detected  by  a  lymphocyte  bearing  the  appro- 
priate receptor.  The  receptors  on  T  cells  are  spe- 
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cialized  to  recognize  antigens  only  in  the  form  of 
such  a  peptide:MHC  complex.  There  are  acces- 
sory molecules  on  the  surfaces  of  T  cells,  called 
CD4  and  CDS,  that  are  selectively  expressed  on 
cells  that  recognize  antigens  presented  by  MHC II 
and  MHC  I  molecules,  respectively.  Functionally 
these  accessory  molecules  form  part  of  the  T  cell 
receptor  for  peptide:MHC  complexes  by  binding 
to  both  the  MHC  molecule  and  the  T  ceil  recep- 
tor. For  this  reason  CD4  and  CDS  are  sometimes 
called  co-receptors. 

Like  the  antigen  receptors  on  B  and  T  cells, 
MHC  molecules  show  considerable  diversity. 
However,  this  diversity  is  not  due  to  the  recombi- 
nation of  different  gene  segments  but  rather  to 
genetic  polymorphism.  There  may  be  as  many  as 
100  different  genetic  sequences  (alleles)  at  a 
single  MHC  locus,  and  T  cells  are  selected  during 
development  only  if  they  can  recognize  peptides 
presented  by  self  MHC  molecules.  How  this  oc- 
curs is  unknown,  but  its  role  in  T  cell  selection  is 
obviously  fundamental. 

The  CD4-bearing  T  cells  (also  called  T4  cells, 
helper  lymphocytes,  or  CD4"^  cells)  have  become 
widely  known  because  of  their  role  in  the  devel- 
opment of  AIDS  (acquired  immune  deficiency 
syndrome) .  The  virus  that  causes  AIDS — the  hu- 
man immunodeficiency  virus  (or  HIV) — selec- 
tively invades  these  cells,  because  the  CD4^  mole- 
cule fortuitously  serves  also  as  a  specific  receptor 
for  the  virus  (Figure  1 9)  •  On  entering  the  CD4^  T 
cells,  the  genetic  material  of  the  virus  (which  is 
formed  of  RNA)  becomes  transcribed  into  DNA, 
and  this,  in  turn,  becomes  integrated  into  the  T 
cell's  own  genome.  In  this  way  the  virus  subverts 
the  cell's  genetic  machinery  and,  when  activated, 
the  cell  produces  more  and  more  virus,  which 
ultimately  kills  the  cell.  When  the  cell  dies  it  re- 
leases virus  into  the  bodily  fluids,  where  it  is  free 
to  invade  other  CD4^  T  cells,  and  the  whole  pro- 
cess may  be  repeated  until  finally  the  entire  T  cell 
population  is  depleted.  Since,  as  we  have  seen, 
these  cells  are  essential  for  mounting  both  cell- 
mediated  and  humoral  immune  responses,  pa- 
tients with  AIDS  become  progressively  more  vul- 
nerable to  all  forms  of  infection  and  commonly 
succumb  to  opportunistic  infections  that  would 
normally  be  easily  overcome. 

A  second  important  component  of  the  immune 
system  is  the  complement  system,  which  con- 
sists of  a  complex  series  of  serum  and  cell  mem- 
brane proteins  (Figure  20).  These  proteins  per- 
form essential  roles  in  the  immune  response  to 
foreign  organisms  such  as  bacteria  and  viruses. 


and  in  the  response  to  tumors.  Deficiencies  of  any 
of  the  complement  proteins  may  lead  to  diseases, 
including  those  that  involve  infection,  hemolysis 
of  red  blood  cells,  or  autoimmune  diseases  such 
as  systemic  lupus  erythematosus. 

The  complement  system  is  activated  by  two 
general  mechanisms.  First,  antibodies  (Ab)  can 
activate  complement  when  they  bind  their  anti- 
gen (Ag).  In  addition  to  this  so-called  classical 
pathway  there  is  an  alternative  pathway  that  is 
continuously  active  at  a  low  level  marking  for- 
eign organisms  for  which  there  are  no  preformed 
antibodies  available. 

In  addition  to  these  roles,  complement  pro- 
teins help  to  regulate  the  immune  system  by  an- 
other mechanism.  This  involves  the  interaction  of 
specific  activated  complement  protein  fragments 
with  receptors,  or  binding  proteins,  that  are  on 
the  surface  of  immune  system  cells.  These  recep- 
tors allow  for  communication  with  the  interior  of 
the  cell,  and  their  activation  leads  to  a  change  in 
the  function  or  fate  of  the  cell. 

Overall,  the  complement  system  plays  a  funda- 
mental role  in  normal  or  abnormal  immune  re- 
sponses. Current  study  in  this  area  is  directed  to- 
ward understanding  not  only  the  molecular 
mechanisms  of  complement  activation  and  regu- 
lation but  also  the  general  effects  on  the  immune 
response  of  experimentally  altering  complement 
function. 

The  devastating  consequences  of  AIDS,  congen- 
ital immunodeficiency  disorders,  and  the  fre- 
quent rejection  of  transplanted  organs  have  made 
the  public  more  aware  of  the  importance  of  the 
immune  system  in  medical  practice.  The  develop- 
ment of  immunosuppressive  drugs  has  gone  a 
long  way  toward  overcoming  the  problem  of  tis- 
sue rejection,  and  there  is  now  considerable  inter- 
est in  the  possible  development  of  vaccines  to 
limit  the  spread  of  HIV.  Many  of  us  remember 
how  some  30  years  ago  poliomyelitis  was  to  all 
intents  and  purposes  eliminated  in  this  country 
by  the  development  of  the  Salk  and  Sabin  vac- 
cines, and  we  are  all  conscious  of  the  fact  that 
many  illnesses  such  as  measles,  rubella,  whoop- 
ing cough,  and  even  smallpox  have  been  brought 
under  control.  But  it  is  still  not  widely  appre- 
ciated that  the  immune  system  is  itself  subject  to 
a  number  of  serious  disorders  such  as  lymphoma 
and  leukemia.  Our  ability  to  deal  with  these  ma- 
lignant conditions  is  still  very  limited,  but  we  are 
beginning  to  understand  what  may  cause  them. 
While  these  disorders  present  the  most  urgent 
challenges  to  clinical  immunology,  even  rela- 
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Figure  19-  Stereoview  of  a  region  on  the  macromolecule  CD4,  the  T  cell  receptor  for  HIV.  CD4 
contains  four  tandem  immunoglobulin- like  domains,  D1-D4.  The  view  represents  Dl's  major 
binding  region  for  the  virus  coat  glycoprotein  gpl20.  Atoms  of  residues  41-59  are  shown  in  red. 
They  are  enveloped  by  the  surface  in  contact  with  a  water  molecule  probe.  This  figure  was  produced 
with  the  display  program  QUANTA. 

Reprinted  by  permission  from  Ryu,  S.-E.,  Kwong,  P.D.,  Truneh,  A.,  Porter,  T.G.,  Arthos,/.,  Rosen 
berg,  M.,  Dai,  X.,  Xuong,  N.-h.,  Axel,  R.,  Sweet,  R.W.,  and Hendrickson,  W.A.  Nature  348:419-426. 
Copyright  ®  1990  Macmillan  Magazines  Ltd. 
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Figure  20.  The  complement  system.  Ag,  antigen;  Ab,  antibodies. 
Courtesy  of  V.  Michael  Holers. 
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tively  minor  allergic  disorders  continue  to  pose 
problems  both  for  the  practicing  physician  and 
for  the  patients  who  suffer  from  them.  Many  of 

Investigators  in  the  Immunology  Program 
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Neuroscience  Program 

Among  the  most  challenging  problems  in  bio- 
medical research  are  those  posed  by  the  human 
brain.  How  do  we  perceive  the  world  around  us? 
How  do  we  learn  from  past  experiences?  How  do 
we  store  and  recall  information  derived  from 
those  experiences?  How  do  we  determine  when 
to  act  and  what  actions  to  carry  out?  What  is 
thought?  And  what  are  the  neural  mechanisms 
that  underlie  language?  In  a  word,  how  are  all 
those  aspects  of  our  lives  that  most  specifically 
define  our  humanity  instantiated  in  the  function- 
ing of  our  brains?  The  answers  to  these  questions 
still  lie  far  in  the  future,  but  in  the  past  two  de- 
cades considerable  progress  has  been  made  in  our 
understanding  of  some  of  the  cellular  and  molec- 
ular mechanisms  involved  in  brain  function.  Rec- 
ognizing this,  in  1983  the  Institute  initiated  its 
Neuroscience  Program,  which,  until  recently, 
has  been  largely  focused  on  the  ways  in  which 
nerve  cells  conduct  signals  and  communicate 
with  each  other  and  with  the  effector  tissues  of 
the  body  (such  as  muscle  and  gland  cells)  and  on 
the  cellular  mechanisms  involved  in  the  develop- 
ment of  the  nervous  system. 

Modern  neuroscience  is  founded  on  two  funda- 
mental concepts  that  derive  from  the  late  nine- 


the  reports  in  this  volume  indicate  how  these  and 
other  problems  associated  with  the  immune  sys- 
tem are  currently  being  addressed. 
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teenth  and  the  early  years  of  the  twentieth  cen- 
tury. The  first  of  these,  commonly  referred  to  as 
the  neuron  doctrine,  is  that  the  fundamental 
functioning  units  of  the  nervous  system  are  nerve 
cells,  or  neurons.  Among  the  cells  of  the  body, 
neurons  are  distinguished  anatomically  by  the 
fact  that  they  all  extend  processes  (some  of  con- 
siderable length)  that  are  of  two  general  types: 
shorter  tapering  processes  (dendrites)  that 
mainly  serve  to  receive  information  from  other 
cells,  and  longer  processes  (axons),  of  more  uni- 
form diameter,  that  serve  to  transmit  information 
to  other  parts  of  the  nervous  system  or  to  the  body 
at  large.  The  second  basic  concept  is  that  infor- 
mation in  the  nervous  system  is  principally  en- 
coded in  a  series  of  signals  called  nerve  im- 
pulses, or  action  potentials.  These  are  brief, 
usually  all-or-nothing  electrical  changes  in  the 
nerve  cell  membrane  that  are  propagated  along 
the  axons  at  rates  between  about  3  and  300  feet 
per  second.  A  necessary  corollary  of  this  concept 
is  the  notion  that  nerve  cells  communicate  this 
encoded  information  to  each  other  at  specific 
sites  called  synapses,  where  the  axon  of  one  cell 
functionally  interacts  with  the  dendrites  or  the 
bodies  of  other  neurons. 
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The  essential  morphological  features  of  neu- 
rons were  established  in  the  1870s  and  1880s, 
with  the  aid  of  a  number  of  selective  staining  pro- 
cedures, notably  the  metallic  impregnation  tech- 
nique developed  by  the  Italian  microscopist  Ca- 
millo  Golgi.  And  the  fundamental  principle  of 
the  neuron  doctrine,  namely,  that  nerve  cells  are 
anatomically  and  functionally  discrete  entities, 
was  convincingly  demonstrated  around  the  turn 
of  the  century  by  the  great  Spanish  neurohistolo- 
gist,  Santiago  Ramon  y  Cajal.  The  biophysical 
mechanisms  responsible  for  the  nerve  impulse 
and  for  synaptic  transmission  were  established  in 
the  early  1950s,  principally  through  the  work  of 
Hodgkin,  Huxley,  Katz,  and  Eccles  (Figures  21, 
22,  and  23).  In  brief,  activation  of  a  nerve  cell 
results  in  the  successive  opening  of  pores  or  ion 
channels  along  the  length  of  the  axon  that  result 
in  the  temporary  reversal  of  the  voltage  between 
the  inside  and  the  outside  of  the  axon  (this  tran- 
sient change  in  potential  is  the  action  potential; 
see  Figure  7) .  When  the  action  potential  reaches 
the  ends  of  the  axon  it  causes  the  release  of  a 
neurotransmitter  that  diffuses  across  the  micro- 
scopic gap  between  the  axon  terminal  and  the 
postsynaptic  cell.  The  binding  of  the  neurotrans- 
mitter to  specialized  receptors  in  the  membrane 
of  the  postsynaptic  cell  in  turn  triggers  a  response 
in  that  cell  which  may  either  be  the  opening  of  an 
ion  channel  or  the  activation  of  a  second  intracel- 
lular messenger  in  the  cell.  In  either  case  the 
binding  of  the  transmitter  to  the  receptor  is  re- 
flected in  the  generation  of  a  graded  voltage 
change  across  the  membrane  of  the  postsynaptic 
cell,  called  a  synaptic  potential.  Depending  on 
the  nature  of  the  transmitter  receptor,  the  re- 
sponse may  be  either  excitatory  or  inhibitory; 
i.e.,  the  postsynaptic  cell  may  either  be  activated 
or  rendered  less  likely  to  discharge  an  impulse. 
Finally,  the  released  neurotransmitter  is  either 
broken  down  by  a  specific  enzyme  within  the 
synaptic  cleft,  or  taken  up  by  selective  transport 
mechanisms  into  the  axon  terminal  (where  it  can 
be  reutilized)  or  into  the  surrounding  nonneural 
(glial)  cells. 

In  the  past  10  years  we  have  learned  a  good 
deal  about  the  molecular  mechanisms  involved 
in  both  impulse  conduction  and  synaptic  trans- 
mission, largely  as  the  result  of  the  successful 
cloning  of  the  genes  for  a  number  of  the  ion  chan- 
nels involved  (e.g.,  for  Na"*^,  K^,  and  Ca^"^)  and  for 
many  neurotransmitter  receptors  like  those  for 
acetylcholine,  glutamate,  7-aminobutyric  acid 
(GABA),  serotonin,  norepinephrine,  dopamine. 


and  various  peptides.  From  the  nucleotide  se- 
quence of  these  genes  it  has  been  possible  not 
only  to  deduce  the  primary  amino  acid  sequence 
of  the  channel  or  transmitter  proteins  (and  from 
this  to  infer  the  probable  arrangement  of  the  rele- 
vant protein  in  the  membrane)  but  also  to  gener- 
ate hybridization  probes  to  identify  other  related 
channels  or  receptors.  And  using  some  of  the  es- 
tablished techniques  of  genetic  engineering,  like 
site-directed  mutagenesis,  it  has  been  possible  in 
some  cases  to  establish  the  regions  within  the 
channel  molecules  that  are  sensitive  to  changes 
in  voltage  (Figure  24),  or  the  ligand-binding  and 
second  messenger-activating  domains  of  recep- 
tors. One  example  will  suffice  to  demonstrate  the 
importance  of  this  approach  to  our  understand- 
ing of  these  fundamental  processes. 

It  has  been  known  for  almost  50  years  that  the 
relatively  simple  molecule  acetylcholine  is  the 
transmitter  at  the  junctional  region  between  mo- 
tor nerve  fibers  and  muscle  cells  and  also  at  cer- 
tain synapses  in  the  brain  and  spinal  cord.  With 
the  discovery  in  the  1970s  that  the  clinical  con- 
dition myasthenia  gravis  (previously  discussed 
in  the  immunology  section)  is  caused  by  circulat- 
ing antibodies  directed  against  the  receptor  for 
acetylcholine  in  the  muscle  membrane,  a  major 
effort  was  mounted  to  purify  and  biochemically 
characterize  the  acetylcholine  receptor  (AChR) . 
This  work  served  to  establish  that  the  AChR  con- 
sists of  five  subunits:  two  designated  a,  and  one 
each  called  ^,  7,  and  b. 

In  the  early  1980s  Heinemann,  Patrick,  and 
their  colleagues  succeeded  in  cloning  the  genes 
for  the  a-subunit,  and  in  1983  Numa  and  his  co- 
workers presented  the  complete  nucleotide  se- 
quences encoding  all  four  kinds  of  subunits. 
From  these  sequences  we  gained  several  impor- 
tant insights.  First,  the  four  subunits  showed  a 
high  degree  of  homology,  which  suggested  that 
their  genes  were  probably  derived — by  duplica- 
tion and  divergence — from  a  single  ancestral 
gene.  Second,  the  similarities  in  the  predicted 
amino  acid  sequences  suggested  that  the  subunits 
are  probably  arranged  to  form  a  central  pore  or 
channel  between  them.  Third,  the  presence  in 
each  subunit  of  four  hydrophobic  regions,  each 
about  20  amino  acids  long,  immediately  sug- 
gested the  probable  disposition  of  the  subunits, 
with  four  transmembrane  domains  (Ml,  M2,  M3, 
and  M4)  and  intervening  intra-  and  extracellular 
linking  segments.  More  recent  work  has  identi- 
fied a  fifth  subunit  type  f  and  has  established  the 
precise  location  of  the  acetylcholine-binding  site 
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Figure  21.  A  schematic  diagram  of  a  typi- 
cal neuron.  The  arrows  indicate  the  di- 
rection in  which  nerve  signals  are  con- 
veyed. The  largest  axons  in  the  human 
brain  and  spinal  cord  extend  for  about 
three  feet  and  have  a  diameter  of  less 
than  1/1, 000th  of  an  inch.  Many  axons 
are  covered  by  an  insulating  layer 
known  as  the  myelin  sheath.  The  myelin 
sheath  is  interrupted  at  intervals  known 
as  nodes  of  Ranvier. 

Prom  Alberts,  B.,  Bray,  D.,  Lewis,  J., 
Raff,  M.,  Roberts,  K.,  and  Watson,  J.D. 
1989.  Molecular  Biology  of  the  Cell,  2nd 
edition.  New  York:  Garland,  p.  1061. 
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Figure  22.  A  few  of  the  many  types  of  neu- 
rons in  the  vertebrate  nervous  system. 
(S.  Ramon  y  Cajal,  Histologic  du  Systeme 
Nerveux  de  I'Homme  et  des  Vertebres. 
Paris:  Maloine,  1901-191 1;  reprinted, 
Madrid:  C.S.I.C.,  1972.) 

From  Alberts,  B.,  Bray,  D.,  Lewis,  J., 
Raff,  M.,  Roberts,  K.,  and  Watson,  J.D. 
1989.  Molecular  Biology  of  the  Cell,  2nd 
edition.  New  York:  Garland,  p.  1061. 
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Figure  23-  A:  The  anatomical  arrangement  of  a  typical  reflex,  in  this  case  the  knee  jerk.  Each  cell  actually  represents  a  population 
of  many  neurons.  Information  about  stretch  of  the  quadriceps  femoris  muscle  is  conveyed  by  afferent  neurons  to  several  loci 
within  the  central  nervous  system.  In  the  spinal  cord,  afferent  neurons  act  directly  on  the  motor  neurons  to  the  quadriceps  and, 
by  means  of  inhibitory  interneurons,  indirectly  on  the  motor  neurons  to  the  antagonistic  muscle,  the  biceps.  Both  of  these 
actions  combine  to  produce  the  coordinated  expression  of  the  reflex  behavior.  In  addition,  information  is  conveyed  to  higher 
regions  of  brain  to  update  them  about  the  information  coming  into  the  nervous  system  and  about  the  behavior  that  is  being 
generated.  These  higher  centers,  in  turn,  can  act  to  modify  the  reflex 
behavior. 

B:  The  sequences  of  signaling  changes  that  produce  the  reflex  action. 
Graded  stretch  of  the  muscle  produces  a  graded  receptor  potential  in  the 
muscle  spindle  of  the  afferent  neuron  that  propagates  passively  to  the  trig- 
ger zone  at  the  first  node  of  Ranvier.  If  the  potential  is  sufficiently  large,  it 
will  trigger  an  action  potential  that  will  propagate  actively  along  the  axon 
to  the  terminal  region.  At  the  terminal  the  change  in  membrane  potential, 
produced  by  the  action  potential,  gives  rise  to  a  secretory  potential  that 
leads  to  the  release  of  transmitter  substance.  The  transmitter  diffuses 
across  the  synaptic  cleft  and  interacts  with  receptor  molecules  on  the  mem- 
brane of  the  postsynaptic  motor  cell  to  initiate  a  synaptic  potential.  The 
synaptic  potential  then  propagates  passively  to  the  initial  segment  of  the 
axon,  where  it,  in  turn,  initiates  an  action  potential  that  propagates  to  the 
terminals  of  the  motor  neuron.  This  action  potential  leads  ultimately  to  a 
synaptic  potential  in  the  muscle,  which  initiates  an  action  potential  that 
causes  the  contraction  of  the  muscle. 

Reprinted  by  permission  of  the  publisher  from  Kandel,  E.R.,  and 
Schwartz,  J.H.  Principles  of  Neural  Science,  2nd  edition,  p.  20.  Copyright 
1985  by  Elsevier  Science  Publishing  Co.,  Inc. 
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Figure  24.  A  local  reduction  of  the  voltage  differential  may 
induce  a  sodium  or  potassium  channel  to  change  confor- 
mation from  one  allosteric  form  to  another,  thus  opening 
the  channel  and  leading  to  the  free  passage  of  ions.  This 
change  is  presumably  caused  by  key  electrically  charged 
amino  acids  in  the  channel  protein,  which  shift  their  orien- 
tation in  response  to  the  changed  electrical  field. 

From  Discovering  Enzymes,  by  David  Dressier  and  Hun- 
tington Potter,  p.  233-  Copyright®  1991  by  David  Dressier. 
Reprinted  with  permission  by  W.H.  Freeman  and  Company. 
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and  the  existence  of  a  family  of  nicotinic  AChRs 
in  the  central  nervous  system;  much  has  also  been 
learned  about  the  regulation  of  the  receptor  dur- 
ing muscle  development  and  after  denervation, 
and  about  the  process  of  receptor  desensitization. 

The  nicotinic  AChR  belongs  to  a  large  class  of 
neurotransmitter  receptors  that  operate  by  selec- 
tively opening  ion  channels.  Another,  somewhat 
larger  class  of  receptors  acts  through  second  mes- 
sengers. For  example,  the  adrenergic  receptors, 
which  are  responsible  for  controlling  a  number 
of  vital  functions  such  as  heart  rate  and  blood 
pressure,  act  through  the  intermediary  of  a  class 
of  so-called  G  proteins  to  activate  the  enzyme 
adenylate  cyclase  and  increase  the  intracellular 
level  of  the  important  second  messenger,  cAMP. 
The  second  messenger,  in  turn,  usually  acts  by 
stimulating  protein  kinases  that  modify  (by  add- 
ing phosphate  groups)  other  proteins,  including 
ion  channels  and  proteins  that  regulate  gene  ex- 
pression in  the  responding  cell. 

The  regulation  of  gene  expression  by  synapti- 
cally  mediated  second  messenger  systems  has  be- 
come one  of  the  most  active  areas  for  research  in 
molecular  neuroscience.  Whereas  neurotransmit- 
ters usually  result  in  changes  that  have  a  time 
course  measured  in  the  millisecond  to  second 
range,  many  of  the  most  intriguing  phenomena  in 
neuroscience  are  those  that  occur  over  periods  of 
hours,  days,  and  even  months  or  years.  Recent 
work  has  demonstrated  that  in  addition  to  their 
more  or  less  immediate  and  short-lasting  effects, 
under  appropriate  conditions  (such  as  those  we 
commonly  associate  with  learning  and  memory) 
neurotransmitters  may,  through  second  messen- 
gers, activate  a  number  of  transcriptional  regula- 
tory proteins  that  "turn  on"  various  classes  of 
genes.  These,  in  turn,  may  regulate  the  expres- 
sion of  yet  other  genes  and  thus  unleash  a  com- 
plex cascade  of  events  within  the  responding 
nerve  cell,  modifying  its  growth  and  altering  its 
responsiveness  to  later  neurotransmitter  activa- 
tion over  long  periods  of  time. 

One  of  the  major  beneficiaries  of  the  applica- 
tion of  the  new  genetics  to  the  nervous  system  has 
been  the  field  of  developmental  neuroscience. 
Indeed  it  is  no  exaggeration  to  say  that  since  the 
late  1970s  this  field  has  been  transformed  from 
an  essentially  descriptive  science  into  one  in 
which,  for  the  first  time,  mechanistic  explana- 
tions are  emerging  to  account  for  the  growth  of 
nerve  cells  and  their  processes,  for  the  deploy- 
ment of  cells  into  peripheral  ganglia  and  within 
cortical  layers  or  nuclear  groups  in  the  central 


nervous  system,  for  the  formation  of  specific  pat- 
terns of  connections,  and  for  the  elimination  of 
redundant  cells  and  inappropriate  connections. 

Because  of  the  complexity  and  inaccessibility 
of  the  mammalian  central  nervous  system,  until 
recently  much  of  the  most  definitive  work  on 
neural  development  has  been  carried  out  in 
simpler  forms  such  as  the  nematode  C.  elegans 
and  the  fruit  fly  Drosophila.  It  is  difficult  to  sum- 
marize the  broad  sweep  of  this  work,  except  to 
say  that  it  has  served  to  clarify  the  genetic  mecha- 
nisms that  determine  the  distinct  front-to-back 
and  top-to-bottom  organization  of  all  developing 
organisms,  that  determine  not  only  which  cells 
will  become  neurons  but  also  how  many  neurons 
will  be  generated  and  what  type  they  will  be 
(e.g.,  sensory  cells,  interneurons,  or  motor 
cells),  and  that  determine  finally  whether  the 
neurons  that  are  initially  formed  will  survive.  In 
some  instances  it  is  clear  that  the  character  or 
phenotype  of  the  nerve  cells  is  determined  by 
their  lineage;  in  other  cases  cell-cell  interactions 
are  more  important,  and  the  nature  of  the  signals 
that  developing  cells  transmit  to  their  neighbors 
is  currently  being  elucidated.  Of  special  impor- 
tance are  the  molecules  on  the  surfaces  of  cells 
that  enable  them  to  recognize  and  aggregate  with 
other  cells  of  like  kind  or  enable  them  to  migrate 
along  other  cells  or  across  territories  filled  with 
extracellular  matrix  materials.  While  much  re- 
mains to  be  discovered,  the  first  fruits  of  this  har- 
vest hold  great  promise  for  future  progress  in  this 
important  field. 

Finally,  no  account  of  progress  in  molecular 
neuroscience  would  be  complete  without  refer- 
ence to  the  striking  developments  in  our  under- 
standing of  the  basis  of  some  of  the  major  genetic 
disorders  that  affect  the  nervous  and  related  mus- 
cular systems.  Perhaps  the  most  striking  of  these 
developments  has  been  the  cloning  of  the  gene 
for  Duchenne  and  Becker  muscular  dystrophy. 
These  are  X-linked  recessive  disorders  that,  in  the 
more  severe  (Duchenne)  form  lead  inexorably 
from  muscular  weakness  to  muscular  atrophy  and 
finally  death.  The  extreme  size  of  this  gene  (it 
comprises  about  1  percent  of  the  X  chromosome 
and  almost  0.1  percent  of  the  total  human  ge- 
nome) renders  it  especially  vulnerable  to  muta- 
tion, and  in  many  of  the  identified  mutations,  the 
protein  encoded  by  the  gene,  dystrophin  (which 
appears  to  be  critical  for  coupling  muscle  excita- 
tion and  contraction),  is  either  absent  or  mark- 
edly deficient. 

Some  years  ago  the  general  location  of  the  gene 


Iv 


Introduction 


responsible  for  the  severe  neurological  disorder 
known  as  Huntington's  disease  (HD)  was  deter- 
mined using  RFLPs,  as  discussed  in  the  section  on 
genetics.  Although  the  HD  gene  itself  has  so  far 
eluded  us,  there  is  every  reason  to  be  optimistic 
that  within  a  year  or  two  it  will  be  identified  and 
cloned.  In  the  meantime,  the  relevant  RFLP  has 
provided  a  useful  marker  for  identifying  carriers 
of  the  disordered  gene.  There  is  similarly  reason 
for  optimism  that  in  the  near  future  the  genetic 
basis  for  the  two  major  affective  disorders,  manic 
depression  and  schizophrenia,  will  be  eluci- 
dated. Careful  studies  of  family  histories  and  of 
identical  twins  raised  apart  have  clearly  estab- 
lished that  both  illnesses  have  an  important  ge- 
netic component,  and  while  neither  is  probably 
due  to  a  single  genetic  mutation,  RFLP  analysis 
and  other  genetic  approaches  should  reveal  the 
genes  involved.  That  such  complex  behavioral 
disorders  might  yield  to  this  type  of  approach  was 
unthinkable  only  a  decade  ago;  as  noted  in  the 
section  on  genetics,  nothing  serves  to  emphasize 
more  dramatically  the  power  of  the  new  genetics 
or  the  exciting  possibilities  it  portends. 

The  dramatic  advances  in  cellular  and  molecu- 
lar neuroscience  should  not  obscure  the  fact  that 
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the  distinctive  role  of  the  nervous  system  in  the 
economy  of  an  organism  is  its  capacity  to  inte- 
grate sensory  information  that  is  received  (both 
from  within  and  from  outside  the  body)  and  to 
organize  it  into  patterns  of  behavior  that  allow 
the  organism  to  respond  to  changes  in  its  environ- 
ment in  appropriate  ways.  And  for  human  beings, 
it  is  through  the  nervous  system  that  we  learn 
both  from  personal  experience  and  the  accumu- 
lated wisdom  of  previous  generations  not  only 
how  to  survive  but  how  to  enjoy  and  profit  from 
the  richness  of  mental  experience  and  meaning- 
ful social  interactions.  To  understand  all  this  we 
will  need  to  learn  a  great  deal  more  about  the 
activities  of  large  populations  of  nerve  cells, 
about  the  computational  capacity  of  complex 
neural  networks,  and  about  the  extraordinary 
ways  in  which  the  human  brain,  with  its  100  bil- 
lion or  more  neurons  and  its  more  than  1  trillion 
synapses,  receives  and  processes  information.  We 
are  at  the  threshold  of  being  able  to  understand 
the  logic  of  the  simplest  organisms  at  both  the 
molecular  and  systems  levels;  the  exploration 
of  human  behavior  in  these  same  terms  stands  as 
perhaps  the  greatest  challenge  to  modern 
science. 
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the  complexes  formed  by  proteins  that  interact 
with  RNA  and  DNA.  Underlying  this  approach  is 
the  belief  that  fundamental  insights  into  the  func- 
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tional  role  of  biologically  interesting  molecules 
can  best  come  from  understanding  the  forms  of 
the  molecules  themselves.  As  Francis  Crick,  the 
co-discoverer  of  the  double-helical  structure  of 
DNA  (see  Figure  9),  remarked:  "to  understand 
function  it  is  essential  to  study  structure."  It  was 
with  this  in  mind  that  in  1985  the  Institute  made 
a  substantial  commitment  to  develop  a  new  Pro- 
gram in  Structural  Biology. 

At  present,  x-ray  crystallography  is  the  most 
powerful  approach  for  visualizing  the  three-di- 
mensional structures  of  large  molecules  (com- 
monly called  tnacromolecules) .  An  essential  pre- 
requisite for  x-ray  analysis  is  the  availability  of 
crystals  of  the  molecule  or  molecular  complex 
that  are  suitable  for  recording  the  diffraction  of 
x-rays.  The  production  of  crystals,  in  turn,  re- 
quires chemically  homogeneous  preparations. 
Moreover,  molecules  that  are  not  spatially  uni- 
form (too  "floppy")  must  be  broken  down  or 
molecularly  dissected  into  defined  and  rigid  com- 
ponents. For  example,  to  crystallize  antibody  mol- 
ecules, it  was  important  first  to  cleave  them  into 
their  principal  fragments,  Fab  and  Fc,  because 
these  pieces  are  normally  connected  by  a  flexible 
hinge.  And  because  of  antibody  diversity  (de- 
scribed in  the  section  on  immunology) ,  it  became 
necessary  to  study  Fab  fragments  from  monoclonal 
immunoglobulins.  A  continuing  challenge  to 
structural  biologists  is  the  development  of  strate- 
gies for  crystallizing  membrane  proteins — by  sol- 
ubilizing  them  with  detergents,  dissecting  them 
into  pieces,  or  altering  them  by  mutation. 

Genetic  engineering  has  transformed  struc- 
tural biology.  This  approach,  which  makes  it  pos- 
sible to  produce  large  quantities  of  pure  proteins, 
also  allows  an  investigator  either  to  choose  a  suit- 
able fragment  for  study  or  to  modify  genetically 
the  molecule  to  be  crystallized.  Other  method- 
ological advances  in  crystallography  itself  are 
transforming  the  field  by  extending  the  range  of 
problems  that  can  be  tackled  routinely. 

There  are  essentially  four  stages  in  determining 
a  structure  by  x-ray  diffraction  analysis  (Figures 
25  and  26):  1)  diffraction  experiments  (data  col- 
lection); 2)  complex  computations  that  pro- 
duce, in  effect,  an  image  of  the  molecule(s)  in 
the  crystal;  3)  interpretation  of  the  computed 
image  in  terms  of  a  molecular  model;  and  4)  re- 
finement of  the  model  by  further  computation. 
Synchrotron  x-ray  sources,  which  are  a  thou- 
sand or  more  times  stronger  than  conventional 
laboratory  x-ray  generators,  are  making  it  possi- 
ble to  study  structures  that  could  not  previously 
be  solved.  (HHMI  is  currently  developing  a 


synchrotron  resource  for  use  by  the  biological 
community  at  the  National  Synchrotron  Light 
Source  at  Brookhaven  National  Laboratory  on 
Long  Island.)  Recent  examples  from  HHMI  labo- 
ratories are  the  human  class  I  major  histocompati- 
bility antigen  and  the  DNA  virus  SV40.  At  the 
same  time,  jxjsition-sensitive  x-ray  detectors  have 
greatly  extended  the  applications  of  conven- 
tional radiation  sources.  Lastly,  novel  computa- 
tional methods  have  made  the  production  of  a 
molecular  image  (phase  determination)  less 
dependent  on  extensive  ancillary  data  from 
heavy-atom  modified  crystals  and  have  made  re- 
finement of  models  less  cumbersome  and  more 
objective. 

In  the  1950s  and  1960s,  x-ray  crystallography 
revealed  the  structures  of  the  first  biologically 
important  molecules,  including  DNA,  hemoglo- 
bin, and  insulin.  In  the  1970s  it  revolutionized 
the  field  of  enzymology  by  making  it  possible  to 
visualize  directly  the  active  sites  of  enzymes.  In 
the  1 980s  it  made  comparably  far-reaching  con- 
tributions to  virology,  immunology,  and  mem- 
brane biology  by  revealing  the  structures  of  vi- 
ruses, antibodies,  and  a  photosynthetic  reaction 
center.  What  can  we  expect  in  the  1990s?  It 
seems  reasonable  to  predict  the  following: 

1.  Structures  of  different  classes  of  proteins 
or  protein/nucleic- acid  complexes.  Three  de- 
cades of  biological  crystallography  have  left  sev- 
eral major  areas  unexplored.  For  example,  we 
have  yet  to  know  what  any  of  the  major  proteins 
of  the  cytoskeleton  and  of  cellular  motility  look 
like  (actin,  myosin,  tubulin,  and  so  forth).  We 
have  yet  to  visualize  any  of  the  membrane  recep- 
tors referred  to  above,  and  we  have  yet  to  see  an 
ion  channel,  a  ribosome,  an  RNA  polymerase,  or  a 
ribozyme;  and,  with  the  exception  of  transfer 
RNAs  (tRNAs),  little  is  known  of  the  three-dimen- 
sional structure  of  most  RNAs  and  RNA-protein 
complexes.  Progress  toward  some  of  these  goals 
is  reported  in  this  volume;  others  will  no  doubt 
be  achieved  before  long,  as  more  and  more 
workers  are  drawn  into  the  field  and  as  new  tech- 
niques are  developed. 

2.  Time-resolved  images  of  events  at  the  ac- 
tive site  of  an  enzyme.  New  ways  of  using 
synchrotron  x-ray  radiation  permit  very  rapid 
measurements  of  diffraction  data,  so  that  in  prin- 
ciple it  should  be  possible  to  follow  the  struc- 
tural changes  that  occur  during  an  enzymatic  re- 
action. If  we  understood  these  changes,  it  might 
be  possible  to  develop  enzymes  with  usefully  al- 
tered properties  and  to  synthesize  enzyme  inhibi- 
tors with  enhanced  specificity. 
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Figure  25.  The  use  of  x-rays  to  determine  the  structure  of  enzymes.  When  an  enzyme  is 
in  a  crystalline  form,  its  molecules  are  arranged  in  regularly  repeating  arrays.  A  beam 
of  x-rays  passed  through  the  enzyme  is  diffracted  to  give  a  pattern  of  spots  on  a  detec- 
tor screen  behind  the  sample.  Each  spot  contains  information  about  the  electron  den- 
sity ( and  hence  the  types  of  atoms:  carbon,  oxygen,  nitrogen,  etc.)  in  various  parts  of 
the  crystal.  This  information  can  be  used  to  construct  electron  density  maps  whose 
contours  indicate  the  type  of  atoms  present.  The  contour  map  shown  at  the  right 
represents  a  short  segment  of  polypeptide  backbone  with  a  tyrosine  side  chain. 
Adapted  from  Geoffrey  Zubay,  Biochemistry,  Addison- Wesley,  1983. 
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Figure  26.  A:  The  x-ray  diffraction  pattern  given  by  the  enzyme  chymotrypsin. 
Courtesy  of  Thomas  A.  Steitz. 

B:  A  representation  of  the  three-dimensional  structure  of  chymotrypsin.  Carbon  atoms  are  shown  in 
black,  nitrogen  in  blue,  oxygen  in  red,  hydrogen  in  white,  and  sulfur  in  yellow.  The  diameter  of  the  enzyme 
is  about  45  A  (somewhat  less  than  a  millionth  of  an  inch).  The  hydrophilic  side  chain  of  arginine- 145  is 
clearly  visible  projecting  outward  from  the  right  side  of  the  molecule.  The  ridges  and  grooves  on  the  surface 
of  the  chymotrypsin  molecule  are  as  unique  as  the  mountains  and  craters  of  the  moon,  and  herein  lies  the 
fulfillment  of  the  lock-and-key  mechanism  hypothesized  by  Emil  Fischer  at  the  turn  of  the  century. 

Courtesy  of  Polygen  Corporation,  Waltham,  MA. 
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3.  The  role  of  molecular  recognition  in  the 
regulation  of  cellular  activity.  How  do  proteins 
that  control  transcription  recognize  specific  DNA 
sequences?  How  do  cell  surface  proteins  in  the 
immune  system  recognize  and  present  antigens? 
The  answers  to  these  questions  are  beginning  to 
emerge  from  crystal  structures  of  molecular  com- 
plexes, such  as  the  complexes  formed  by  regula- 
tory proteins  with  DNA,  those  formed  by  binding 
proteins  with  their  appropriate  ligands,  of  anti- 
bodies with  antigens,  and  of  MHC  molecules  with 
peptides.  These  current  efforts  give  consider- 
able promise  for  understanding  how  hormones  or 
neurotransmitters  trigger  a  cascade  of  events 
that  involves  the  formation  and  dissociation  of 
protein  assemblies  inside  cells.  A  large  and  medi- 
cally significant  class  of  regulatory  interactions 
involves  the  protein  products  of  oncogenes  or 
proto-oncogenes . 

As  our  knowledge  of  important  proteins  rap- 
idly increases,  we  can  only  hope  that  our  capacity 
to  anticipate  aspects  of  structures  not  yet  deter- 
mined will  keep  pace.  The  goal  of  accurately  pre- 
dicting the  three-dimensional  structure  of  any 
protein  from  its  amino  acid  sequence  is  still  a 
long  way  off.  But  recent  advances  in  computa- 
tional chemistry  make  it  possible  to  predict  the 
effects  of  small  perturbations,  such  as  point  mu- 
tations, on  the  folding  of  a  protein  and  to  calcu- 
late differences  in  binding  free  energies  for  re- 
lated ligands.  And  systematic  approaches  to 
designing  drugs,  such  as  antagonists  or  inhibitors 
of  enzymes,  are  beginning  to  emerge  now  that  we 
can  carry  out  meaningful  calculations  on  known 
structures. 

Investigators  in  the  Structural  Biology  Program 

Agard,  David  A.,  Ph.D. 
Briinger,  Axel  T.,  Ph.D. 
Burley,  Stephen  K.,  Ph.D. 
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Fox,  Robert  O.,  Ph.D. 
Harrison,  Stephen  C,  Ph.D. 
Hendrickson,  Wayne  A.,  Ph.D. 
Kuriyan,  John,  Ph.D. 


Nuclear  magnetic  resonance  (NMR)  meth- 
ods offer  an  alternative  route  to  determining  the 
three-dimensional  structures  of  peptides  and 
small  proteins.  The  DNA-binding  domains  of  tran- 
scriptional activators  and  repressors  are  good 
candidates  for  this  type  of  analysis.  The  past  two 
years  have  seen  the  determination  by  NMR  of  the 
structure  of  a  developmentally  important  DNA 
sequence  known  as  a  homeodomain  and  of  a 
DNA-binding  structure  known  as  a  zinc  finger. 
NMR  has  the  great  advantage  that  it  circumvents 
the  need  to  crystallize  the  protein  to  be  studied. 
At  the  other  end  of  the  size  scale,  imaginative 
combinations  of  light  and  electron  microscopy 
have  begun  to  reveal  important  patterns  and  re- 
gularities in  very  large  structures,  such  as  chro- 
mosomes, viruses,  and  receptors.  New  methods 
for  recording  images  and  enhancing  contrast  in 
light  microscopy  make  it  possible  to  record  in 
real  time  the  events  of  intracellular  transport  or 
the  process  of  chromosome  condensation.  As  the 
molecules  that  generate  these  large-scale  intra- 
cellular motions  are  characterized,  it  should  be- 
come possible  to  relate  such  changes  to  the  spe- 
cific molecular  recognition  events  that  control 
them.  It  is  fortunate,  but  not  coincidental,  that  as 
biologists  have  become  increasingly  aware  of  the 
need  to  know  the  precise  structure  of  the  mole- 
cules that  mediate  the  phenomena  they  are  inter- 
ested in,  a  whole  range  of  new  experimental 
methods  has  been  developed,  and  a  new  genera- 
tion of  structural  biologists  has  emerged  to  assist 
them  and  to  advance  their  understanding  of  the 
complex  relationships  between  structure  and 
function. 
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NERVE  cells  are  specialized  to  generate,  trans- 
mit, and  receive  rapid  electrical  messages. 
Electrical  impulses,  called  action  potentials,  last 
about  1,000th  of  a  second  and  can  travel  along 
specialized  nerve  cell  extensions  at  speeds  over 
100  mph.  Chemical  transmitter  substances  re- 
leased onto  the  nerve  cell  by  other  nerve  cells 
control  the  precise  timing  of  these  electrical 
pulses.  We  are  trying  to  understand  how  these 
pulses  are  generated  and  how  transmitters  im- 
pinging on  the  cell  control  them. 

Cell  membranes  are  normally  effective  barriers 
to  the  movement  of  ions  (electrically  charged 
atoms)  between  the  cell  environment  and  the 
cell  interior.  This  insulating  property  allows  the 
inside  of  a  nerve  cell  to  have  a  different  electrical 
voltage  from  the  outside,  or  from  a  neighboring 
cell.  The  electrical  activities  described  above  are 
regulated  by  special  protein  molecules,  called 
ion  channels,  which  are  embedded  in  the  cell 
membrane.  There  are  many  types  of  ion  channel. 
Each  type  has  a  specific  role,  but  all  have  in  com- 
mon a  unique  feature  that  allows  certain  ions  to 
travel  easily  through  them.  The  protein  chains 
that  make  up  an  ion  channel  molecule  are 
arranged  to  create  a  minute  tunnel,  through 
which  certain  types  of  ions — for  example,  so- 
dium, potassium,  or  calcium  ions — can  quickly 
move. 

The  direction  that  the  ion  moves  is  not  con- 
trolled by  the  tunnel  but  by  the  ion  concentra- 
tions and  the  transmembrane  voltage.  The  tunnel 
does,  however,  control  the  type  of  ion  that 
moves.  Thus  the  sodium  channel  only  allows  so- 
dium ions  to  pass.  Because  sodium  ions  are  abun- 
dant outside,  but  not  inside,  nerve  cells,  the  exis- 
tence of  open  sodium  channels  leads  to  an  inward 
stream  of  sodium  ions,  making  the  cell  interior 
positive.  On  the  other  hand,  when  potassium 
channels  open,  potassium  streams  out  of  the  cell, 
making  it  negative.  Because  these  tunnels  are  not 
always  open  (indeed  are  closed  most  of  the 
time),  it  is  supposed  that  the  channel  must  have 
some  sort  of  gate. 

The  basic  electrical  pulse  of  a  nerve  cell  is  a 
positive-negative  sequence  reflecting  the  rapid 


opening  of  sodium  channels  followed  by  their 
closing  and  the  opening  of  potassium  channels. 
What  triggers  the  opening  and  closing  of  the  ion 
channel  gates?  It  has  been  known  for  some  time 
that  sodium  channel  opening  is  triggered  by  a  pos- 
itive change  in  the  membrane  voltage.  We  have 
recently  shown  that  an  important  trigger  for  po- 
tassium channel  opening  is  a  brief  increase  in  cy- 
toplasmic calcium  just  beneath  the  cell  mem- 
brane. Some  of  our  most  recent  work  has  focused 
on  how  this  cytoplasmic  signal  is  generated. 

We  have  split  the  problem  into  two  parts,  using 
individual  nerve  cells  isolated  from  bullfrogs. 
First  we  analyzed  how  calcium  gets  through  the 
membrane  from  the  outside.  We  employed  the 
voltage-clamp  technique,  in  which  an  electrical 
connection  is  made  to  the  cell  interior  via  a  glass 
micropipette;  this  allows  application  of  voltages 
to  the  cell  and  measurement  of  ion  movements 
through  channels.  The  conclusion  from  this 
study  is  that  the  cell  membrane  contains  numer- 
ous channels  specialized  to  allow  calcium 
entry — channels  that  open  quickly  when  the 
transmembrane  voltage  becomes  positive  and 
stay  open  as  long  as  the  voltage  remains  positive. 
Only  one  type  of  calcium  channel  seems  to  be 
operating  (other  scientists  report  a  more  compli- 
cated process  in  other  tissues) . 

The  second  part  of  the  problem  was  to  deter- 
mine how  calcium  spreads  within  the  cell.  Here 
we  used  a  calcium-sensitive  fluorescent  dye  and  a 
scanning  laser  microscope.  The  laser  beam  is  fo- 
cused to  a  small  spot,  which  can  be  rapidly 
scanned  over  the  cell.  For  example,  the  spot  can 
be  scanned  from  membrane  to  membrane  in  a  few 
thousandths  of  a  second.  We  find  that  calcium 
moves  quickly  through  the  cytoplasm  once  it  has 
entered  through  the  open  calcium  channels. 

Computer  calculations  show  that  the  move- 
ment of  calcium  can  be  accounted  for  by  the  fa- 
miliar physical  process  of  diffusion,  without  any 
need  for  special  mechanisms.  Calcium  move- 
ments, however,  are  modified  to  some  extent  by 
binding  to  cytoplasmic  molecules  (such  as  pro- 
teins, or  the  dye  we  introduced  into  the  cell  to 
measure  calcium). 
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We  are  also  interested  in  other  nerve  cell  ion 
channels  that  are  not  primarily  regulated  by  cal- 
cium. For  example,  we  have  recently  been  able  to 
record  directly  the  activity  of  a  potassium  ion 
channel  that  we  had  previously  postulated  and 
called  the  M  channel.  This  channel  operates  in  a 
membrane  potential  range  (—60  to  —30  mV)  in 
which  calcium  entry  is  not  important.  Neverthe- 
less, agents  (e.g.,  the  neurotransmitter  acetylcho- 
line and  the  widely  consumed  drug  caffeine)  that 
alfect  intracellular  calcium  can  also  affect  the  M 
channel.  In  recent  experiments  we  have  used  two 
different  methods  to  raise  intracellular  calcium 
and  explore  this  relationship.  One  method  uses 
light  to  release  calcium  from  a  molecular  cage; 


the  other  involves  perfusing  the  cell  interior  with 
various  calcium-buffer  mixtures.  Both  ap- 
proaches show  that  M  channel  activity  increases 
when  calcium  levels  increase  moderately  above 
normal.  Transmitters  that  decrease  M  channel  ac- 
tivity must  therefore  work  by  some  other,  un- 
known, mechanism. 

We  have  also  recently  studied  a  neurotransmit- 
ter-operated  synaptic  channel  in  relay  cells  of  the 
lateral  geniculate  nucleus  (LGN).  Excitatory 
synapses  on  LGN  relay  cells  from  both  the  eye 
and  cerebral  cortex  activate  voltage-dependent 
NMDA  (7V-methyl-D-aspartate)  channels.  We  are 
currently  trying  to  understand  the  voltage-depen- 
dent responses  of  these  cells. 
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Calcium  diffusion  in  a  nerve  cell  body.  Pseudocolor  is  used  to  represent  calcium  ion  concentra- 
tion at  various  points  along  a  diameter  of  a  spherical  nerve  cell.  The  top  edge  of  the  image 
corresponds  to  the  top  edge  of  the  cell  ( Oiim ),  and  the  bottom  edge  of  the  image  to  the  bottom  edge 
of  the  cell  (40  nm ).  Time  is  represented  from  left  to  right.  Thus  at  0  time,  all  points  along  the  cell's 
diameter  had  a  low  calcium  level  (blue ).  Calcium  remained  low  until  the  cell  was  given  a  voltage 
pulse  (diagrammed  at  top  of  figure),  which  caused  calcium  channels  in  the  cell  membrane  to 
open.  Calcium  immediately  entered  the  edge  of  the  cell  (yellow  and  then  red)  and  rapidly  spread 
toward  the  center.  The  color  bar  shown  is  only  approximate.  Image  obtained  by  combined  whole- 
cell  recording  and  FLUO  confocal  fluorescence  microscopy. 

Modified  from  Hernandez- Cruz,  A.,  Sala,  P.,  and  Adams,  P.R.  1990.  Science  247:858-862. 
Copyright  1990  by  the  AAAS. 
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The  receptor-binding  domain  of  apolipoprotein  E,  the  first  important  protein  involved  in 
cholesterol  metabolism  to  be  solved  by  x-ray  crystallography.  This  protein  is  responsible  for 
targeting  plasma  lipoproteins  such  as  chylomicrons,  VLDL  ( very  low  density  lipoprotein ), 
and  HDL  (high-density  lipoprotein)  to  their  appropriate  cellular  receptors  (the  LDL 
receptor). 

1.  Electron  density  map  showing  two  a-helices  from  the  recently  determined  crystal 
structure  of  apolipoprotein  E. 

2.  Diagram  showing  that  apolipoprotein  E  is  an  unusually  elongated,  antiparallel,  four- 
helix  bundle,  with  an  additional  short  helix  linking  helices  1  and  2.  The  structure  is  built 
from  repeating  sequence  elements  shown  by  different  colored  regions. 

3-  A  cutaway  surface  revealing  the  hydrophobic  interface  between  helices  1  and  4.  The 
structure  is  substantially  stabilized  by  a  leucine  zipper  motif.  The  juxtaposed  leucines  from 
one  helix  are  shown  in  gold,  the  other  in  orange. 

Research  of  David  A.  Agard. 
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\  postdoctoral  work  was  done  on  high-resolution  electron  microscopic  crystallography  at  the  MRC  Labora- 
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ji^H     ing  of  the  a-lytic  protease  gene  with  Sydney  Brenner. 


THIS  laboratory  is  primarily  concerned  with 
exploring  the  fundamental  relationships  be- 
tween structure  and  function  at  the  molecular 
and  cellular  levels.  Four  areas  of  investigation  are 
actively  pursued:  three-dimensional  analysis  of 
diploid  chromosome  structure  and  topology; 
studies  on  the  structural  determinants  of  specific- 
ity, using  a-lytic  protease  as  a  model  system; 
functional  and  structural  analysis  of  the  role  of 
the  precursor  in  proper  folding  of  a-lytic  pro- 
tease; and  determination  of  the  first  three-dimen- 
sional crystal  structure  of  an  important  protein 
in  human  cholesterol  metabolism — apolipopro- 
tein  E. 

Three-Dimensional  Analysis  of 
Chromosome  Structure 

We  study  chromosome  structure  in  close  col- 
laboration with  John  Sedat  (HHMI,  University  of 
California,  San  Francisco) ;  only  a  subset  of  these 
studies  will  be  discussed  here.  Our  primary  aim 
in  this  area  is  to  provide  a  physical  basis  for  un- 
derstanding chromosome  behavior  and  function 
by  directly  determining  the  three-dimensional 
structure  of  eukaryotic  chromosomes  as  a  func- 
tion of  both  transcriptional  state  and  the  cell  cy- 
cle stage.  To  accomplish  this  goal  we  are  attempt- 
ing to  understand  how  fibers  of  nucleosomes 
are  folded  into  higher-order  structures  within 
the  chromosome  and  what  role  specific  chro- 
mosomal proteins  play  in  determining  these 
structures. 

We  have  had  to  develop  the  necessary  technolo- 
gies (hardware  and  software)  to  allow  us  to  exam- 
ine complex  noncrystalline  specimens  in  three 
dimensions,  using  electron  microscopy  (EM)  and 
light  microscopy.  The  past  year  has  seen  signifi- 
cant software  developments  for  three-dimen- 
sional image  reconstruction  with  both  kinds  of 
microscopes.  Many  of  the  tedious  aspects  of  EM 
tomography  have  now  been  automated,  greatly 
speeding  the  task  of  generating  a  three-dimen- 
sional reconstruction. 

The  Role  of  Topoisomerase  II  in 
Chromosome  Structures 

Current  work  focuses  on  deepening  our  under- 


standing of  chromosomal  structure  by  combining 
three-dimensional  observation  methods  with  bio- 
chemical probes  in  an  effort  to  correlate  struc- 
tural aspects  with  specific  macromolecular 
components.  Topoisomerase  II  is  a  major  chromo- 
somal protein  that  is  important  for  relieving  the 
torsional  stress  of  supercoiling.  It  is  also  postu- 
lated to  play  a  crucial  role  in  organizing  the 
higher-order  structure  of  chromosomes.  We  have 
begun  to  investigate  the  localization  and  function 
of  topoisomerase  II  in  chromosomal  samples  pre- 
pared by  methods  that  preserve  their  in  vivo 
structure.  We  have  examined  prophase,  meta- 
phase,  and  anaphase  embryonic  cycle  12  and  13 
nuclei,  and  metaphase  and  anaphase  Kc  nuclei. 

Our  data  show  that  although  topoisomerase  II 
is  clearly  associated  with  mitotic  chromosomes, 
it  is  concentrated  at  specific  sites  along  them 
rather  than  localized  to  an  internal  core.  These 
sites  can  be  spatially  coincident  with  the  chro- 
mosome or  adjacent  to  the  chromosome  arm. 
This  seems  incompatible  with  a  purely  structural 
role  for  the  enzyme.  The  nature  of  the  topoiso- 
merase II  sites  is  still  unknown,  but  we  hypothe- 
size that  those  we  have  recorded  are  locations  of 
the  enzyme's  activity  during  chromosome  con- 
densation and  segregation.  We  have  observed  that 
topoisomerase  II  is  specifically  localized  to  re- 
gions of  nondisjunction  in  failed  mitoses. 

We  are  extending  this  work  by  examining  the 
distribution  of  the  enzyme  at  high  resolution  in 
the  electron  microscope  and  its  dynamics  in  live 
Drosophila  embryos.  We  have  recently  devel- 
oped several  monoclonal  antibodies  to  topoiso- 
merase II  as  an  important  aid  to  our  studies. 

The  Structural  Basis  of  Enzyme  Specificity 

We  had  previously  chosen  a-lytic  protease  as  a 
model  for  investigating  structural  and  energetic 
aspects  of  enzyme-substrate  specificity,  because 
its  binding  pocket,  made  of  the  side  chains  of 
three  amino  acids  (Met  192,  Met  2 1 3 ,  Val  2 1 7A) , 
provided  a  large  volume  that  could  be  experimen- 
tally manipulated.  To  probe  the  structural  basis 
for  specificity,  we  are  combining  solution  kinetic 
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analysis,  x-ray  crystallographic  structural  analy- 
sis, and  site-directed  mutagenesis.  Of  key  impor- 
tance has  been  the  availability  of  tight-binding 
peptide  boronic  acids,  which  provide  an  excel- 
lent model  for  the  reaction  transition  state  or 
nearby  intermediates.  Approximately  30  high- 
resolution,  extremely  well-refined  crystal  struc- 
tures have  now  been  determined  and  analyzed. 
These  have  provided  significant  new  insights  into 
the  structure  of  the  transition  state  and  the  im- 
portance of  substrate  hydrogen  bonding  for  its 
stabilization,  as  well  as  fundamental  information 
on  steric  exclusion,  substrate  specificity,  and  en- 
zyme flexibility. 

By  mutation  we  have  been  able  to  alter  dramati- 
cally the  pattern  of  specificity  while  maintaining 
or  even  increasing  enzyme  activity.  Detailed 
structural  analysis  of  two  mutants  as  free  enzymes 
and  as  complexes  have  provided  surprising  in- 
sights into  the  mechanism  and  complexity  of 
specificity  and  have  indicated  the  crucial  role  of 
protein  flexibility  in  selectivity.  During  the  past 
year  we  have  made  numerous  other  mutations 
and  examined  their  kinetic  and  structural 
properties. 

In  the  past  year  we  began  a  collaboration  with 
Vladimir  Basus  (University  of  California,  San  Fran- 
cisco) to  perform  a  complete  two-dimensional 
nuclear  magnetic  resonance  (NMR)  analysis  on 
the  structure  of  a-lytic  protease.  We  hope  this 
will  provide  new  insights  to  correlated  motions 
within  the  enzyme  and  be  useful  as  well  for  the 
folding  studies  mentioned  below. 

Recently  we  have  developed  a  new  method  for 
predicting  the  energetics  of  protein-substrate  in- 
teractions. This  approach,  based  on  Ponder- 
Richards  rotamers,  combined  with  energetics  and 
solvation  terms,  can  predict  k^^JK^  with  stun- 
ning accuracy.  We  have  used  this  method  to  de- 
sign a  new  enzyme  with  particular  properties, 
and  so  far  the  predictions  have  been  remarkably 
accurate. 

Structural  and  Biochemical  Probes  of 
Folding  of  a-Lytic  Protease 

a-Lytic  protease  is  synthesized  as  a  prepro- 
enzyme.  Experiments  in  Escherichia  coli  have 


demonstrated  that  the  1 66-amino  acid  precursor 
domain  is  absolutely  required  for  the  proper  fold- 
ing of  the  198-amino  acid  protease  domain.  Fur- 
thermore, we  have  shown  that  proper  folding  can 
be  accomplished  either  in  vivo  or  in  vitro  with 
the  pro  region  covalently  attached  or  in  trans. 
Our  current  thinking  is  that  a  high-energy  barrier 
exists  between  the  folded  and  unfolded  states 
that  the  mature  protease  cannot  cross  by  itself. 
The  precursor  acts  as  a  "foldase"  to  stabilize  the 
transition  state  for  folding,  essentially  serving  as  a 
template  on  which  the  mature  enzyme  finds  its 
active  conformation. 

Current  efforts  are  focused  on  analyzing  this 
folding  reaction  in  vitro.  Amazingly,  we  have 
been  able  to  trap  and  purify  a  stable  folding  inter- 
mediate that  is  rapidly  refolded  upon  addition  of 
the  pro  region.  We  plan  to  use  a  combination  of 
physical  approaches  including  two-dimensional 
NMR  to  probe  the  structure  of  the  intermediate 
and  the  role  of  the  pro  region  in  the  final  stage  of 
folding. 

Structure  of  Apolipoprotein  E 

Apolipoprotein  E  is  an  important  protein  in 
cholesterol  metabolism  in  mammals.  It  is  one  of 
two  proteins  that  can  bind  to  the  low-density  li- 
poprotein (LDL)  receptor  (the  other  is  apolipo- 
protein B)  and  thus  has  a  major  role  in  triglycer- 
ide and  cholesterol  metabolism.  The  protein 
itself  has  two  distinct  structural  and  functional 
domains.  The  amino-terminal  22-kDa  domain 
contains  the  receptor  binding  functionality, 
whereas  lipid  binding  resides  primarily  with  the 
10-kDa  carboxyl-terminal  domain.  In  collabora- 
tion with  the  Mahley  group  at  the  Gladstone 
Foundation  Laboratories  for  Cardiovascular  Dis- 
ease, we  have  obtained  crystals  of  the  22-kDa  re- 
ceptor-binding domain  and  recently  finished  the 
high-resolution  structure  determination.  The 
protein  is  an  unusually  elongated  four-helix  bun- 
dle. Although  the  surface  is  exceptionally 
charged,  positive  and  negative  charged  groups 
are  precisely  balanced,  except  in  what  we  be- 
lieve is  the  receptor-binding  region.  Currently, 
we  are  examining  the  structures  of  two  human 
mutants  that  disrupt  receptor  binding. 
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ION  channels  are  the  molecular  units  of  electri- 
cal signaling  in  cells.  They  are  proteins  that 
regulate  the  movement  of  ions,  such  as  sodium, 
calcium,  and  potassium,  into  and  out  of  cells. 
They  are  responsible  for  the  conversion  of  exter- 
nal sensory  signals  to  the  electrical  language  of 
the  nervous  system  and  the  integration  of  these 
signals  to  generate  appropriate  behavior.  In  addi- 
tion, ion  channels  are  important  for  the  genera- 
tion and  regulation  of  the  heart  beat,  for  contrac- 
tion of  muscles,  and  for  the  release  of  hormones 
in  the  bloodstream.  A  large  variety  of  ion  channel 
types  are  found  in  the  body.  They  are  specialized 
to  select  for  certain  species  of  ions  and  to  open 
and  close  in  response  to  a  number  of  different 
stimuli,  such  as  the  binding  of  a  neurotransmitter 
molecule  or  a  change  in  the  voltage  that  exists 
across  a  cell's  membrane.  Our  laboratory  is  inter- 
ested in  the  molecular  mechanisms  of  ion  chan- 
nel function.  One  of  our  major  goals  is  to  under- 
stand the  conformational  changes  that  occur  as 
the  channels  respond  to  appropriate  stimuli. 

Voltage-gated  ion  channels  are  an  important 
functional  class.  As  their  name  implies,  they  can 
open  in  response  to  changes  in  the  electrical  po- 
tential across  the  cell  membrane,  a  property  cru- 
cial for  the  generation  of  electrical  signals  and 
their  transmission  throughout  the  body.  These 
molecules  have  a  way  to  measure  the  electrical 
potential  and  open  accordingly.  In  addition, 
some  of  them  inactivate,  or  become  unavailable 
for  opening  after  use.  In  recent  years  we  have 
studied  the  molecular  mechanisms  of  inactiva- 
tion  of  a  class  of  potassium  channels  that  were 
cloned  in  Drosophila. 

These  channels  are  products  of  the  Shaker 
gene.  They  exhibit  the  fastest  inactivation  of  any 
potassium  channels  yet  cloned.  William  Zagotta 
and  I  began  by  using  single-channel  recording 
methods  to  study  the  gating  properties  of  wild- 
type  Shaker  channels  in  their  native  tissue.  Such 
methods  allow  us  to  record  the  behavior  of  a  sin- 
gle-channel molecule  as  it  opens  and  closes  on  a 
millisecond  time  scale. 

We  determined  that  the  conformational 
changes  associated  with  opening  the  channel  de- 


pended strongly  on  the  membrane  voltage  and 
therefore  involved  a  substantial  rearrangement  of 
an  electrically  charged  part  of  the  channel  in  the 
membrane.  On  the  other  hand,  the  inactivation 
process  did  not  involve  significant  charge  rear- 
rangement. This  result,  combined  with  our  abil- 
ity to  alter  inactivation  by  internal  enzymes,  led 
us  to  the  hypothesis  that  inactivation  involved  a 
conformational  change  on  the  inside  of  the  mem- 
brane that  blocked  the  flow  of  potassium  ions 
through  the  channel. 

Dr.  Zagotta,  Toshinori  Hoshi,  and  I  further  stud- 
ied this  hypothesis  by  making  altered  channels 
with  recombinate  DNA  methods  and  expressing 
the  normal  and  altered  channels  in  frog  oocytes. 
Our  results  demonstrated  that  the  first  10  or  so 
hydrophobic  amino  acids,  and  the  positively 
charged  amino  acids  in  positions  16  through  19, 
are  important  in  the  inactivation  mechanism.  By 
altering  these  amino  acids,  we  can  change  the 
inactivation  rate  in  a  graded  manner  up  to  about 
20  times  slower  than  normal. 

Deletion  and  insertion  mutations  in  an  adjacent 
region  alter  the  inactivation  rate  as  a  function  of 
the  length  of  the  remaining  amino  acid  chain, 
suggesting  that  this  region  acts  as  a  spacer.  Our 
results  are  strikingly  consistent  with  the  "ball  and 
chain"  model  of  inactivation  originally  proposed 
for  the  voltage-dependent  sodium  channel  by 
Armstrong  and  Bezanilla  in  1977. 

The  following  model  of  the  molecular  mecha- 
nism of  inactivation  emerges  from  our  results. 
The  amino  terminus  of  the  Shaker  channel  acts  as 
an  agonist  for  inactivation.  The  inactivation  re- 
ceptor is  located  elsewhere  on  the  cytoplasmic 
side  of  the  molecule.  When  the  inactivation  ago- 
nist is  bound  to  the  receptor,  the  channel  closes. 
The  first  1 0  or  so  hydrophobic  amino  acids  of  the 
Shaker  protein  either  form  the  core  of  the  inacti- 
vation particle  or  provide  the  hydrophobic  inter- 
actions necessary  for  the  particle  to  bind  to  the 
inactivation  receptor.  The  positively  charged  resi- 
dues electrostatically  interact  with  the  receptor, 
which  is  likely  to  be  negatively  charged.  Muta- 
tions in  this  region  presumably  disrupted  the  in- 
activation particle  and  precluded  inactivation. 

Amino  acid  residues  21  and  beyond  form  the 
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"chain."  Deletion  mutations  in  this  region  accel- 
erated the  inactivation  process  by  shortening  the 
chain  part  of  the  sequence.  We  tested  this  model 
further  by  applying  a  solution  containing  the  free 
synthetic  inactivation  particle  to  the  inside  face 
of  mutant  channels  that  did  not  inactivate.  In  the 
presence  of  the  synthetic  inactivation  particle, 
the  mutant  channels  regained  inactivation,  con- 
sistent with  the  ball  and  chain  mechanism.  We 
are  now  using  synthetic  inactivation  particles  and 
mutant  channels  to  examine  the  detailed  struc- 
tural requirements  for  inactivation. 

Shaker  potassium  channels  also  exhibit  a 
slower  inactivation  process.  It  can  be  seen  both  in 
wild-type  channels  and  after  the  faster  inactiva- 
tion process  has  been  removed  by  mutagenesis. 
The  slow  inactivation  does  not  require  intact  fast 
inactivation.  Although  it  also  does  not  involve  re- 
arrangement of  charge  in  the  membrane,  this 
slower  inactivation  seems  to  involve  a  mecha- 


nism different  from  the  fast  process.  In  collab- 
oration with  Kathleen  Choi  and  Gary  Yellen 
(HHMI,  the  Johns  Hopkins  University)  we  have 
found  that  the  slow  inactivation  is  affected  by 
external  agents,  suggesting  that  the  conforma- 
tional changes  for  this  process  involve  external 
structures. 

The  slow  inactivation  process  occurs  by  greatly 
different  rates  in  variants  of  the  Shaker  channel 
with  differences  in  structure  at  the  carboxyl  end 
of  the  protein.  We  have  made  mutations  in  both 
of  these  variants  and  have  localized  to  a  single 
hydrophobic  amino  acid  in  a  membrane-span- 
ning region  of  the  channel  molecule  the  differ- 
ence responsible  for  the  slow  inactivation  differ- 
ences. Other  amino  acid  substitutions  at  this 
position  have  dramatic  effects  on  gating,  with 
larger  hydrophobic  amino  acids  leading  to  slower 
inactivation.  We  are  currently  investigating  this 
process  further. 
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ONGOING  studies  in  our  laboratory  are 
aimed  at  elucidating  the  mechanism  of  insu- 
lin action  on  the  expression  of  enzymes  that  reg- 
ulate cell  growth  and  metabolism.  We  have 
focused  particularly  on  regulation  of  the  glycer- 
aldehyde- 3 -phosphate  dehydrogenase  (GAPDH) 
gene  in  adipose  tissue  and  liver.  Insulin  induces 
GAPDH  mRNA  levels  8-fold  in  cultured  3T3-L1 
adipocytes  and  10-fold  in  fat  or  liver  tissue  iso- 
lated from  rats  fasted  and  then  fed  a  high-carbo- 
hydrate, low-fat  diet.  Expression  of  the  GAPDH 
gene  is  markedly  decreased  in  primary  adipocytes 
isolated  from  diabetic  animals  and  induced  above 
basal  levels  upon  replacement  of  insulin.  The  ef- 
fect of  insulin  on  this  gene  is  tissue  specific; 
GAPDH  mRNA  is  not  regulated  in  muscle.  Study 
of  insulin  regulation  of  this  gene  provides  a 
marker  of  the  metabolic  effects  of  insulin  on  gene 
expression. 

Activation  of  GAPDH  gene  expression  in  insu- 
lin-responsive tissues  correlates  with  the  pres- 
ence of  insulin-responsive  DNA-binding  proteins 
that  bind  specifically  to  elements  in  the  5'  flank- 
ing region  of  the  GAPDH  gene  and  confer  insulin- 
responsive  gene  expression  to  the  chlorampheni- 
col acetyltransferase  gene.  Within  60  minutes  of 
exposure  of  3T3  adipocytes  or  H35  hepatoma 
cells  to  insulin,  the  activity  of  these  sequence- 
specific  DNA-binding  proteins  is  increased  two- 
to  fourfold.  The  insulin-responsive  element  A 
(IRE-A)  DNA-binding  protein  (IRP-A)  is  induced 
four-  to  eightfold  in  liver  and  fat  during  the  pro- 
cess of  refeeding  a  fasted  rat  a  high-carbohydrate, 
low-fat  diet,  a  process  known  to  increase  circu- 
lating glucose  and  insulin  levels,  resulting  in  the 
induction  of  glycolytic  and  lipogenic  enzymes. 
IRP-A  binding  is  inhibited  in  the  fat  pads  of  dia- 
betic animals  and  is  induced  above  normal  levels 
when  insulin  is  administered  to  diabetic  animals. 
In  muscle,  where  GAPDH  activity  is  not  rate  limit- 
ing, IRP-A  binding  is  not  detectable.  These  obser- 
vations support  the  importance  of  GAPDH  gene 
regulation  in  vivo. 

The  Southwestern  screening  technique  was 
used  to  clone  a  gene  that  encodes  a  specific  IRP-A 


DNA-binding  protein.  The  cloned  cDNA  has  been 
used  to  examine  the  mechanism  by  which  insulin 
chronically  regulates  expression  of  the  GAPDH 
gene.  This  clone  is  expressed  in  liver  and  fat,  but 
not  in  muscle,  which  provides  an  explanation  of 
the  tissue-specific  regulation  of  GAPDH  gene  ex- 
pression. Expression  of  IRP-A  mRNA  is  inhibited 
in  diabetes  and  up-regulated  with  insulin  replace- 
ment; expression  is  induced  during  the  process 
of  fasting  and  refeeding.  In  contrast,  1  hour  of 
insulin  exposure  of  cells  does  not  appear  to  alter 
expression  of  the  IRP-A  mRNA.  Thus  it  appears 
that  the  activity  of  this  factor  is  regulated  acutely 
by  a  post-translational  modification  and  chroni- 
cally by  an  alteration  in  gene  expression. 

The  mechanism  by  which  insulin's  initial  in- 
teraction with  its  cell  surface  receptor  tyrosine 
kinase  stimulates  intracellular  processes  is  un- 
clear. Several  enzymes  that  are  regulated  by  insu- 
lin have  been  found  to  undergo  phospho-dephos- 
pho  intercon versions.  Binding  of  IRP-A  protein  to 
DNA  is  dependent  on  phosphorylation.  To  define 
the  steps  in  the  signal  transduction  pathway  of 
insulin  action  on  this  transcription  factor,  we 
will  use  antibodies  that  specifically  interact  with 
IRP-A  protein  and  examine  the  effect  of  insulin 
on  the  phosphorylation  state  of  the  IRP-A  protein. 
IRP-A  contains  a  binding  domain  that  is  perfectly 
conserved  across  species  from  rat  to  yeast  and  an 
acidic  domain  that  is  capable  of  activating  gene 
transcription.  These  domains  are  surrounded  by 
putative  phosphorylation  sites  for  insulin-sensi- 
tive kinases.  Future  efforts  will  be  aimed  at  deter- 
mining whether  any  of  these  putative  phosphory- 
lation domains  is  critical  for  activation  of  DNA 
binding  or  transcriptional  activity. 

Southern  analysis  with  the  conserved  IRP-A 
binding  domain  indicates  that  this  gene  belongs 
to  a  large  family  of  related  genes.  Over  the  next 
year,  the  potential  for  interaction  between  family 
members  will  be  explored.  In  the  process,  the 
regions  of  IRP-A  that  contact  DNA  and  the  regions 
that  are  required  for  protein-protein  interactions 
will  be  defined. 
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Studies  on  the  regulation  of  IRP-A  gene  expres- 
sion have  led  to  an  understanding  of  the  mecha- 
nism by  which  insulin  modulates  the  expression 
of  specific  genes  in  specific  tissues  involved  in 
the  maintenance  of  normal  glucose  and  lipid  me- 
tabolism. Ultimately  these  studies  will  lead  to  an 


understanding  of  the  signal  transduction  process 
by  which  insulin  modulates  the  expression  of 
these  genes.  Understanding  the  hormonal  control 
of  lipid  metabolism  at  a  molecular  level  will  pro- 
vide insights  into  two  disease  states  of  major  im- 
portance, obesity  and  diabetes. 
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WE  are  interested  in  the  molecular  mecha- 
nisms that  underlie  the  generation  of  a  spe- 
cific immune  response.  The  mammalian  immune 
system  functions  through  complex  interactions 
between  various  cells  and  their  products.  Cells 
that  effect  specific  immunological  responses  fall 
into  two  general  categories:  B  lymphocytes  that 
mediate  humoral  immunity  (i.e.,  production  of 
antibodies  against  foreign  antigens)  and  T  lym- 
phocytes that  mediate  cellular  immunity  (e.g., 
foreign  graft  rejection) .  B  and  T  lymphocytes  that 
actively  fight  infections  and  other  diseases  are 
generated  in  two  general  stages  (primary  and 
secondary) . 

During  primary  stages,  stem  cells  proceed 
through  a  developmental  program  that  ultimately 
leads  to  the  generation  of  a  multitude  of  individ- 
ual B  or  T  lymphocyte  clones  (each  clone  is  an 
essentially  identical  set  of  cells  derived  from  a 
common  parent).  Each  set  of  clonal  cells  ex- 
presses a  novel  receptor  on  its  surface  that  will 
recognize  a  unique  set  of  antigens. 

The  secondary  phase  of  lymphocyte  differen- 
tiation results  when  a  lymphocyte  meets  a  foreign 
antigen  (e.g.,  a  bacterial  cell  surface  component) 
that  is  recognized  by  its  surface  receptor;  this 
stimulates  the  lymphocyte  to  divide  and  mature 
into  an  effector  cell.  For  B  lymphocytes,  this  mat- 
uration event  involves  secretion  into  the  blood- 
stream of  its  specific  receptor  molecule;  this  se- 
creted product  is  the  antibody. 

The  ability  of  the  immune  system  to  respond 
specifically  to  a  vast  array  of  antigens  results  in 
substantial  part  from  the  unique  organization  of 
the  genes  that  encode  antigen  receptor  proteins. 
Unlike  most  genes,  antigen  receptor  genes  are  not 
inherited  intact  from  our  parents.  Instead,  these 
genes  are  encoded  in  cassettes  (gene  segments) 
in  the  germline  and  are  assembled  into  complete 
genes  only  during  the  somatic  differentiation  of 
lymphocytes.  Because  there  are  many  individual 
cassettes  that  encode  various  portions  of  antibod- 
ies and  because  these  can  be  put  together  in 
various  combinations  or  in  various  ways,  the 
body  can  randomly  assemble  a  vast  array  of  differ- 
ent antibody  genes  from  a  limited  amount  of  ge- 
netic material. 


Much  of  our  work  is  aimed  at  determining  the 
genetic  mechanisms  by  which  antibody  genes  are 
assembled  from  their  basic  cassettes,  the  role  of 
the  gene  assembly  process  in  the  generation  of 
antibody  diversity,  and  the  mechanisms  that  regu- 
late this  gene  assembly  process  and  ensure  that  it 
occurs  only  in  appropriate  cell  types.  We  are  also 
working  on  the  elucidation  of  molecular  signals 
that  control  the  various  steps  of  B  lymphocyte 
differentiation. 

One  experimental  system  employed  in  our 
studies  involves  the  generation  of  cell  lines  that 
undergo  the  same  types  of  antibody  gene  assem- 
bly events  in  culture  dishes  that  normal  develop- 
ing immature  lymphocytes  undergo  in  the  ani- 
mal. In  particular,  we  have  devised  and 
employed  assays  in  which  unrearranged  antibody 
gene  segments  are  isolated  from  normal  mouse  or 
human  chromosomes  and  then  propagated 
(cloned)  in  bacteria;  we  refer  to  these  cloned 
molecules  as  "recombination  constructs."  Var- 
ious types  of  recombination  substrates  can  be  in- 
troduced into  the  permanent  cell  lines  to  ask 
mechanistic  and  regulatory  questions  about  anti- 
body gene  assembly.  These  studies  have  allowed 
us  to  clarify  many  mechanistic  aspects  of  this 
gene  assembly  process  and  to  devise  schemes  for 
how  the  assembly  process  is  regulated.  For  exam- 
ple, we  were  able  to  show  that  both  antibody 
genes  and  the  analogous  genes  in  T  lymphocytes 
(encoding  the  T  cell  receptor)  are  assembled  by 
the  same  general  recombination  machinery. 

A  second  general  system  that  we  employ  to  ana- 
lyze control  of  the  recombination  mechanism  is 
the  transgenic  mouse.  By  injecting  purified  ge- 
netic material  into  fenilized  mouse  eggs  and  then 
implanting  them  into  foster  mothers,  it  is  possi- 
ble to  make  mice  that  carry  the  foreign  DNA  in 
their  germline.  We  have  insened  recombination 
substrates  that  contain  experimentally  manipu- 
lated antibody  gene  cassettes  (or  T  cell  receptor 
gene  cassettes)  into  the  mouse  genome  and  asked 
questions  about  the  genetic  and  cellular  mecha- 
nisms that  regulate  assembly.  These  studies  al- 
lowed us  to  extend  our  studies  on  clonal  cell 
lines  to  identify  more  precisely  several  genetic 


Genetic  Mechanisms  Involved  in  the  Generation  of  the  Antibody  Repertoire 


elements  that  govern  the  assembly  process.  Most 
notably,  we  have  shown  that  genetic  regulatory 
elements  previously  thought  to  control  only  the 
level  of  gene  expression  can  also  serve  to  control 
specific  gene  rearrangement. 

Currently  we  are  attempting  to  generate  sev- 
eral novel  mouse  models  to  study  factors  in- 
volved in  lymphocyte  differentiation.  One  ap- 
proach is  to  produce  a  mutant  mouse  line  that 
cannot  produce  endogenous  antibody  molecules. 
To  accomplish  this,  we  employed  new  methods 
that  permit  introduction  of  desired  genetic  muta- 
tions into  specific  genes  in  mouse  embryonal 
stem  cell  lines  to  create  cells  that  lack  germline 
gene  segments  necessary  for  forming  functional 
antibody  genes.  We  used  these  mutant  cell  lines 
to  create  chimeric  mice  that  contain  cells  lacking 
the  antibody  gene  cassettes.  Such  chimeric  mice 
should  generate  mutant  mice  that  inherit  in  their 
germline  an  inability  to  make  endogenous 
antibodies. 

We  will  use  the  mutant  animals  to  study  the 
role  of  antibody  gene  products  in  regulating  anti- 
body gene  assembly  and  lymphocyte  develop- 
ment. A  potential  practical  use  of  such  mice  may 
be  achieved  by  breeding  them  with  a  transgenic 
mouse  line  that  we  have  engineered  to  contain 
functional  human  antibody  gene  cassettes.  The 
hope  is  that  the  hybrid  animal  will  now  rely  on 
the  human  antibody  genes  for  its  immune  system, 
providing  a  more  effective  method  of  generating 
tailor-made  human  antibodies. 

We  also  are  interested  in  the  mechanism  by 
which  certain  intracellular  messenger  molecules 
(lymphokines  or  interleukins)  affect  the  develop- 
ment and  action  of  B  lymphocytes.  Certain  T  lym- 
phocytes regulate  aspects  of  the  immune  re- 
sponse; some  of  these  produce  lymphokines  that 
stimulate  activated  B  lymphocytes  (during  sec- 
ondary differentiation)  to  produce  antigen- 
specific  antibodies  that  differ  in  where  they  go  in 
the  body  and  how  they  mediate  their  protective 
activity  (referred  to  as  different  effector  activi- 
ties). The  difference  in  these  antibodies  results 
from  the  use  of  different  gene  cassettes  to  en- 
code, not  the  antigen-binding  portion,  but  the  ef- 
fector portion  of  the  antibody  molecule.  This  phe- 
nomenon results  from  a  different  type  of  gene 
recombination  assembly  event,  in  which  one  part 
of  an  already  active  antibody  gene  is  replaced 


with  a  different  one.  We  have  shown  that  particu- 
lar lymphokines  actually  direct  this  gene  rear- 
rangement event  by  giving  a  B  lymphocyte  a  sig- 
nal to  prepare  a  specific  effector  gene  cassette  for 
a  rearrangement  event.  Most  recently  we  have 
shown  that  the  ultimate  targets  of  the  lympho- 
kines are  novel  elements  that  control  the  tran- 
scription of  the  effector  gene  cassette. 

Finally,  we  are  interested  in  the  molecular  sig- 
nals that  direct  precursor  lymphocytes  through 
the  primary  differentiation  process.  One  focus  of 
this  work  is  on  three  related  but  distinct  genes, 
the  N-,  L-,  and  c-myc  genes.  These  genes  encode 
proteins  that  operate  in  the  nucleus  of  the  cell, 
presumably  to  regulate  the  expression  of  other 
genes.  The  members  of  this  gene  family  contrib- 
ute to  the  generation  of  various  types  of  tumors 
when  their  expression  is  altered  as  the  result  of 
mutations.  We  have  found  that  individual 
members  of  this  family  are  expressed  at  specific 
points  during  the  differentiation  of  lymphocytes. 
For  example,  the  N-myc  gene  is  expressed  only  in 
lymphocyte  precursors  where  expression  levels 
are  greatly  increased  following  interaction  of 
these  cells  with  a  factor  that  specifically  stimu- 
lates their  growth. 

To  define  further  the  function  of  this  gene  fam- 
ily, we  have  introduced  the  N-myc  and  L-myc 
genes  into  transgenic  mice  in  forms  in  which  they 
are  incorrectly  expressed  throughout  lympho- 
cyte development  (normally  they  are  expressed 
only  very  early  in  lymphocyte  development). 
Such  incorrect  expression  of  these  two  genes 
leads,  respectively,  to  defects  in  B  or  T  lympho- 
cyte development,  including  frequent  generation 
of  lymphocyte  tumors.  The  genetic  mechanisms 
by  which  incorrect  expression  of  the  deregulated 
myc  genes  affects  normal  lymphocyte  develop- 
ment are  being  analyzed. 

As  a  complementary  approach,  we  used  the 
gene-targeting  methodologies  outlined  above  to 
create  mutant  mice  that  lack  functional  N-myc 
genes.  Loss  of  N-myc  gene  function  appears  to  be 
lethal  at  an  early  stage  of  murine  development. 
Therefore,  to  study  normal  function,  we  are  at- 
tempting to  rescue  development  in  the  N-myc 
mutant  mice  lacking  functional  endogenous 
genes  by  breeding  these  animals  with  mice  that 
carry  functional  N-myc  transgenes  that  are  ex- 
pressed in  various  cell  lineages. 
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COMMUNICATION  between  neurons  depends 
on  precise  chemical  signals  that  are  released 
from  one  neuron  and  interpreted  by  adjacent  neu- 
rons. This  transfer  of  information  is  based  on  the 
synthesis  of  chemical  neurotransmitters  by  the 
presynaptic  cell,  their  release  into  the  synaptic 
cleft,  and  their  recognition  by  specific  receptor 
proteins  on  the  membrane  of  the  postsynaptic 
cell.  Our  research  focuses  on  the  structure  and 
expression  of  genes  encoding  proteins  with  im- 
portant roles  in  neurotransmission  and  synaptic 
function. 

We  have  been  interested  in  a  family  of  mem- 
brane proteins  that  play  a  key  role  in  synaptic 
function — the  sodium-dependent  neurotransmit- 
ter transporters.  For  precise  and  efficient  neuro- 
transmission, the  release  of  the  neurotransmitters 
into  the  synaptic  cleft  must  parallel  the  rise  and 
fall  of  presynaptic  excitation.  Inactivation  of  the 
majority  of  classic  neurotransmitters  usually  is 
achieved  by  rapid  reuptake  into  the  presynaptic 
terminal  or  surrounding  glial  cells  by  specific, 
high-affinity  neurotransmitter  transporters.  Neu- 
rotransmitter entry  is  coupled  directly  to  the 
transmembrane  sodium  gradient,  which  drives 
the  uphill  transport.  Neurotransmitter  transport- 
ers not  only  have  a  central  role  in  synaptic  trans- 
mission, but  they  are  also  the  site  of  action  for  a 
wide  range  of  drugs  with  both  therapeutic  and 
abuse  potential.  Clinically  important  drugs  that 
act  directly  on  these  transporters  include  the  tri- 
cyclic antidepressants,  which  inhibit  both  norepi- 
nephrine and  serotonin  uptake;  amphetamines, 
which  inhibit  norepinephrine  and  dopamine 
transport;  and  cocaine,  which  inhibits  all  three 
uptake  systems.  The  impact  of  these  agents  on 
society  underscores  the  importance  of  under- 
standing the  molecular  basis  of  their  actions. 

Until  recently  little  structural  information  has 
been  available  about  the  proteins  responsible  for 
reuptake  of  these  neurotransmitters  at  synapses. 
Previously  we  used  Xenopus  oocytes  as  a  system 
in  which  to  express  four  major  classes  of  trans- 
port activity  in  the  brain — those  for  catechol- 
amine, indoleamine,  choline,  and  excitatory  and 
inhibitory  amino  acid  (L-glutamate,  7-aminobu- 


tyric  acid  [GABA],  and  glycine)  transport.  After 
injection  of  mRNA  prepared  from  various  brain 
regions,  uptake  of  the  radiolabeled  transmitters 
can  be  measured  and  the  ion-dependence  and 
pharmacologic  specificity  of  these  transport  ac- 
tivities can  be  assessed  in  single  oocytes.  These 
and  other  studies,  which  confirmed  that  the  trans- 
port activities  are  encoded  by  single  mRNAs, 
paved  the  way  for  expression  cloning  of  the  genes 
encoding  these  carriers. 

This  year  we  utilized  a  mammalian  cell-based 
expression  system  to  clone  a  human  cDNA  encod- 
ing a  catecholamine  transporter.  The  transport  ac- 
tivity encoded  by  this  cDNA  has  the  substrate- 
specificity  and  pharmacologic  properties  of  a 
norepinephrine  transporter,  including  sensitivity 
to  antidepressants,  amphetamine,  and  cocaine. 
The  single  polypeptide  contains  regions  impor- 
tant for  binding  and  transport  inhibition  by  co- 
caine and  thus  provides  a  first  structural  insight 
into  the  family  of  cocaine  receptors.  The  cDNA 
sequence  predicts  a  protein  of  617  amino  acids, 
with  multiple  hydrophobic  regions  consistent 
with  the  presence  of  1 2  membrane-spanning  do- 
mains. The  amino  acid  sequence  has  significant 
homology  with  a  recently  cloned  GABA  trans- 
porter and  thus  identifies  a  new  family  of  brain 
transport  proteins.  Studies  are  under  way  to  de- 
fine and  characterize  other  members  of  this  car- 
rier family. 

The  availability  of  a  human  norepinephrine 
transporter  cDNA  also  provides  an  opportunity  to 
investigate  the  contributions  of  structural  ele- 
ments to  transporter  function  and  regulation. 
Many  tricyclic  antidepressants  accomplish  their 
therapeutic  effects  by  blockade  of  reuptake 
of  norepinephrine  transporters  and  subsequent 
elevation  of  synaptic  neurotransmitter  concen- 
trations. Delineation  of  the  structural  basis  of  an- 
tidepressant binding  and  function  through  muta- 
genesis studies  of  the  cloned  transporter  should 
aid  in  the  development  of  more-selective  thera- 
peutic agents  for  the  treatment  of  human  depres- 
sion. The  availability  of  the  human  cDNA  encod- 
ing the  norepinephrine  carrier  also  offers  the  first 
opportunity  to  determine  whether  alterations  in 
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transporter  genes  underlie  major  psychiatric  af- 
fective disorders. 

Other  ongoing  research  in  the  laboratory  has 
focused  on  genes  encoding  another  class  of  im- 
portant synaptic  molecules,  neuropeptides.  Us- 
ing the  calcitonin/CGRP  (calcitonin  gene-regu- 
lated peptide)  gene  family  as  a  model  system,  we 
are  investigating  the  mechanisms  that  regulate 
the  production  and  structural  diversity  of  these 
peptide  neurotransmitters.  The  peptides  of  the 
calcitonin/CGRP  family  are  produced  in  discrete 
sets  of  neurons  in  the  nervous  system.  We  are 
looking  for  the  specific  signal  sequences  within 
the  genes  for  these  peptides  that  allow  the  gene 
to  be  turned  on  in  these  sets  of  neurons.  In  addi- 
tion we  are  determining  which  sequences  are  in- 
volved in  regulating  the  amount  of  neuropeptide 
mRNA  produced.  Once  these  specific  control  se- 
quences are  identified,  our  next  step  is  to  search 
for  the  protein  factors  that  recognize  the  se- 
quences and  influence  gene  expression.  Thus  we 
can  use  a  molecular  biologic  approach  to  charac- 
terize the  individual  steps  that  combine  to  pro- 
duce complex  neurobiologic  processes. 

The  diversity  of  products  expressed  from  the 
calcitonin/CGRP  gene  family  depends  not  only 
on  the  presence  of  multiple,  independently  regu- 


lated genes  but  also  on  the  existence  of  alterna- 
tive, tissue-specific  pathways  for  pre-mRNA 
processing.  This  processing  is  an  important  mech- 
anism in  the  control  of  the  expression  of  RNA 
products  from  many  eukaryotic  genes.  Numerous 
gene  products  in  the  nervous  system  are  encoded 
within  complex  transcription  units  that  generate 
multiple  mRNAs  by  altering  where  in  the  gene 
the  transcripts  initiate,  how  they  are  spliced,  or 
how  they  terminate.  In  the  nervous  system,  these 
processes  may  be  particularly  important,  because 
they  increase  the  diversity  of  expression  by  al- 
lowing multiple  products  to  be  encoded  by  a  sin- 
gle gene.  We  would  like  to  know  how  the  same 
nuclear  RNA  transcripts  can  be  processed  in  alter- 
native ways  in  diff'erent  cells  to  generate  distinct 
products.  One  potential  determinant  of  a  process- 
ing choice  is  found  within  the  primary  RNA  tran- 
script, either  as  a  specific  sequence  or  secondary 
structural  feature.  A  second  determinant  is  found 
in  the  enzymatic  machinery  that  recognizes  these 
structural  features  and  targets  the  transcript  to  a 
particular  processing  pathway.  We  have  contin- 
ued to  generate  a  variety  of  mutated  and  hybrid 
genes  and,  by  introducing  these  constructs  into  a 
variety  of  cell  types,  have  identified  regions  criti- 
cal for  splicing  regulation. 
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WE  are  interested  in  liow  different  types  of 
nerve  cells  are  generated  during  the  devel- 
opment of  the  vertebrate  nervous  system.  We 
have  chosen  to  work  on  the  peripheral  auto- 
nomic nervous  system,  which  is  simpler  and 
more  accessible  than  the  brain.  Our  studies  have 
focused  on  the  development  of  two  specific  cell 
types:  the  sympathetic  neurons  that  lie  in  a  chain 
of  ganglia  along  the  spinal  cord,  and  the  chromaf- 
fin cells  of  the  adrenal  medulla.  These  two  cells 
are  closely  related,  yet  distinct  in  major  respects. 
The  former  are  true  neurons,  with  long  branching 
axons  and  dendrites  that  send  and  receive  electri- 
cal signals.  The  latter  are  small,  round  secretory 
cells  that  release  epinephrine  (adrenaline)  into 
the  bloodstream  during  fright  or  excitement. 

Studies  in  a  number  of  laboratories,  including 
our  own,  have  established  that  these  two  cell 
types  derive  from  a  common  progenitor  cell.  This 
cell  arises  on  top  of  the  neural  tube  (the  develop- 
ing spinal  cord),  as  part  of  a  transient  structure 
called  the  neural  crest.  Like  parachutists,  the 
neural  crest  cells  peel  off  the  neural  tube  in  a 
wave  and  migrate  downward  through  the  em- 
bryo. Some  of  them  arrest  their  migration  in  a 
chain  of  small  clumps  along  a  blood  vessel, 
where  they  eventually  become  sympathetic  neu- 
rons. Others  continue  their  migration  downward 
to  invade  the  developing  adrenal  gland,  where 
they  become  chromaffin  cells. 

Using  a  fluorescence-activated  cell  sorter  and 
specific  monoclonal  antibodies  to  tag  the  cells, 
we  have  succeeded  in  isolating  chromaffin  cell 
precursors  from  the  fetal  adrenal  gland  of  the  rat. 
By  manipulating  the  culture  environment  of  the 
cells,  we  have  shown  that  these  precursors  have 
two  possible  developmental  fates:  if  glucocorti- 
coid hormones  are  added  to  the  growth  medium, 
mimicking  the  environment  of  the  adrenal  gland, 
the  precursors  develop  into  chromaffin  cells.  If, 
on  the  other  hand,  fibroblast  growth  factor  (FGF) 
and  nerve  growth  factor  (NGF)  are  added  to  the 
medium,  the  precursors  develop  into  sympa- 
thetic neurons.  This  indicates  that  the  fate  of 
these  cells  is  determined  in  large  part  by  signals 
in  the  environments  to  which  they  migrate.  How- 


ever, these  precursors  seem  to  have  lost  the  abil- 
ity to  give  rise  to  some  other  derivatives  of  the 
neural  crest,  such  as  glial  cells.  Therefore,  while 
these  precursor  cells  have  a  choice  of  fate,  it  is  a 
restricted  one. 

Immortalization  of  the  Chromaffin-Neuron 
Precursor  Cell 

The  precursor  cell  is  endowed  with  a  limited 
repertoire  of  potential  developmental  fates.  What 
genes  and  proteins  determine  the  specific  reper- 
toire of  possible  fates?  What  genes  and  proteins 
select  a  particular  fate  for  actual  expression? 
These  are  the  major  questions  we  are  pursuing  in 
an  effort  to  understand  the  molecular  biology  of 
this  developmental  system. 

One  problem  in  studying  the  chromaffin- 
neuron  precursor  cell  at  the  molecular  level  is 
the  small  number  of  cells  that  can  be  isolated 
from  rat  fetuses.  To  circumvent  this  problem,  we 
have  applied  recently  developed  techniques  to 
immortalize  the  cells.  Using  a  defective  retrovi- 
rus as  a  "disposable  molecular  syringe,"  we  have 
injected  the  cells  with  a  gene,  y-myc,  that  allows 
them  to  divide  forever  in  the  culture  dish.  In  this 
way  we  can  generate  an  endless  supply  of  cells 
that  can  be  used  for  experiments  at  any  time, 
without  the  need  to  perform  long  hours  of  dissec- 
tion. Fortunately,  these  immortalized  precursor 
cell  lines  still  appear  capable  of  undergoing  dif- 
ferentiation into  sympathetic  neurons  when  ex- 
posed to  FGF  and  NGF. 

Neural  Development  in  Mammals  and 
Drosophila  Uses  Similar  Regulatory 
Molecules 

We  have  used  the  chromaffin-neuron  precursor 
cell  lines  to  isolate  new  genes  that  may  be  candi- 
dates for  controlling  the  development  of  these 
cells.  This  approach  involves  first  guessing  that  a 
particular  gene,  or  class  of  genes,  may  be  ex- 
pressed in  the  precursor  cell  and  then  using  the 
tools  of  molecular  biology  to  extract  the  gene  of 
interest.  What  guidelines  can  one  use  to  make 
educated  guesses  about  developmental  control 
genes?  One  suggestion  came  from  studies,  by 


15 


Control  of  Cell  Fate  During  Vertebrate  Neuronal  Development 


other  investigators,  which  showed  that  embry- 
onic control  genes  identified  by  mutational  analy- 
sis in  the  fruit  fly  Drosophila  have  counterparts 
in  the  mouse  and  human  that  are  remarkably  con- 
served in  their  structure.  We  therefore  reasoned 
by  analogy  that  Drosophila  genes  controlling 
neural  development  might  also  be  conserved  in 
mammals. 

One  important  set  of  genes  that  control  neuro- 
nal development  in  Drosophila  are  those  of  the 
achaete-scute  complex  (AS-C).  These  scute 
genes  encode  a  group  of  related  proteins  that  act 
by  binding  to  DNA,  thereby  controlling  the  activ- 
ity of  other  genes.  If  the  function  of  these  genes  is 
eliminated  by  mutation  in  the  fly,  certain  classes 
of  neurons  fail  to  form.  We  succeeded  in  isolating 
two  scM^e-related  genes  from  the  rat  chromafiin- 
neuron  precursor  cell  line.  The  structures  of 
these  genes  are  remarkably  similar  to  those  of 
their  fruit  fly  counterparts.  Moreover,  the  rat 


scM^e-related  genes  appear  to  be  expressed  specif- 
ically in  neuronal  precursor  cells,  like  their 
counterparts  in  the  fly.  These  data  indicate  that 
there  has  been  a  striking  parallel  conservation  of 
gene  structure  and  cell  type  specificity  during 
evolution.  They  further  suggest  that  the  scute- 
related  genes  may  control  the  development  of 
mammalian  neurons.  This  discovery  opens  up  a 
new  line  of  investigation,  which  will  be  aimed  at 
finding  both  the  precise  role  of  the  scute-veldxed 
genes  during  neuronal  development  and  the 
manner  in  which  these  genes  are  controlled. 
These  exciting  findings  suggest  that  the  molecu- 
lar mechanisms  controlling  nerve  cell  develop- 
ment in  vertebrate  and  invertebrate  organisms 
may  be  fundamentally  similar.  Studies  in  each 
system  will  therefore  contribute  to  our  under- 
standing of  the  other,  leading  to  a  unified  view  of 
the  basic  principles  that  guide  the  building  of  a 
nervous  system. 
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Confocal  microscopic  localization  of  the  JR-8D2  epitope  to  specific  fascicles  within  the  major 
nerve  tracts  of  Drosophila  melanogaster.  Neuronal  membranes  are  shown  in  red  with  the  staining 
from  the  fR-8D2  antibody  shown  in  green.  The  protein  recognized  by  fR-8D2  exhibits  an  exquisite 
localization  to  both  neurons,  which  pioneer  the  peripheral  nervous  system,  and  to  specific  fasci- 
cles in  the  central  nervous  system. 
From  Rothberg,  f.M.  1991.  Ph.D.  thesis,  Yale  University. 
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A fundamental  issue  in  the  development  of 
multicellular  organisms  is  how  an  individ- 
ual cell  acquires  a  specific  developmental  fate. 
The  molecular  rules  directing  neighboring  cells 
into  developmentally  discrete  paths  are  un- 
known. With  a  particular  interest  in  how  these 
rules  apply  to  the  nervous  system,  we  have  been 
studying  the  molecular  biology  of  early  neuro- 
genesis in  the  fruit  fly  Drosophila. 

The  central  nervous  system  in  Drosophila  de- 
rives from  a  set  of  precursor  cells,  the  neuro- 
blasts, which  segregate  from  the  epidermal  pre- 
cursors, the  dermoblasts,  in  the  very  early 
ectoderm.  For  the  past  several  years,  we  have 
been  analyzing  a  group  of  six  zygotically  acting 
genes  that  are  involved  in  this  process.  Known  as 
neurogenic  loci,  these  are  Delta  (Dl),  Enhancer 
of  split  [E(spl)],  mastermind  {mam),  big  brain 
{bib},  neuralized  (neu),  and  Notch  (N).  Muta- 
tions in  any  one  of  these  can  result  in  the  misrout- 
ing  of  epidermal  lineage  into  a  neuronal  develop- 
mental pathway. 

The  first  neurogenic  locus  to  be  characterized 
in  some  detail  was  Notch.  We  found  that  it  codes 
for  a  transmembrane  protein  homologous  to  the 
mammalian  epidermal  growth  factor  (EGF),  im- 
plying an  involvement  in  cell  surface  events.  In- 
deed, our  work  and  that  of  others — embryologi- 
cal  and  molecular  studies  of  some  of  the  other 
neurogenic  loci — strengthened  that  notion  and 
led  to  the  hypothesis  that  neuroblast  segregation 
depends  on  cell  interactions,  with  at  least  some 
of  the  neurogenic  loci  coding  for  proteins  in- 
volved in  intercellular  signaling. 

In  studying  the  cellular  mechanisms  in  which 
Notch  is  integrated,  we  have  been  following  two 
major  experimental  paths.  On  the  one  hand,  we 
are  exploring  the  cell  biology  of  Notch  and  the 
nature  of  its  genetic  interactions;  on  the  other, 
we  are  trying  to  gain  insights  into  the  functional 
meaning  of  certain  sequence  motifs  found  in  the 
Notch  protein.  For  instance,  we  are  characteriz- 
ing other  genes  sharing  the  EGF  sequence  motif 
with  Notch.  In  addition,  during  the  past  year,  we 
initiated  a  molecular  and  genetic  analysis  of  cer- 
tain nonneurogenic  genes  that  play  a  role  in  the 
differentiation  of  the  ectoderm. 


We  started  this  analysis  with  the  gene  fizzy 
(fzy) .  Mutations  of  this  gene  aff^ecting  the  pheno- 
type  have  suggested  that  it  functions  to  promote 
subsequent  differentiation  in  response  to  the 
postulated  "epidermalizing"  component  of  the 
signal  transmitted  by  the  neurogenic  gene  prod- 
ucts. As  such,  the  fzy  protein  has  a  pivotal  role  in 
implementing  the  response  of  presumptive  der- 
moblasts to  developmental  cues  transmitted  by 
those  products.  We  have  been  analyzing  the  ge- 
netic and  phenotypic  behavior  of  fzy  mutations 
and  have  initiated  its  molecular  analysis  by  the 
cloning  of  genomic  sequences  from  the  fzy 
region. 

In  an  attempt  to  examine  the  complexity  of  the 
genetic  circuitry  in  which  Notch  is  integrated, 
and  to  identify  genes  whose  products  may  di- 
rectly interact  with  the  Notch  protein,  we  de- 
signed a  genetic  screen  aimed  at  identifying  sup- 
pressors of  certain  Notch  mutations  associated 
with  the  gene's  extracellular  EGF  homologous 
domain.  This  screen  has  led  to  the  identification 
of  a  surprisingly  restricted  set  of  interacting  loci. 
These  include  the  neurogenic  genes  Delta, 
which  also  codes  for  a  transmembrane  protein 
with  EGF  homology,  and  mastermind.  In  addi- 
tion, a  third  gene  was  shown  capable  of  acting  as 
a  suppressor.  We  identified  this  gene  to  be  the 
deltex  locus,  hitherto  unlinked  with  neurogenic 
gene  action.  We  have  further  characterized  the 
phenotype  of  deltex  mutations  and  have  demon- 
strated allele-specific  interactions  between  del- 
tex and  Notch  alleles. 

To  investigate  the  possibility  of  intermolecular 
association  between  the  products  of  Notch  and 
Delta  genes,  we  have  examined  the  effects  of 
their  expression  on  aggregation  in  Drosophila  S2 
cells.  We  find  that  A^o^c^-expressing  cells,  by  a 
calcium-dependent  process,  form  mixed  aggre- 
gates specifically  with  cells  that  express  Delta. 
Furthermore,  we  have  determined  that  Notch  and 
Delta  proteins  interact  via  their  extracellular 
domains. 

Some  of  the  first  phenotypic  interactions  to  be 
described  between  neurogenic  loci  were  those 
involving  Notch  and  Enhancer  of  split.  Molecu- 
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lar  and  genetic  analysis  of  E( spl )  revealed  that 
the  locus  is  defined  by  several  transcription  units. 
Focusing  mostly  on  one  of  those  transcripts,  we 
have  show^n  that  it  codes  for  a  protein  with  strik- 
ing homology  to  the  bovine  /8-transducin,  known 
to  be  involved  in  signal  transduction.  We  found 
that'point  mutations  in  the  protein  are  capable  of 
dramatic  synergistic  interactions  with  mutations 
affecting  the  intracellular  region  of  Notch.  We 
were  rather  surprised  to  find  the  subcellular  lo- 
calization of  the  E(spl)  transducin  homologous 
protein  in  the  nucleus,  mastermind,  which  also 
interacts  with  Notch,  seems  to  code  for  a  nuclear 
protein  as  well.  This  raises  the  intriguing  possibil- 
ity that  Notch  communicates  with  the  nucleus 
relatively  directly. 

While  examining  identified  interactions  be- 
tween Notch  and  other  genes,  we  are  studying 
two  more,  Serrate  and  slit.  The  Serrate  locus 
captured  our  attention  by  virtue  of  the  synergistic 
genetic  interactions  we  noted  between  certain  al- 
leles of  Serrate  and  Notch.  We  were  particularly 
intrigued  to  find,  after  cloning  Serrate,  that  it  en- 
codes a  protein  product  of  1,404  amino  acids 
with  a  single  transmembrane  domain  and  1 4  EGF- 
like  repeats  in  the  extracellular  region.  Thus 
Serrate  represents  another  member  of  the  group 
of  EGF-containing  loci  in  Drosophila  and  pro- 
vides us  with  the  reasonable  working  hypothesis 
that  Notch  and  Serrate  may  interact  at  the  protein 
level. 

To  gain  more  insight  into  the  role  of  the  EGF 
sequence  motif  in  extracellular  interactions  and 
morphogenetic  events,  we  have  undertaken  a  mo- 
lecular and  biochemical  characterization  of  slit. 
This  gene  codes  for  a  secreted  protein  with  se- 
quences homologous  to  EGF  and  to  a  superfamily 


of  extracellular  matrix-binding  glycoproteins. 
Its  embryonic  localization,  mutant  phenotype, 
and  sequence  homology  suggest  that  it  mediates 
interaction  among  glial  cells,  axons,  and  the  ex- 
tracellular environment.  The  interactions  in- 
clude those  required  for  developing  cells  that  are 
necessary  for  axon  pathway  formation  and 
selection. 

In  conclusion,  work  in  the  past  year  extended 
our  knowledge  of  the  molecular  rules  underlying 
certain  early  neural  differentiation  events.  Analy- 
sis indicates  that  we  have  identified  elements  of  a 
specific  signal  transduction  mechanism  control- 
ling the  correct  segregation  of  neuroblasts  and 
involving  extracellular  signals  together  with  cy- 
toplasmic and  nuclear  components.  Moreover,  it 
seems  clear  that  this  interaction  mechanism  is  not 
exclusively  involved  in  early  neurogenesis  but  is 
used  in  various  developmental  times  and  tissues 
and  results  in  the  fine  tuning  of  the  differentia- 
tion of  certain  tissues,  including  that  of  the  ner- 
vous system. 

In  addition,  the  work  carried  out  this  year  de- 
fined new  questions  and  directions.  Most  signifi- 
cantly, we  have  started  to  examine  the  analogies 
that  may  exist  between  the  invertebrate  experi- 
mental model  and  vertebrates.  This  will  not  only 
allow  for  interesting  comparisons,  but  it  will  also 
permit  us  to  exploit  the  experimental  advantages 
offered  by  the  sophisticated  vertebrate  tissue  cul- 
ture systems.  We  will  be  able  to  address  a  variety 
of  cell  biological  questions  related  to  the  molecu- 
lar mechanism  within  which  Notch  and  the  other 
neurogenic  genes  are  integrated.  As  a  first  step  we 
have  isolated  the  human  Notch  homologue  and 
found  it  to  be  remarkably  similar  to  its  inverte- 
brate counterpart. 
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MANY  poorly  understood  but  relatively  com- 
mon and  serious  human  diseases  involve 
aberrations  of  the  immune  system. 

The  function  of  the  immune  system  is  to  recog- 
nize microorganisms  and  eliminate  them  from 
the  host.  A  major  means  of  accomplishing  this  is 
through  the  synthesis  of  antibodies  and  comple- 
ment components  that  attach  to  the  foreign  sub- 
stance. These  proteins  circulate  in  blood  and  are 
either  formed  in  response  to  the  organism  (anti- 
bodies) or  present  at  all  times  (complement) . 

The  work  in  this  laboratory  centers  on  the  com- 
plement system.  Complement  consists  of  pro- 
teins that  interact  with  each  other  in  a  reaction 
resembling  a  cascade  or  waterfall:  one  activates 
the  next.  The  activated  molecules  that  result 
from  this  destroy  the  invading  microbe  and  pro- 
mote the  inflammatory  response.  The  comple- 
ment system  is  a  powerful,  swift,  and  highly  ef- 
fective means  to  fight  infection.  As  might  be 
anticipated,  an  inherited  deficiency  of  a  comple- 
ment component  predisposes  an  individual  to 
infections. 

The  production  of  antibodies  is  triggered  by 
the  invasion  of  the  body  by  a  foreign  substance. 
The  binding  of  antibodies  to  a  target  such  as  a 
bacterium  initiates  a  series  of  reactions  in  which 
complement  proteins  swarm  onto  the  surface  of 
the  microbe.  Such  components  serve  as  a  ligand 
for  complement  receptors  on  blood  cells.  A  for- 
eign particle  soon  becomes  adherent  to  and  then 
ingested  by  these  blood  cells.  During  comple- 
ment activation,  small  fragments  (peptides)  that 
promote  the  inflammatory  response  are  liberated 
from  the  complement  proteins.  These  molecules 
dilate  blood  vessels  and  summon  scavenger  cells, 
called  phagocytes,  from  the  bloodstream.  The 
phagocytic  white  blood  cells,  upon  arrival  at  the 
site,  find  organisms  that  are  already  prepared  for 
ingestion,  because  they  are  coated  with  antibody 
and  complement.  This  phenomenon  is  known  as 
opsonization  (from  the  Greek  word  opsonein,  to 
prepare  for  the  table).  As  a  result,  the  infection  is 
sequestered  and  eliminated  via  phagocytosis. 

Sometimes,  however,  the  immune  system,  in- 
stead of  synthesizing  antibodies  to  foreign  mate- 


rials, seemingly  makes  a  mistake  and  produces 
antibodies  that  react  with  its  own  cells.  For  exam- 
ple, in  certain  immune  disorders,  the  individual 
makes  antibodies  to  his  or  her  own  red  blood 
cells.  Antibody  and  complement  then  attach  to 
the  red  blood  cells  and  destroy  them.  Such  condi- 
tions are  known  as  autoimmune  diseases,  in  this 
case  autoimmune  hemolytic  anemia. 

In  other  pathologic  conditions,  the  immune 
system  does  not  efficiently  eliminate  the  in- 
fectious particles,  and  excessive  quantities  of  im- 
mune complexes  form.  Immune  complexes  con- 
sist of  antibodies,  complement  proteins,  and  the 
foreign  particle.  These  immune  complexes  are 
proinflammatory  and  cause  tissue  damage  if  de- 
posited in  undesirable  locations  such  as  joints, 
skin,  and  kidney,  leading  to  arthritis,  dermatitis, 
and  glomerulonephritis,  respectively. 

Thus  there  are  two  general  ways  in  which  the 
humoral  immune  system  can  damage  its  own  tis- 
sue. The  first  is  to  produce  antibodies  to  self  com- 
ponents. The  second  is  to  form  excessive 
amounts  of  immune  complexes. 

Our  goal  is  to  understand  how  immune  com- 
plexes form  and  are  processed.  We  are  studying  a 
biologic  mechanism  that  evolved  to  remove  im- 
mune complexes  from  the  circulation.  This  pro- 
cess helps  prevent  the  pathologic  accumulation 
of  these  complexes  in  tissue.  This  system  can  be 
likened  to  an  "inner  space  shuttle."  The  physio- 
logic vehicle  for  this  journey  is  the  red  blood 
cell.  The  red  blood  cell  participates  in  this  reac- 
tion through  a  complement  receptor  protein  on 
its  surface.  These  receptor  proteins  latch  onto 
complement-coated  foreign  particles,  such  as  vi- 
ruses and  bacteria.  As  the  circulating  red  blood 
cell  passes  through  the  liver  or  spleen,  its  im- 
mune cargo  is  released  and  metabolized.  The  red 
blood  cell  then  returns  to  the  circulation  ready  to 
ferry  another  load. 

Our  laboratory  is  studying  the  complement  re- 
ceptor involved  in  this  process.  We  are  also  exam- 
ining the  complement  proteins  that  swarm  onto 
the  foreign  panicles.  Furthermore,  we  are  inves- 
tigating certain  "housekeeping"  proteins  of  the 
complement  system.  Because  of  the  powerful  de- 
structive capabilities  of  the  complement  system, 
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it  is  perhaps  no  surprise  that  the  body  must  keep 
it  tightly  regulated.  Special  proteins  are  synthe- 
sized to  protect  the  body's  own  cells  from  dam- 
age by  complement  factors.  Our  laboratory  dis- 
covered a  new  family  of  genes  that  encode  for  at 
least  six  complement  receptor  and  regulatory 
proteins.  Two  of  these  regulatory  proteins  occur 
on  almost  all  cells  of  the  body.  Recently  these 
regulatory  proteins  have  been  demonstrated  to  be 
expressed  in  relatively  high  concentrations  on  re- 
productive tissue,  including  placental  tissue  and 
sperm.  A  new  direction  for  the  laboratory  con- 
cerns the  role  of  these  proteins  in  reproduction. 
Modulating  the  function  of  these  regulatory  pro- 


teins may  also  be  important  in  improving  means 
to  kill  tumor  cells  or  in  allowing  xenografts  to 
survive  in  humans. 

Because  infectious,  autoimmune,  and  immune 
complex-mediated  illnesses  result  from  aberra- 
tions of  the  complement  system,  our  research  ef- 
forts are  helping  to  define  the  pathophysiologic 
basis  of  such  diseases.  In  many  autoimmune  and 
immune  complex-mediated  diseases,  there  is  an 
inherited  defect  in  the  handling  of  immune  com- 
plexes or  in  the  activation  or  regulation  of  the 
complement  system.  Variations  in  the  structure, 
function,  and  expression  of  complement  proteins 
are  important  aspects  of  autoimmune  diseases. 
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ALL  cells  communicate  with  their  environ- 
ment by  the  interaction  of  exogenous  mole- 
cules with  receptor  molecules  on  the  cell  sur- 
face. This  activates  one  of  many  possible 
intracellular  signaling  pathways  by  which  infor- 
mation from  the  environment  is  decoded  to  elicit 
an  appropriate  response  by  individual  cells  or  by 
the  organism.  Olfaction  provides  a  particularly 
clear  example  of  the  organism's  ability  to  recog- 
nize and  discriminate  a  vast  array  of  environmen- 
tal signals,  in  this  case  the  molecules  perceived  as 
odors. 

The  cellular  immune  system  provides  an  exam- 
ple of  the  organism's  ability  to  respond  to  diverse 
foreign  substances  within  the  internal  environ- 
ment. These  two  systems  pose  the  problem  of 
how  cells  encode  sufficient  information  to  recog- 
nize and  respond  to  a  highly  diverse  array  of  sig- 
naling molecules.  Our  laboratory  has  addressed 
these  problems  by  analyzing  the  structure  and 
function  of  the  genes  encoding  the  specific  re- 
ceptor molecules  in  the  immune  system,  the  ol- 
factory system,  and  the  brain. 

The  Molecular  Biology  of  Smell 

Vertebrates  have  evolved  an  extremely  sensi- 
tive mechanism  to  detect  and  discriminate  a  rich 
diversity  of  odors.  The  detection  of  chemically 
distinct  odors  results  from  the  specific  associa- 
tion of  odorous  signaling  molecules  with  discrete 
receptors  on  olfactory  neurons.  These  neurons, 
which  reside  within  the  epithelium  of  the  nose, 
then  transmit  the  information  directly  to  the 
brain.  Thus  the  detection  of  odors  is  restricted  to 
the  olfactory  neuron,  whereas  the  discrimination 
of  distinct  odors  requires  neural  processing  in  the 
olfactory  bulb  and  cerebral  cortex. 

To  address  the  question  of  how  the  organism  is 
capable  of  recognizing  more  than  10,000  dis- 
crete odors,  we  have  recently  cloned  a  large  fam- 
ily of  genes  likely  to  encode  an  extremely  large 
number  of  olfactory  receptors.  We  have  isolated  a 
multigene  family  encoding  a  vast  array  of  recep- 
tor molecules  that  traverse  the  membrane  seven 


times.  Expression  of  this  gene  family  is  restricted 
to  the  olfactory  epithelium. 

The  family  comprises  more  than  200  members, 
representing  one  of  the  largest  gene  families  in 
the  eukaryotic  chromosome.  All  the  genes  are 
characterized  by  sequence  and  structural  motifs 
shared  by  all  members  of  the  gene  family,  but  the 
individual  genes  each  exhibit  regions  of  signifi- 
cant sequence  divergence.  Analysis  of  the  se- 
quences of  the  members  of  the  family  of  putative 
olfactory  receptors  provides  insight  into  how  the 
family  has  been  generated  in  evolutionary  time 
and  how  these  genes  diverge  to  accommodate  rec- 
ognition of  a  large  array  of  odorants. 

Isolation  of  the  family  of  genes  encoding  the 
receptor  molecules  immediately  provides  one 
solution  to  the  problem  of  olfactory  perception: 
How  do  we  recognize  so  many  odors?  On  one 
extreme,  we  could  argue  that  the  recognition  of 
diverse  odorants  could  be  accomplished  by  a 
small  number  of  promiscuous  receptors  each  ca- 
pable of  interaction  with  several  structurally  dis- 
tinct odor  molecules.  Alternatively,  olfactory 
perception  could  result  from  a  large  number  of 
different  receptor  molecules  each  capable  of  in- 
teracting with  one  or  a  small  number  of  specific 
odorants.  The  size  of  the  gene  family  we  have 
characterized  suggests  that  there  are  indeed  a 
very  large  number  of  odorant  receptors,  each  ca- 
pable of  interacting  with  a  small  number  of  odor- 
ous ligands. 

These  observations  are  in  sharp  contrast  to 
other  sensory  systems,  such  as  vision  or  touch, 
where  discrimination  of  sensory  information  is 
accomplished  by  a  rather  small  number  of  recep- 
tor modalities. 

In  a  separate  series  of  experiments,  we  have 
asked  how  the  association  of  an  odorant  with  its 
receptor  can  generate  an  action  potential  trans- 
mitted to  the  brain.  Recent  evidence  from  other 
laboratories  suggests  that  olfactory  receptors  ac- 
tivate second  messenger  systems,  leading  to  an 
elevation  in  intracellular  cyclic  nucleotides.  We 
and  others  have  cloned  a  gene  encoding  a  cyclic, 
nucleotide-gated  ion  channel  that  is  unique  to 
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olfactory  sensory  neurons.  This  channel  traverses 
the  membranes  six  times,  but  also  contains  a  do- 
main within  the  cytoplasm  of  the  cell  that  binds 
cyclic  nucleotides.  The  interaction  of  odors  with 
their  specific  receptors  results  in  an  elevation  in 
cAMP,  which  in  turn  leads  to  the  opening  of  this 
channel. 

The  demonstration  that  the  olfactory  epithe- 
lium expresses  a  unique  ion  channel  controlled 
by  intracellular  levels  of  cyclic  nucleotides  now 
provides  a  mechanism  linking  odorant-elicited  in- 
creases in  cyclic  nucleotides  with  the  generation 
of  an  electrical  signal  transmitted  to  the  brain. 
The  cloning  of  a  highly  diverse  multigene  family 
of  odor  receptors,  along  with  an  ion  channel  re- 
sponsive to  intracellular  changes  elicited  by  the 
receptors,  should  provide  further  insight  into  the 
recognition  mechanisms  and  neural  processing 
events  that  permit  the  discrimination  of  a  diverse 
array  of  odors. 

Diversity  of  Recognition  in  the  Cellular 
Immune  System 

The  cellular  immune  system  must  recognize 
and  respond  to  an  even  greater  diversity  of  signal- 
ing molecules  in  the  form  of  foreign  antigens. 
This  diversity  is  encoded  by  an  extremely  large 
family  of  T  cell  receptor  genes.  Although  thymo- 
cytes express  a  T  cell  antigen  receptor,  T  cells 
segregate  into  discrete  functional  classes.  One 
set,  T  helper  cells,  initiate  an  immune  response 
to  foreign  antigen  by  activating  other  T  cells.  The 
second  subset,  T  killer  cells,  respond  to  foreign 
antigen  by  the  elaboration  of  cytotoxic  functions 


resulting  in  the  death  of  cells  expressing  an 
antigen. 

Most  T  helper  cells  express  an  additional  sur- 
face membrane  receptor,  the  CD4  molecule; 
most  T  killer  cells  express  a  different  but  related 
receptor,  the  CDS  molecule.  We  have  cloned  the 
genes  encoding  these  two  receptors  and  charac- 
terized their  functions.  Our  studies  and  others 
indicate  that  the  coordinate  engagement  of  a  T 
cell  antigen  receptor  (TCR)  and  a  CD4  receptor 
with  molecules  on  the  antigen-bearing  target  cell 
is  required  for  efficient  T  cell  help.  The  coordi- 
nate engagement  of  a  TCR  and  CDS  appears  to  be 
required  for  efficient  T  cell  killing. 

We  have  further  demonstrated  that  the  T  cell 
surface  receptors,  CD4  and  CDS,  are  not  only  re- 
quired for  the  mediation  of  an  efficient  cellular 
immune  response,  but  are  essential  for  shaping 
the  repertoire  of  T  cells  during  development.  We 
have  introduced  an  exogenous  CDS  gene  into 
mouse  embryos  to  generate  transgenic  mice  that 
express  a  CDS  gene  on  all  T  cells.  Analysis  of  the 
T  lymphocytes  within  the  thymus  and  periphery 
of  these  mice  leads  to  a  model  of  T  cell  develop- 
ment in  which  the  interaction  of  either  CD4  or 
CDS  with  appropriate  molecules  on  the  surface 
of  cells  within  the  thymic  epithelium  is  a  primary 
determinant  of  whether  a  cell  will  be  either  a 
helper  or  a  killer  cell. 

Thus  our  studies  on  the  structure  and  function 
of  receptors  from  the  immune  system,  olfactory 
system,  and  brain  provide  examples  of  how  the 
eukaryotic  chromosome  can  generate  the  diver- 
sity required  to  recognize  and  respond  to  a  vast 
array  of  signaling  molecules  in  the  environment. 
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VERY  little  is  known  about  the  genetic  control 
of  mammalian  development.  But  embryo- 
genesis  of  all  mammals  follows  a  similar  plan,  and 
the  basic  rules  discovered  in  one  species  are 
likely  to  apply  to  others.  By  studying  the  mouse,  a 
species  in  which  the  early  embryo  can  be  ob- 
served and  manipulated,  we  will  better  under- 
stand how  genes  control  human  development 
and  how  disruption  of  these  processes  may  lead 
to  abnormalities  such  as  miscarriages  and  birth 
defects. 

In  organisms  traditionally  subject  to  experi- 
mental genetic  analysis,  like  fruit  flies  and  nema- 
todes, mutations  in  a  particular  developmental 
pathway  can  be  selected  in  a  comprehensive 
screening  experiment.  In  mice,  however,  this  ap- 
proach has  been  hampered  by  the  inability  to 
study  and  recover  conditional  mutations  and  by 
the  inefficiency  of  generating  new  mutations 
through  the  insertion  of  mobile  genetic  ele- 
ments. As  a  result,  much  of  our  insight  into  mam- 
malian developmental  genetics  comes  from  the 
study  of  preexisting  mutations.  My  laboratory  is 
studying  a  group  of  previously  identified  genes 
that  affect  development  around  the  time  of  im- 
plantation. In  addition,  we  are  developing  a  sys- 
tem to  allow  the  conditional  disruption  of  genes 
with  recessive  phenotypes  in  cell  culture  and 
transgenic  mice. 


Genomic  and  Functional  Characterization 
of  the  Mouse  Segmentation  Gene 
kreisler  (fer) 

The  mouse  kr  gene,  originally  recognized  by 
its  effects  on  inner  ear  development  and  craniofa- 
cial morphogenesis,  interferes  with  the  normal 
formation  of  metameric  units  in  the  developing 
hindbrain,  so-called  rhombomeres.  The  ;fer  muta- 
tion is  x-ray  induced  and  therefore  likely  to  repre- 
sent a  structural  rearrangement  that  alters  a  rela- 
tively large  region  of  the  chromosome.  Located 
within  two  map  units  of  ^rare  two  additional  loci 
that  affect  fundamental  aspects  of  peri-implanta- 
tion development:  brachypod  {bp),  which  pro- 
duces limb  reduction  abnormalities  similar  to 


those  seen  in  the  inherited  human  disease  Ose- 
bold-Remondini  syndrome,  and  agouti  (A), 
which  represents  at  least  three  mutations  that, 
when  homozygous,  are  lethal  at  or  around  the 
time  of  implantation. 

Toward  the  eventual  goal  of  isolation  of  these 
genes,  a  physical  map  of  mouse  chromosome  2 
surrounding  kr,  A,  and  bp  has  been  constructed 
with  a  cell  fusion  technique  called  radiation  hy- 
brid mapping.  This  technique,  developed  by  Da- 
vid Cox,  is  based  on  the  likelihood  that  pieces  of 
mouse  DNA  closely  linked  to  each  other  will  stay 
together,  or  "cosegregate,"  after  x-irradiation 
and  fusion  to  hamster  cells.  Our  results  have  led 
to  the  isolation  of  a  mouse/hamster  hybrid  cell 
line  that  contains  the  kr,  A,  and  bp  loci  along 
with  very  little  "extra"  mouse  DNA.  One  of  the 
mouse  DNA  fragments  present  in  this  hybrid  cell 
line,  closely  linked  to  the  mouse  Src  gene,  de- 
tects a  chromosomal  alteration  in  kr/kr  mice. 
The  alteration  involves  a  chromosomal  break  lo- 
cated approximately  100,000  base  pairs  away 
from  the  beginning  of  the  Src  gene,  but  does  not 
interrupt  the  Src  coding  sequences.  Small  DNA 
fragments  recovered  and  mapped  by  this  tech- 
nique are  now  being  used  to  reach  the  fer  gene  by 
the  serial  isolation  of  overlapping  cosmid  and 
yeast  artificial  chromosome  clones. 

To  characterize  further  the  relationship  be- 
tween rhombomere  formation  and  kr,  we  have 
studied  the  expression  of  a  group  of  genes,  the 
Hox-2  cluster,  thought  to  play  a  role  in  determin- 
ing the  identity  of  individual  rhombomeres.  Dif- 
ferent members  of  the  cluster  normally  exhibit 
anterior  borders  of  expression  in  adjacent  rhom- 
bomeres. In  collaboration  with  Michael  Frohman 
and  Gail  Martin,  we  have  shown  that,  in  kr/kr 
embryos,  several  Hox-2  family  members  are  ex- 
pressed in  the  "wrong"  rhombomere (s),  which 
suggests  that  the  kr  gene  product  may  affect 
neural  tube  segmentation  by  controlling  the  ex- 
pression of  Hox-2  genes.  By  isolating  the  /fergene 
and  further  characterizing  the  kr/kr  phenotype, 
we  hope  to  learn  more  about  the  molecular  path- 
ways of  mammalian  segmentation  and  the  role  of 
Hox-2  genes  in  rhombomere  morphogenesis. 


25 


Developmental  Genetics  in  the  Mouse  and  Human 


Conditional  Inactivation  of  Recessive  Genes 
in  Embryonic  Stem  Cells 

An  important  development  in  mammalian  ex- 
perimental embryology  has  been  the  ability  to 
isolate  embryonic  stem  cells  from  preimplanta- 
tion  mouse  embryos,  which  can  be  modified  in 
cell  culture  and  then  used  to  reconstitute  an  in- 
tact animal.  Like  the  whole  organism,  these  cells 
are  diploid  and  contain  two  copies  of  every  auto- 
somal gene.  When  embryonic  stem  cells  are 
mixed  with  a  fragment  of  "reporter"  DNA,  such 
as  the  coding  regions  of  the  neomycin  resistance 
gene  or  the  ^-galactosidase  gene,  insertion  of  the 
exogenous  DNA  provides  a  "gene  trap,"  in  which 
expression  of  the  reporter  sequences  is  con- 
trolled by  regulatory  elements  of  an  endogenous 
gene.  Insertion  of  the  reporter  DNA  is  likely  to 
disrupt  expression  of  the  endogenous  gene,  but 
in  most  cases,  expression  from  the  uninterrupted 
homologue  will  be  sufficient  to  prevent  pheno- 
typic  effects. 

To  block  this  expression  in  a  single  step,  and  in 
a  conditional  fashion,  we  have  constructed  a  se- 
ries of  gene  trap  vectors  that  contain  an  inducible 
promoter  on  the  strand  opposite  the  reporter  se- 


quences. "Captured"  cell  clones  are  first  se- 
lected by  antibiotic  resistance  or  |8-galactosidase 
staining.  After  removal  of  selective  pressure  and 
activation  of  the  inducible  promoter,  an  anti- 
sense  transcript  is  generated  against  coding  se- 
quences of  the  endogenous  gene.  In  certain 
cases,  this  antisense  transcript  will  function  in 
trans,  thereby  inhibiting  expression  of  the  endog- 
enous gene  from  the  uninterrupted  homologue. 

Control  experiments  suggest  that  these  DNA 
vectors  are  capable  of  trapping  endogenous 
genes  and  that  basal  expression  of  the  inducible 
promoter  does  not  reduce  appreciably  the  fre- 
quency of  trapping.  We  are  currently  isolating  a 
panel  of  captured  cell  clones.  The  efficiency  of 
these  antisense  promoters  in  cis  will  be  analyzed 
by  measuring  expression  of  the  reporter  se- 
quences before  and  after  antisense  induction.  Po- 
tential effects  of  trans  inhibition  will  then  be 
tested  by  examining  the  phenotypes  of  chimeric 
mice  that  contain  the  mutant  embryonic  stem 
cells.  This  approach  will  allow  the  phenotypic 
effects  of  a  recessive  mutation  to  be  studied  in  a 
diploid  organism  by  altering  only  one  of  the  two 
copies,  and  may  be  applicable  to  many  organisms 
and  developmental  systems. 
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THE  division  of  one  cell  into  two,  generally 
known  as  the  cell  division  cycle,  is  among  the 
most  basic  of  biological  processes.  The  underly- 
ing molecular  pathways  that  regulate  and  elfect 
the  cell  cycle  are  gradually  being  elucidated  by  a 
combination  of  genetic,  biochemical,  and  cell  bi- 
ological studies. 

The  two  key  events  in  the  cell  cycle  are  the 
replication  of  nuclear  DNA,  which  occurs  during 
the  S  phase,  and  cell  division  or  mitosis,  often 
referred  to  as  M  phase.  Genetic  studies,  using  the 
fission  yeast,  a  single-celled  organism,  have  re- 
vealed that  both  the  S  and  M  phases  of  the  cell 
cycle  are  regulated  by  a  single  enzyme.  This  is  a 
protein  kinase  (which  transfers  phosphate  from 
ATP  to  a  target  substrate,  altering  the  substrate's 
biological  properties)  of  which  the  catalytic  sub- 
unit  is  called  cdc2.  This  enzyme  is  not  only  pres- 
ent in  yeast  cells,  but  also  in  those  of  higher  ver- 
tebrates, including  humans. 

The  cdc2  acts  as  a  multifunctional  enzyme  by 
virtue  of  its  association  with  a  class  of  proteins 
called  cyclins.  Each  cyclin,  of  which  many  types 
have  been  identified,  binds  to  cdc2  and  confers 


on  the  protein  kinase  a  particular  substrate  speci- 
ficity and  also  targets  it  to  particular  cellular  lo- 
cations. Different  cyclins  are  synthesized  and  de- 
graded at  particular  stages  of  the  cell  cycle, 
thereby  creating  different  patterns  of  cdc2  activ- 
ity during  the  S  and  M  phases  of  the  division 
cycle. 

The  critical  cdc2  kinase  is  not  regulated  exclu- 
sively by  passive  association  with  cyclins,  but 
also  by  its  own  state  of  phosphorylation.  In  partic- 
ular, cdc2  is  one  of  the  most  heavily  tyrosine- 
phosphorylated  proteins  in  actively  proliferating 
cells.  Tyrosine  phosphorylation  serves  to  inacti- 
vate the  M-phase  form  of  cdc2  (cdc2/cyclin  B), 
and  dephosphorylation  of  the  enzyme  triggers 
mitosis. 

Our  current  research  is  focused  on  the  phenom- 
ena discussed  above.  In  particular,  a  problem  of 
present  interest  is  the  mechanism  that  coordi- 
nates the  different  phases  of  the  cell  cycle.  For 
example,  why  does  cell  division  not  occur  until 
DNA  replication  is  completed?  The  genes  that 
control  such  cellular  decisions  are  being 
investigated. 
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Fission  yeast  cells  have  been  stained  with  a  dye  (DAPl )  that  reveals  DNA  and  hence  shows  the  cell 
nucleus.  These  particular  mutant  cells  lack  two  genes  (mikl  and  wee/ )  that  encode  the  inhibi- 
tory cdc2  tyrosine  kinase.  They  are  therefore  entering  mitosis  prematurely,  before  the  completion 
of  DNA  replication,  and  are  dividing  themselves  to  death. 
Research  of  David  H.  Beach. 
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THE  most  prevalent  developmental  strategy  in 
the  animal  kingdom  is  segmentation — that 
is,  division  of  the  embryo  into  a  series  of  similar 
segments  that  later  differentiate  and  become  spe- 
cialized for  particular  functions.  The  underlying 
molecular  events  are  best  understood  in  the  fruit 
fly  Drosophila,  w^here  segmentation  and  segmen- 
tal differentiation  are  governed  by  genes  ex- 
pressed in  a  temporally  and  spatially  ordered  hier- 
archy. Many  of  the  50-some  genes  within  this 
hierarchy  have  been  characterized  at  the  molecu- 
lar level;  most  encode  proteins  implicated  in  the 
control  of  gene  expression.  Much  progress  has 
been  made  in  learning  how  these  regulatory 
genes  and  their  products  interact  to  specify  each 
other's  expression  and  thereby  generate  a  de- 
tailed system  of  spatial  information. 

Little  is  known,  however,  about  how  this  spa- 
tial information  (in  the  form  of  localized  regula- 
tory proteins)  is  used  to  assemble  precisely  the 
complex  pattern  of  structures  that  arise  in  the 
course  of  embryogenesis.  My  laboratory's  long- 
term  goal  is  to  extend  knowledge  of  this  genetic 
hierarchy  by  identifying  and  characterizing  target 
genes  whose  products  are  more  directly  involved 
in  the  construction  of  the  embryo  than  are  pro- 
teins within  the  regulatory  hierarchy.  We  are  also 
studying  the  mechanisms  by  which  such  target 
genes  are  differentially  regulated. 

The  first  tiers  of  the  regulatory  hierarchy  con- 
tain maternally  expressed  genes,  whose  products 
establish  broad  polarities  within  the  embryo.  The 
segmentation  genes  of  the  middle  tiers  subdivide 
the  embryo  along  the  anterior/posterior  axis  into 
a  linear  array  of  homologous  segments.  The  eight 
homeotic  genes  of  the  final  tier  are  regulated  by 
the  segmentation  genes  and  function  to  produce 
the  specialized  structures  that  distinguish  the 
segments. 

The  basic  morphogenetic  pathways  for  the  re- 
peated segmental  unit  appear  to  be  controlled,  at 
least  in  part,  by  regulatory  input  from  segmenta- 
tion genes.  Homeotic  genes,  however,  are  the  lab- 
oratory's current  focus  because  of  their  unique 
roles  in  channeling  the  processes  of  morphogen- 
esis into  pathways  that  produce  the  diversity  of 


structures  distinguishing  the  segments.  The  pro- 
teins encoded  by  homeotic  genes  each  contain 
the  homeodomain,  a  61 -amino  acid  segment  as- 
sociated with  DNA-binding  activity  that  is  also 
present  in  some  of  the  segmentation  and  polarity 
genes.  Homeodomain  proteins  have  sequence- 
specific  DNA-binding  properties  and  are  gener- 
ally involved  in  control  of  gene  expression  at  the 
level  of  transcription. 

The  homeodomain,  first  identified  in  Drosoph- 
ila, has  been  found  in  all  other  multicellular  ani- 
mals examined,  including  vertebrates.  Perhaps 
the  most  striking  aspect  of  this  evolutionary  con- 
servation is  that  in  vertebrates,  homeodomain 
genes  closely  related  to  the  Drosophila  homeotic 
genes  (as  judged  from  conservation  of  homeodo- 
main sequences)  are  clustered  in  the  same  linear 
order  found  in  the  Drosophila  homeotic  gene 
complexes.  In  addition,  expression  along  the 
body  axis  is  colinear,  with  locations  of  particular 
genes  within  the  vertebrate  clusters  in  a  manner 
strikingly  similar  to  that  in  Drosophila.  These  ob- 
servations suggest  that  some  of  the  mechanisms 
of  positional  specification,  and  perhaps  even 
some  aspects  of  segmentation  and  segmental  dif- 
ferentiation, are  conserved  between  insects  and 
vertebrates. 

Diflferential  Recognition  of  DNA  Sequence 
by  Homeotic  Proteins 

A  major  determinant  of  a  regulatory  protein's 
effect  upon  a  target  gene  is  the  affinity  of  that 
protein's  interactions  with  nearby  DNA  se- 
quences. Therefore  a  key  component  of  any  tran- 
scriptional regulator's  biological  properties  is 
the  regulator's  sequence  preferences  in  binding 
to  DNA.  The  proteins  encoded  by  Drosophila  ho- 
meotic genes  pose  something  of  a  puzzle  in  this 
regard  because,  despite  great  sequence  similari- 
ties between  their  homeodomains,  they  all  have 
distinct  biological  functions  (i.e.,  they  each  im- 
plement distinct  segmental  differentiation  path- 
ways). To  enable  systematic  evaluation  of  DNA 
sequence  preferences,  we  have  developed  a 
method  by  which  the  optimal  DNA-binding  se- 
quence for  a  particular  protein  can  be  selected 
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from  a  large  population  of  molecules  that  in- 
cludes all  possible  sequence  combinations. 

This  method  has  been  used  to  determine  and 
compare  the  optimal  binding  sites  for  two  closely 
related  homeodomain  peptides  encoded  by  the 
Drosophila  homeotic  genes  Ultrabithorax 
(Obx)  and  Deformed  {Dfd).  The  optimal  sites 
for  these  proteins  hold  in  common  a  core  se- 
quence, TAAT,  that  is  very  important  for  binding. 
Additional  bases  to  either  side  of  the  TAAT  core 
are  also  recognized,  but  these  base  preferences 
dilfer  between  the  two  proteins.  Comparative 
studies  indicate  5-  to  10-fold  relative  preferences 
of  Dfd  and  Ubx  homeodomains  for  their  own  op- 
timal sites.  Additional  studies  are  under  way  to 
determine  which  of  the  amino  acid  sequence  dif- 
ferences between  Ubx  and  Dfd  are  responsible 
for  the  differences  in  DNA  recognition.  Addi- 
tional insight  into  the  nature  of  the  critical  molec- 
ular interactions  should  come  from  a  crystallo- 
graphic  study  in  collaboration  with  Carl  Pabo 
(HHMI)  and  Neil  Clarke,  at  the  Johns  Hopkins 
University,  of  a  Ubx  homeodomain  complex  with 
its  optimal  binding  site. 

In  vivo  studies  with  the  full-length  proteins 
indicate  that  their  DNA  sequence  preferences  are 
similar  to  those  of  the  homeodomain  peptides. 
The  full-length  C/fojtr  protein,  however,  unlike  the 
f/fcjc  homeodomain  peptide,  binds  cooperatively 
to  multiple  sites.  This  cooperativity  can  increase 
complex  stability  from  less  than  a  minute  for  a 
single  isolated  site  to  several  hours  for  four  or 
more  sites. 

The  most  striking  property  of  these  coopera- 
tive interactions  is  that  they  can  occur  between 
sites  separated  by  a  variety  of  distances,  ranging 


from  a  single  helical  turn  of  the  DNA  to  more  than 
20  turns.  The  long-range  interactions  involve  a 
mechanism  in  which  the  intervening  DNA  is 
curved  to  permit  contact  between  proteins 
bound  at  the  base  of  a  resulting  DNA  loop.  The 
affinity  for  binding  of  full-length  Ubx  protein  is 
therefore  a  function  of  the  number  of  sites  pres- 
ent within  a  DNA  fragment,  and  Ubx  proteins  ap- 
pear to  act  as  integrators  of  the  number  and  affin- 
ity of  core  binding  site  sequences. 

With  regard  to  differential  gene  regulation, 
these  results  suggest  the  following:  first,  differ- 
ences in  DNA  sequence  recognition  account  for 
at  least  some  of  the  biological  specificity  of  ho- 
meotic genes;  second,  the  molecular  basis  for  dis- 
crimination between  homeotic  protein-binding 
sites  may  be  interactions  with  bases  flanking  a 
common  TAAT  core;  third,  large  differential  af- 
finities for  a  DNA  region  might  be  generated  by 
cooperative  binding  of  homeotic  proteins  to  col- 
lections of  core  sites  that  individually  show  only 
small  differences  in  affinity. 

Identification  of  Ubx  Regulatory  Targets 

We  have  used  a  method  similar  to  the  one  for 
determination  of  optimal  DNA  binding  sites  to 
select  and  clone  DNA  fragments  containing  Ubx 
binding  sites  from  total  Drosophila  genomic 
DNA.  We  expect  that  genes  regulated  by  Z/fejc  will 
be  located  near  such  binding  sites.  We  have  also 
identified  sequences  from  Drosophila  cultured 
cell  lines  that  are  differentially  expressed  in  the 
presence  and  absence  of  Ubx  protein.  The  se- 
quences isolated  by  these  procedures  are  being 
analyzed  to  determine  how  they  function  in  the 
embryo. 
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RESEARCH  in  our  laboratory  is  aimed  at  molec- 
ular analysis  of  human  genetic  diseases,  with 
the  dual  goals  of  learning  more  about  the  diseases 
themselves  and  shedding  light  on  normal  human 
biology.  Currently  we  are  studying  cystic  fibrosis, 
deficiencies  in  urea  cycle  enzymes,  spinocere- 
bellar ataxia,  and  defects  in  cell  adhesion  mole- 
cules. Most  of  this  research  is  conducted  in  close 
collaboration  with  William  O'Brien. 

Cystic  fibrosis  (CF)  is  a  common  genetic  dis- 
ease affecting  approximately  1  in  2,500  Cauca- 
sians. Most  CF  patients  die  of  progressive  lung 
disease  during  childhood  or  young  adult  life. 
About  1  in  25  Caucasians  carry  an  abnormal  CF 
gene,  and  about  1  in  625  couples  are  at  high  risk 
of  having  an  affected  child.  Since  1985  our  labo- 
ratory has  used  DNA  analysis  for  carrier  detection 
and  prenatal  diagnosis  for  CF.  This  work  was 
changed  dramatically  in  late  1989  when  other 
laboratories  cloned  the  gene  for  CF  and  identified 
the  most  common  defect  (mutation)  in  the  gene. 
We  analyzed  more  than  200  North  American  fami- 
lies for  this  mutation;  the  defect  was  present  on 
76  percent  of  the  CF  chromosomes.  This  defect 
was  present  on  only  30  percent  of  CF  chromo- 
somes in  Ashkenazic  Jewish  families,  however, 
indicating  that  other  mutations  predominate  in 
that  population.  Direct  analysis  of  mutations 
greatly  improves  the  genetic  diagnostic  tests  that 
can  be  offered  to  families. 

One  option  for  prevention  of  CF  would  be  to 
test  all  couples  prior  to  reproduction,  to  identify 
carriers  of  the  abnormal  gene.  Although  a  single 
common  defect  is  present  in  the  majority  of  CF 
chromosomes,  dozens  of  different  mutations  af- 
fect the  remaining  fraction  of  abnormal  chromo- 
somes. Thus  no  one  simple  DNA  test  can  be  used 
to  identify  all  gene  carriers.  Our  laboratory  and 
other  laboratories  participating  in  a  large  collabo- 
ration have  identified  many  of  these  additional 
mutations.  The  laboratory  has  developed  more 
efficient  methods  for  detection  of  multiple  muta- 
tions. We  currently  test  for  the  five  most  common 
mutations  causing  CF  in  North  Americans,  and 
this  identifies  85  percent  of  carriers.  The  ability 
to  detect  90-95  percent  of  carriers  would  be 


preferable  before  offering  carrier  testing  to  all  re- 
productive-age couples,  and  the  laboratory  is 
working  to  achieve  this  level  of  detection. 

To  develop  a  treatment  for  CF,  it  is  desirable  to 
have  an  animal  model  for  easier  analysis  and  for 
therapeutic  trials.  Methods  are  available  to  de- 
velop an  animal  model  in  mice,  and  we  have 
characterized  the  sequence  of  the  mouse  CF  gene 
as  one  step  in  this  process.  The  mouse  gene, 
which  is  similar  to  the  human  gene,  has  been 
used  to  prepare  DNA  clones  that  are  suitable  for 
disrupting  the  normal  mouse  gene  in  cultured 
cells.  These  altered  cells  can  be  used  to  generate 
mice  affected  with  CF.  Despite  substantial  effort, 
it  has  proven  difficult  to  disrupt  the  CF  gene  in 
the  mouse.  It  would  also  be  desirable  to  study  the 
equivalent  of  the  CF  gene  in  Drosophila  (fruit 
flies)  or  yeast,  if  such  a  gene  exists.  Preliminary 
work  has  identified  some  related  genes  in  these 
organisms,  but  it  is  not  clear  if  these  genes  are 
functionally  equivalent  to  the  human  CF  gene. 

For  many  years  our  laboratory  has  studied  en- 
zymes of  the  urea  cycle,  with  particular  focus  on 
argininosuccinate  synthetase  (AS).  Genetic  defi- 
ciency of  AS  causes  citrullinemia,  a  disease  char- 
acterized by  accumulation  of  ammonia  in  the 
body,  which  leads  to  mental  retardation,  coma, 
and  death.  Patients  with  citrullinemia  have  a  re- 
cessive disease,  with  two  defective  copies  of  the 
gene.  Mutations  causing  citrullinemia  include 
deletions  of  segments  of  the  gene,  numerous  dif- 
ferent single-amino  acid  substitutions,  and  de- 
fects that  cause  abnormal  splicing  of  the  messen- 
ger RNA.  Fourteen  different  mutations  have  been 
defined;  almost  all  patients  inherit  a  different 
mutation  from  each  parent.  The  laboratory  has 
also  used  citrullinemia  as  a  model  disease  for  de- 
veloping somatic  gene  therapy.  The  human  gene 
for  AS  was  introduced  into  retrovirus  vectors  in 
the  form  of  a  cDNA.  The  viruses  produced  are 
able  to  infect  mouse  and  human  cells,  and  they 
have  been  used  to  infect  mouse  bone  marrow 
cells,  which  are  then  transplanted  into  lethally 
irradiated  recipient  mice.  The  human  gene  for  AS 
is  expressed  for  many  months  in  a  proportion  of 
the  transplanted  animals,  and  this  long-term  ex- 
pression in  mice  is  encouraging. 
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In  collaboration  with  Huda  Zoghbi,  we  are  at- 
tempting to  clone  the  gene  for  dominantly  in- 
herited spinocerebellar  ataxia,  a  neurological 
disease  that  is  passed  from  generation  to  genera- 
tion. The  symptoms  usually  begin  in  young  adult 
life  and  result  in  physical  disability,  mental  dete- 
rioration, and  death  within  about  10  years.  Re- 
cent studies  have  localized  the  gene  to  a  small 
region  of  chromosome  6.  Large  DNA  clones 
known  as  yeast  artificial  chromosomes  have  been 
isolated  from  this  region,  and  the  tedious  task  of 
identifying  and  cloning  the  disease  gene  is 
continuing. 

A  major  new  project  is  the  study  of  cell  adhe- 
sion molecules,  which  are  involved  in  cell- 
surface  and  cell-cell  adherence,  particularly  as 
they  relate  to  interactions  between  white  blood 
cells  and  the  walls  of  blood  vessels.  The  genetic 
disease  leukocyte  adhesion  deficiency  involves 
the  absence  of  one  of  these  molecules,  desig- 


nated CD  18,  on  the  white  blood  cells  and  results 
in  fatal  susceptibility  to  infections.  Some  muta- 
tions causing  this  disease  have  been  determined. 
The  disease  is  an  excellent  model  for  attempts  at 
somatic  gene  therapy,  and  the  gene  has  been  in- 
troduced into  retroviral  vectors.  A  major  focus  of 
this  work  is  to  generate  mouse  models  with  muta- 
tions in  these  cell  adhesion  genes.  The  mouse 
genes  for  CD  18,  intercellular  adhesion  mole- 
cule-1  (ICAM-1),  and  granule  membrane  pro- 
tein-140  (GMP-140)  have  been  isolated  and  se- 
quenced. For  CD  18  and  lCAM-1,  the  mouse  gene 
has  been  disrupted  in  cultured  cells,  and  the  cul- 
tured cells  are  being  injected  into  mouse  em- 
bryos to  obtain  mutant  animals.  Mouse  mutants 
would  be  valuable  for  studying  the  normal  bio- 
logic function  of  the  various  cell  adhesion  mole- 
cules. These  molecules  are  likely  to  be  important 
in  processes  such  as  infection,  inflammation,  and 
atherosclerosis. 
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DIABETES  mellitus  is  a  disorder  of  carbohy- 
drate metabolism  characterized  by  elevated 
blood  glucose  levels.  It  affects  individuals  of  all 
ethnic  groups.  In  the  United  States,  an  estimated 
6  million  persons  are  known  to  have  diabetes, 
and  there  is  probably  an  equal  number  with  un- 
recognized disease.  About  10  percent  of  persons 
over  age  65  have  diabetes,  and  the  complications 
of  the  cardiovascular,  kidney,  visual,  and  nervous 
systems  are  major  causes  of  morbidity  and 
mortality. 

Clinically  diabetes  is  a  heterogeneous  dis- 
order. One  form,  insulin-dependent  diabetes 
mellitus,  primarily  affects  children  and  adoles- 
cents. It  results  from  immunological  destruction 
of  the  insulin-producing  cells  of  the  pancreas; 
because  of  the  absolute  deficiency  of  insulin,  pa- 
tients require  insulin  therapy  for  survival.  The 
more  common  form  of  diabetes,  non-insulin- 
dependent  diabetes  mellitus  (NIDDM),  includes 
about  90  percent  of  diabetic  patients.  This  form, 
which  results  from  reduced  insulin  levels  in 
some  patients  and  a  relative  deficiency  in  others, 
is  due  to  abnormal  functioning  of  the  insulin- 
producing  cells  or  decreased  responsiveness  of 
tissues  to  insulin.  In  these  patients  the  blood  glu- 
cose levels  can  usually  be  controlled  by  diet  or  by 
drugs  that  can  be  taken  orally  to  improve  insulin 
secretion  and  action. 

As  with  other  common  diseases  such  as  athero- 
sclerosis and  hypertension,  genetic  factors  con- 
tribute to  the  development  of  NIDDM.  Our  long- 
term  goal  is  to  identify  the  genes  for  diabetes 
susceptibility  and  to  determine  how,  together 
with  environmental  and  life-style  factors,  they  re- 
sult in  the  elevated  blood  glucose  levels  that  de- 
fine this  disorder. 

We  have  taken  an  approach  that  applies  genet- 
ics and  molecular  biology.  Our  working  hypothe- 
sis is  that  a  relatively  small  number  of  potentially 
identifiable  major  genes  increase  the  risk  of  devel- 
oping diabetes  and  that  the  individual's  overall 
genetic  background,  together  with  environmen- 
tal and  life-style  factors,  influences  the  expres- 
sion of  the  major  susceptibility  genes. 


In  our  genetic  studies  we  are  examining  the 
segregation  of  DNA  polymorphisms  in  diabetes- 
prone  families.  We  have  recently  identified  a 
DNA  marker  on  the  long  arm  of  human  chromo- 
some 20  that  cosegregates  with  NIDDM  in  a 
larger  multigenerational  family  with  a  form  of 
NIDDM  termed  maturity-onset  diabetes  of  the 
young  (MODY).  The  diabetes  in  this  family  is 
characterized  by  low  serum  insulin  levels,  an  age 
of  onset  of  less  than  25  years,  and  an  autosomal 
dominant  mode  of  inheritance.  The  isolation  of 
this  MODY-susceptibility  gene  and  identification 
of  the  mutation  that  impairs  its  function  may  pro- 
vide important  clues  to  causes  of  other  forms  of 
NIDDM.  In  addition,  it  will  also  be  possible  to 
assess  the  contribution  of  this  locus  to  the  overall 
prevalence  of  diabetes  mellitus.  Since  the 
MODY-susceptibility  gene  on  chromosome  20  is 
unlikely  to  account  for  genetic  susceptibility  to 
NIDDM  in  all  individuals,  we  are  continuing  to 
search  for  other  markers  that  cosegregate  with 
NIDDM,  using  large  families  in  which  the  dia- 
betes is  not  linked  to  the  marker  on  chromosome 
20.  These  genetic  studies  should  lead  to  the  iden- 
tification of  additional  diabetes-susceptibility 
genes. 

Drawing  on  our  understanding  of  the  patho- 
physiology of  NIDDM,  we  are  also  cloning  and 
characterizing  genes  that  might  reasonably  con- 
tribute to  diabetes  susceptibility.  These  include 
genes  encoding  proteins  that  are  expressed  in  the 
insulin-producing  cell,  including  insulin,  islet 
amyloid  polypeptide  (a  newly  discovered  hor- 
mone-like peptide  of  uncertain  function),  and 
the  and  Ca^"*"  channels  that  are  involved  in  in- 
sulin secretion.  We  are  also  studying  the  genes  for 
proteins  responsible  for  uptake  and  metabolism 
of  glucose,  since  the  insulin-producing  cells  of 
NIDDM  patients  have  lost  the  ability  to  secrete 
insulin  in  response  to  increasing  circulating  glu- 
cose levels.  Other  candidate  genes  encode  pro- 
teins expressed  in  the  insulin-responsive  target 
cell,  including  the  insulin  receptor  and  the  mem- 
brane proteins  involved  in  glucose  uptake.  In  ad- 
dition, we  have  considered  genes  encoding 
serum  proteins  that  are  instrumental  in  the  trans- 
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port  of  triglycerides  and  cholesterol  and  thus  im- 
plicated in  the  development  of  cardiovascular 
disease,  since  there  are  data  suggesting  that  dia- 
betes and  cardiovascular  disease  may  have  com- 
mon genetic  antecedents. 

Genetic  variation  associated  with  these  candi- 
date genes  is  being  examined  in  groups  of  dia- 
betic and  nondiabetic  subjects,  as  well  as  in  fami- 
lies, to  assess  the  contribution  of  these  genes  to 
the  development  of  NIDDM.  The  results  of  the 
studies  to  date  suggest  that  there  is  variation  in 
the  genes  for  the  insulin  receptor  and  for  apoli- 
poproteins  Al  and  B  that  influences  the  pheno- 
typic  expression  of  a  major  susceptibility  gene. 

Our  isolation  of  the  genes  for  membrane  pro- 
teins involved  in  the  transport  of  glucose  across 
the  plasma  membrane,  in  addition  to  providing 
markers  for  our  genetic  studies,  has  revealed  un- 
expected functional  complexity  that  could  have 
important  implications  for  the  treatment  of  dia- 
betes. These  studies  indicate  that  facilitative  glu- 
cose transport  is  not  the  property  of  a  single  pro- 
tein but  rather  involves  a  family  of  at  least  five 
structurally  related  proteins.  These  proteins  have 
distinct  patterns  of  tissue  distribution  and  differ- 
ent kinetic  properties  and  are  independently  reg- 
ulated. These  features  allow  the  precise  disposal 


of  dietary  glucose  under  varying  physiological 
conditions. 

Data  from  our  laboratory  and  others  indicate 
that  decreased  levels  of  two  of  these  glucose 
transport  proteins  may  contribute  to  the  develop- 
ment of  the  diabetic  state.  In  adipose  tissue  there 
are  reduced  amounts  of  the  glucose  transport 
protein  GLUT4,  which  is  responsible  for  insulin- 
stimulated  glucose  uptake  by  this  tissue.  The  re- 
duced amounts  of  GLUT4  provide  a  molecular 
explanation  for  the  decreased  ability  of  adipose 
tissue  of  diabetic  and  obese  individuals  to  take  up 
glucose.  Because  adipose  tissue  participates  in 
the  disposal  of  dietary  glucose  after  a  meal,  the 
decreased  expression  of  this  protein  may  contrib- 
ute to  high  circulating  glucose  levels.  The  levels 
of  GLUT2,  the  glucose  transport  protein  of  insu- 
lin-producing cells,  are  also  decreased  in  dia- 
betes. Since  GLUT2  may  comprise  part  of  the  glu- 
cose-sensing mechanism  that  regulates  insulin 
secretion,  decreased  levels  of  this  protein  may  be 
responsible  for  the  abnormal  functioning  of  the 
insulin-producing  cells  of  diabetic  individuals.  A 
better  understanding  of  the  regulation  of  expres- 
sion of  GLUT2  and  GLUT4  may  facilitate  the  de- 
velopment of  novel  therapeutic  strategies  for  the 
treatment  of  diabetes. 
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OUR  research  is  centered  on  the  development 
of  the  peripheral  nervous  system  (PNS)  of 
the  fruit  fly  Drosophila  melanogaster.  We  be- 
lieve that  the  results  will  help  us  understand  the 
development  of  the  nervous  system  in  many  eu- 
karyotic  species.  We  have  focused  on  the  PNS  of 
Drosophila  because  its  cells  are  relatively  easy  to 
study  and  because  sophisticated  genetics  can  be 
applied  to  this  model  organism. 

We  are  presently  studying  two  genes  that  are 
expressed  at  the  outset  of  embryonic  nervous  sys- 
tem development  and  later  in  embryogenesis  in 
most  cells  of  the  PNS.  The  genes  were  identified 
in  enhancer  detector  screens  in  which  regulatory 
sequences  are  determined  by  means  of  a  jS-galac- 
tosidase  reporter  gene.  The  newfound  genes 
are  essential,  since  mutations  cause  embryonic 
lethality.  Here  we  describe  their  preliminary 
characterization . 

One  of  the  two  genes  is  named  couch  potato 
because  some  homozygous  mutant  flies  are  via- 
ble but  hypoactive  and  have  poor  jump  re- 
sponses, poor  flight  abilities,  and  slow  recovery 
from  ether  anesthesia.  Some  of  these  attributes 
are  probably  due  to  thoracic  muscle  defects  that 
we  observe  in  the  adult  fly.  More  than  10  inser- 
tional  mutations  in  couch  potato  were  recovered 
from  several  enhancer  detector  screens,  and  a 
careful  genetic  analysis  revealed  that  none 
caused  a  complete  loss  of  function  of  the  gene. 
Some  insertional  alleles  cause  recessive  embry- 
onic lethality,  but  no  defects  were  observed  in 
the  PNS  of  the  mutant  embryos. 

In  order  to  define  the  null  phenotype,  we  per- 
formed an  excision  mutagenesis  of  one  of  the  in- 
sertions and  recovered  an  embryonic  lethal  muta- 
tion in  the  gene  that,  by  genetic  criteria,  can  be 
defined  as  a  complete  loss-of-function  mutation. 
Some  embryos  lacking  the  couch  potato  gene  ex- 
hibit defects  in  the  neuronal  pathways  of  PNS  and 
lack  some  peripheral  nerve  cells.  Hence  the 
couch  potato  gene  seems  to  play  a  role  in  the 
development  of  the  PNS  as  well  as  in  the  develop- 
ment of  the  adult  thoracic  muscles. 
A  molecular  analysis  of  couch  potato  was  initi- 


ated with  a  genomic  DNA  fragment  next  to  one  of 
the  enhancer  detector  insertions.  This  fragment 
allowed  us  to  isolate  three  different  couch  potato 
cDNAs.  These  were  labeled  for  a  nonradioactive 
whole-mount  in  situ  hybridization  technique, 
which  showed  that  the  gene  is  expressed  in  the 
PNS,  including  the  glial  cells,  and  in  some  glial 
cells  of  the  central  nervous  system.  In  all  cases 
the  expression  patterns  of  |8-galactosidase  in  the 
embryos  that  carry  an  enhancer  detector  and  the 
transcripts  of  couch  potato  are  essentially  the 
same. 

All  the  insertions  that  were  recovered  are  clus- 
tered in  a  450-bp  genomic  fragment.  In  addition, 
there  is  a  very  good  correlation  between  the  de- 
fects that  we  observe  in  embryos  or  adults  and  the 
position  of  these  insertions;  insertional  alleles 
that  cause  a  particular  phenotype  are  clustered 
within  50  bp.  These  data,  combined  with  the  ge- 
netic data,  indicate  that  the  genomic  sequences 
next  to  the  insertions  contain  some  of  the  gene's 
regulatory  sequences  that  confer  the  tissue- 
specific  expression  pattern.  That  this  is  indeed 
the  case  has  been  shown  by  bringing  these  se- 
quences upstream  of  a  i8-galactosidase  reporter 
gene  in  transformed  flies.  The  transformed  em- 
bryos express  the  reporter  in  a  subset  of  cells 
(PNS  of  head  and  glial  cells)  in  which  couch  po- 
tato is  expressed,  showing  that  these  sequences 
contain  at  least  some  couch  potato  regulatory 
sequences. 

The  expression  of  the  couch  potato  gene  in  the 
glial  cells  is  interesting,  as  we  observe  defects  in 
the  pathways  of  the  peripheral  neurons  in  a  sub- 
stantial fraction  of  the  embryos  lacking  the  gene. 
The  glial  cells  may  play  a  key  role  in  helping  the 
neuron  find  its  normal  target  organ,  tissue,  or 
cell.  It  is  therefore  possible  that  the  lack  of  ex- 
pression of  the  gene  in  the  glial  cells  causes  a 
neuronal  pathway  defect. 

To  test  the  hypothesis  that  the  glial  cells  play  a 
role  in  neuronal  pathway  formation,  we  would 
like  to  ablate  them.  Hence  we  have  devised  a  tech- 
nique that  should  allow  specific  ablation  of  these 
cells  early  in  development.  It  makes  use  of  a  tem- 
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perature-sensitive  diphtheria  toxin,  DT-Ats,  that 
we  have  developed  in  yeast  and  tested  in  Dro- 
sophila eye  cells. 

Sequence  analysis  of  the  different  couch  po- 
tato cDNAs  indicates  that  the  gene  may  have  pe- 
culiar features.  The  three  cDNAs  share  a  long 
open  reading  frame  that  conforms  with  the  Dro- 
sophila codon  usage,  but  we  have  been  unable  to 
define  an  initiation  codon.  Translation  of  the 
open  reading  frame  indicates  that  the  gene  may 
encode  a  transmembrane  protein  that  shows  no 
homology  to  any  of  those  published  in  the  data 
banks. 

In  summary,  we  believe  that  couch  potato  may 
play  an  important  role  in  the  development  of  the 
PNS,  as  its  gene  product  may  help  guide  the  axons 
of  the  PNS  neurons  to  their  appropriate  target. 
The  role  of  the  gene  in  adult  thoracic  muscle  de- 
velopment remains  elusive,  but  many  phenotypic 
similarities  have  been  observed  between  couch 
potato  and  stripe,  which  maps  close  to  the 
former  gene.  The  stripe  gene  also  affects  thoracic 
muscle  development  and  is  thought  to  be  re- 
quired in  the  nervous  system. 

The  second  gene  we  are  studying,  named  A3  7, 


is  expressed  early  in  the  development  of  the  PNS. 
It  maps  at  cytological  band  80A  and,  like  couch 
potato,  it  was  first  detected  in  an  enhancer 
screen.  So  far,  only  a  single  insertion  has  been 
identified,  to  our  knowledge.  Flies  homozygous 
for  this  particular  enhancer  detector  insertion  are 
homozygous  viable.  A  mutagenesis  experiment 
allowed  us  to  recover  10  embryonic  recessive 
lethal  mutations  that  fail  to  complement  one  an- 
other. The  phenorype  of  the  mutant  embryos  is 
being  characterized,  and  preliminary  data  sug- 
gest that  some  cells  of  the  embryonic  PNS  are 
lacking,  although  the  gene  is  expressed  in  almost 
all  PNS  cells,  including  the  support  cells.  The 
gene  is  apparently  encoded  in  a  4.6-kb  transcript, 
which  is  only  expressed  transiently  in  the  PNS  of 
developing  embryos  (5-10  h).  Later  in  embry- 
onic development  (14-16  h),  the  transcript  is 
also  observed  in  most  embryonic  muscles.  Prelim- 
inary sequencing  data  show  that  the  cDNAs  that 
we  have  isolated  so  far  contain  no  open  reading 
frames  of  significant  length.  We  are  presently  try- 
ing to  define  the  phenotype  of  the  mutants  more 
accurately  and  continuing  to  characterize  the 
gene  at  the  molecular  level. 
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THE  production  of  mature  blood  cells  involves 
an  ordered  series  of  differentiation  programs. 
At  the  root  of  this  production  system  are  the  plu- 
ripotent  hematopoietic  stem  cells — a  small  popu- 
lation of  cells  capable  of  extensive  proliferation 
and  differentiation,  but  also  of  self-renewal. 
These  cells,  which  normally  reside  in  the  bone 
marrow,  arise  in  early  fetal  development  and  per- 
sist throughout  adult  life.  If  removed  from  the 
bone  marrow  and  transplanted  into  a  prepared 
recipient,  they  will  stably  reconstitute  the  full 
blood  and  immune  cell  developmental  systems. 
We  are  working  out  techniques  for  efficient  gene 
transfer  into  mouse  and  human  stem  cells,  using 
retrovirus  vectors.  Ultimately  we  hope  to  exploit 
these  methods  for  human  gene  therapy  of  genetic 
and  acquired  diseases. 

A  viral  vector  system  based  on  the  Moloney  mu- 
rine leukemia  virus  (MLV)  has  been  chosen  be- 
cause of  its  theoretical  potential  for  high  gene 
transfer  efficiency  in  a  variety  of  mammalian  cell 
types  (including  human).  The  unique  life  cycle 
of  this  retrovirus  makes  it  attractive  for  adapta- 
tion as  a  vector,  since  the  foreign  genetic  material 
is  stably  integrated  into  the  host  cell  genome. 
MLV  vector  particles  are  able  to  carry  their  ge- 
netic material  to  the  target  cells,  but  are  unable  to 
replicate  and  spread  as  a  live  infectious  agent. 

We  have  been  studying  two  model  systems  that 
allow  investigation  of  several  fundamental  prop- 
erties of  the  stem  cells  as  targets  for  gene  transfer 
and  for  expression  of  genetic  material  by  the  vec- 
tor. One  model  uses  the  bacterial  antibiotic  resis- 
tance gene  neo  to  introduce  identifiable  genetic 
tags  into  individual  stem  cells.  The  other  model 
uses  the  human  enzyme  adenosine  deaminase 
(ADA)  as  the  molecular  marker  for  gene  transfer. 
This  system  is  particularly  suitable  for  studies  of 
expression  of  genes  by  retrovirus  vectors.  In  ad- 
dition, the  genetic  deficiency  of  ADA  causes  a 
form  of  severe  combined  immune  deficiency,  so 
that  successful  laboratory  experiments  with  this 
gene  may  allow  a  smooth  transition  to  clinical 
application  of  the  gene  transfer  procedures. 

Our  earlier  work  demonstrated  that  genes 
could  be  introduced  into  hematopoietic  stem 


cells  but  that  the  process  was  much  less  efficient 
than  in  the  more  mature  cells  of  the  marrow.  In 
mouse  transplant  experiments,  only  about  50 
percent  of  the  animals  retained  expression  of  the 
human  ADA  enzyme  in  their  blood  for  more  than 
six  months.  This  has  led  to  an  investigation  of  the 
conditions  in  cell  culture  that  would  optimally 
support  the  survival  or  proliferation  of  the  stem 
cells  during  exposure  to  the  vectors. 

In  collaboration  with  Doug  Williams  (Im- 
munex,  Seattle),  we  have  been  evaluating  the  ef- 
fects of  several  recombinant  hematopoietic 
growth  factors  on  retroviral  vector-mediated 
gene  transfer  into  stem  cells.  These  factors  have 
included  interleukin-3,  -6,  and  -7;  granulocyte 
colony-stimulating  factor  (G-CSF);  and  leukemia 
inhibitory  factor  (LIF).  LIF  has  been  of  special 
interest  because,  among  its  many  biological  func- 
tions, it  appears  to  prevent  the  differentiation  of 
mouse  embryonic  stem  cells.  If  it  had  a  similar 
action  on  hematopoietic  stem  cells,  it  might  al- 
low the  preservation  of  their  developmental  ca- 
pacity in  culture. 

We  observed  that  LIF  stimulated  a  10-fold  in- 
crease in  the  efficiency  of  gene  transfer  into  the 
primitive  hematopoietic  precursor  cells  called 
CFU-S  (colony-forming  unit,  spleen).  A  novel  as- 
say employing  inbred  transgenic  mice  was  then 
used  to  test  the  activity  of  LIF  on  stem  cells.  These 
experiments  indicate  that  LIF  preserves  the  stem 
cells  during  the  culture  period  required  for  gene 
transfer.  Inclusion  of  LIF  in  the  bone  marrow  cul- 
tures now  allows  about  70  percent  of  the  stem 
cells  to  be  infected  with  the  vector.  Subsequently 
all  the  mice  receiving  such  cells  maintain  high- 
level  expression  of  human  ADA  in  their  blood  and 
immune  system  organs  for  at  least  six  months. 
Suspecting  that  LIF  acts  in  concert  with  one  or 
more  other  growth  factors  in  our  experimental 
model,  we  are  examining  its  effects  alone  and  in 
combination  with  other  stimulators  on  purified 
hematopoietic  stem  cells. 

To  improve  analysis  of  the  behavior  of  stem 
cells  and  their  progeny  in  culture  and  after  trans- 
plant, we  have  developed  a  new  method  for  in- 
troducing unique  genetic  identifiers  into  individ- 
ual cells.  This  biochemical   method,  which 
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allows  identification  of  the  stem  cells  by  the  poly- 
merase chain  reaction  (PGR),  is  especially  suited 
for  analysis  of  very  small  numbers  of  cells.  A  fam- 
ily of  vectors  carrying  the  bacterial  gene  neo  have 
been  constructed  that  are  identical  except  for 
small  variations  in  size,  variations  conveniently 
distmguished  by  PGR.  When  mixed  together 
these  vectors  provide  an  array  of  potential 
markers  that  can  integrate  into  each  stem  cell. 
Because  each  cell  receives  one  to  four  vectors, 
the  integration  of  a  particular  subset  of  the  vec- 
tors provides  a  genetic  fingerprint  unique  to  each 
cell  family. 

Preliminary  results  with  these  vectors  indicate 
that  we  can  mark  the  most  primitive  hematopoi- 
etic precursors  efficiently.  We  plan  to  use  this 
method  to  compare  different  culture  conditions 
for  stem  cells.  We  hope  to  be  able  eventually  to 
grow  the  stem  cells  in  very  long-term  cultures. 

The  retroviral  vector  genetic  marking  system 


may  also  prove  to  be  highly  informative  in  clini- 
cal bone  marrow  transplantation.  We  have  under- 
taken collaborative  studies  with  Albert  Deisse- 
roth  (M.D.  Anderson  Hospital,  Houston)  to 
determine  whether  the  techniques  that  have  been 
used  to  optimize  gene  transfer  of  mouse  bone 
marrow  stem  cells  can  be  used  for  human  cells. 

These  experiments  demonstrate  that  consistent 
and  efficient  gene  transfer  into  primitive  human 
precursors  is  possible.  After  further  tissue  culture 
studies  are  performed,  we  plan  to  do  genetic  la- 
beling experiments  in  patients  undergoing  bone 
marrow  transplant  for  advanced  leukemia.  These 
investigations  will  provide  a  novel  way  to  investi- 
gate the  origins  of  leukemia  recurrence  and  the 
recuperation  of  the  blood  system  following  trans- 
plantation. We  also  hope  to  learn  many  valuable 
lessons  that  will  contribute  to  the  long-term  goal 
of  treating  disease  by  insertion  of  therapeutic 
vectors. 
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STRUCTURAL  proteins  in  the  cytoplasm  and 
membranes  of  ceils  provide  the  basis  for  spa- 
tial organization  of  the  diverse  components  of  eu- 
karyotic  cells.  These  proteins  thus  are  principal 
participants  in  fundamental  activities  of  cells, 
such  as  cell  motility,  organization  of  the  cyto- 
plasm, and  cell-cell  interactions.  Our  work  over 
the  past  10  years  has  focused  on  plasma  mem- 
branes. We  initiated  these  studies  in  the  human 
erythrocyte.  This  relatively  simple  cell  has  pro- 
vided an  experimentally  accessible  model  system 
for  detailed  dissection  of  protein-protein  interac- 
tions that  are  responsible  for  the  structure  and 
organization  of  the  plasma  membrane. 

The  principal  structural  protein  in  the  erythro- 
cyte membrane  is  the  flexible  rod-shaped  mole- 
cule spectrin,  which  is  organized  in  a  two-dimen- 
sional network  attached  to  the  cytoplasmic 
surface  of  the  plasma  membrane.  Spectrin  mole- 
cules are  attached  at  their  ends  to  form  a  series  of 
hexagons  and  pentagons  that  closely  resembles  a 
geodesic  dome.  The  binding  of  spectrin  to  the 
protein  ankyrin  attaches  the  spectrin  network  to 
the  plasma  membrane.  Ankyrin  also  interacts 
with  high  affinity  with  the  cytoplasmic  domain  of 
an  integral  membrane  protein  (a  protein  that  tra- 
verses the  membrane  and  actually  has  portions 
exposed  on  both  the  inner  and  outer  membrane 
surfaces) .  The  spectrin-based  membrane  network 
or  skeleton  is  required  for  normal  stability  of 
erythrocytes  in  the  circulation.  Abnormalities  in 
amounts  or  function  of  spectrin  and  associated 
proteins  result  in  hemolytic  anemias  and  are  the 
basis  for  diseases  such  as  hereditary  spherocytosis 
and  hereditary  elliptocytosis.  Proteins  closely  re- 
lated to  spectrin  are  present  in  many  vertebrate 
cells  and  are  associated  in  most  cases  with  the 
plasma  membrane. 

Spectrin  is  present  in  especially  high  amounts 
in  brain,  where  it  comprises  3  percent  of  the  total 
membrane  protein.  The  spectrin-based  mem- 
brane skeleton  in  brain  and  other  tissues  is  likely 
to  play  an  important  role  in  providing  organiza- 
tion of  integral  membrane  proteins  in  the  plasma 
membrane  and  for  coupling  membrane  proteins 
to  elements  of  the  cytoskeleton.  Potential  physio- 


logical consequences  of  these  activities  include 
stabilization  of  the  lipid  bilayer  and  organization 
of  membrane  proteins  in  specialized  regions  on 
the  cell  surface  in  polarized  cells. 

Specific  aims  of  this  laboratory  are  to  elucidate 
the  proteins  in  erythrocytes  and  other  cells  that 
mediate  interaction  of  spectrin  with  membranes, 
determine  how  these  protein  interactions  are  reg- 
ulated, and  understand  the  cellular  functions  of 
the  spectrin  skeleton. 

Ankyrin  in  the  Nervous  System 

Brain  ankyrin  binds  to  brain  spectrin  and  to  in- 
tegral membrane  protein  sites  in  brain  mem- 
branes. Ankyrin  appears  to  function  as  an  adapter 
between  these  proteins  and  the  spectrin  skeleton. 
We  recently  discovered  that  brain  contains  multi- 
ple forms  of  ankyrin,  which  all  bind  to  spectrin 
but  are  likely  to  associate  with  distinct  membrane 
proteins. 

One  well-characterized  ankyrin-binding  pro- 
tein is  the  voltage-dependent  sodium  channel.  An 
isoform  of  ankyrin  is  highly  concentrated  along 
with  the  voltage-dependent  sodium  channel  at 
the  nodes  of  Ranvier  of  nerve  axons.  Nodes  of 
Ranvier  are  specialized  regions  on  the  axons  of 
nerves  where  the  myelin  or  insulation  of  the  axon 
is  interrupted  and  where  ions  can  enter  or  leave 
the  axon  through  ion  channels.  The  localization 
of  the  voltage-dependent  sodium  channel  at 
nodes  of  Ranvier  is  important  for  normal  conduc- 
tion of  nerve  impulses.  We  plan  to  explore  the 
possibility  that  linkage  of  the  sodium  channel  to 
ankyrin  plays  a  role  in  either  confining  or  initial 
targeting  of  the  sodium  channel  to  the  nodes  of 
Ranvier.  These  studies  are  relevant  to  diseases  of 
neurons  such  as  multiple  sclerosis,  where  the  my- 
elin coating  of  axons  is  lost  and  sodium  channels 
are  no  longer  restricted  to  the  nodes  of  Ranvier. 

We  recently  determined  the  complete  amino 
acid  sequence  of  the  major  form  of  ankyrin  in 
human  brain.  The  gene  encoding  brain  ankyrin  is 
located  on  chromosome  4;  the  gene  encoding 
erythrocyte  ankyrin  is  on  chromosome  8.  Surpris- 
ingly, the  same  gene  that  encodes  erythrocyte  an- 
kyrin also  is  expressed  in  brain,  with  high  abun- 
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dance  in  the  cerebellum  and  brain  stem.  Brain 
thus  contains  ankyrins  encoded  by  different 
genes.  The  erythrocyte  form  of  ankyrin  is  missing 
in  a  strain  of  mutant  mice  developed  at  the  Jack- 
son Laboratory.  The  ankyrin-deficient  mice  expe- 
rience degeneration  of  Purkinje  cells,  a  major 
type  of  neuron  in  the  cerebellum,  and  develop  a 
stagger  and  difficulty  in  walking.  These  mice  re- 
tain ankyrin  at  the  nodes  of  Ranvier,  suggesting 
that  ankyrin  at  the  nodes  is  the  product  of  yet 
another  gene.  The  neurological  problems  of  an- 
kyrin-deficient mice  may  have  counterparts  in 
humans  w^ith  slowly  progressive  diseases  due  to 
death  of  nerve  cells. 

The  diversity  of  ankyrins  suggests  that  this  fam- 
ily of  proteins  may  interact  with  many  membrane 
proteins  in  addition  to  the  sodium  channel.  We 
have  used  the  membrane-binding  domain  of  the 
major  form  of  brain  ankyrin  to  isolate  ankyrin- 
binding  proteins  and  have  identified  at  least  10 
new  proteins.  The  consequence  of  linkage  of 
these  proteins  to  the  membrane  skeleton  could 
be  to  localize  them  in  specialized  regions  of  the 
cell  membrane  appropriate  for  their  function. 

Ankyrin-Independent  Membrane 
Attachment  Sites  for  Spectrin 

Brain  spectrin  can  also  associate  directly  with 
membrane  proteins  through  an  interaction  that  is 
independent  of  ankyrin.  We  recently  discovered 
that  calcium,  in  concert  with  three  different  cal- 
cium-regulated proteins  (calmodulin,  a  calcium- 
activated  protease,  and  protein  kinase  C)  inhibits 
the  direct  spectrin-membrane  linkage  but  has  no 
effect  on  spectrin-ankyrin  interactions.  These  re- 
sults suggest  that  the  spectrin  skeleton  includes 
both  stable,  ankyrin-mediated  linkages  and  dy- 
namic calcium-sensitive  associations  that  are  sub- 
ject to  metabolic  control.  Identification  of  the 
spectrin  "receptor"  is  the  first  step  in  under- 
standing the  role  of  this  type  of  spectrin-mem- 
brane interaction  in  cells.  We  are  currently  char- 
acterizing a  protein  that  is  a  candidate  for  a  role  in 
linking  spectrin  to  the  membrane. 

Ankyrin  Structure 

Ankyrin  contains  three  independently  folded 


domains:  one  domain  interacts  with  certain  mem- 
brane proteins,  another  associates  with  spectrin, 
and  a  third  regulates  associations  of  the  binding 
domains.  Surprisingly,  the  membrane-binding 
domain  of  ankyrin  includes  an  amino  acid  se- 
quence that  is  homologous  to  regions  of  se- 
quence in  a  group  of  apparently  unrelated  pro- 
teins from  flies,  yeast,  and  even  viruses.  We 
recently  discovered  that  this  conserved  portion 
of  the  ankyrin  sequence  is  responsible  for  inter- 
action of  ankyrin  with  at  least  one  membrane  pro- 
tein. Ankyrin  thus  contains  a  highly  conserved 
and  ancient  structural  motif  that  may  have  a  gen- 
eral role  in  molecular  recognition.  We  hope  to 
determine  the  three-dimensional  structure  of  this 
portion  of  ankyrin  and  expect  that  this  struc- 
ture will  increase  understanding  of  the  interac- 
tions of  ankyrin  and  other  proteins  with  related 
sequences. 

Adducin 

The  protein  adducin  is  a  candidate  to  play  a 
role  in  assembly  of  the  spectrin  skeleton  in  eryth- 
rocytes, brain,  and  certain  epithelial  tissues.  We 
have  found  that  adducin  is  localized  at  sites  of 
cell-cell  contact  in  epithelial  tissues.  Adducin 
and  spectrin  are  colocalized  at  cell  contact  sites 
and  may  be  arranged  in  a  structure  analogous  to 
the  spectrin  network  of  erythrocytes.  The  associa- 
tion of  adducin  with  cell-cell  contact  sites  occurs 
before  assembly  of  other  types  of  specialized  cell 
junctions  such  as  desmosomes. 

Our  working  hypothesis  is  that  adducin  pro- 
motes assembly  of  a  stable  spectrin  network  at 
sites  of  cell-cell  contact.  A  further  hypothesis  is 
that  the  spectrin  network  is  an  essential  precon- 
dition for  assembly  of  specialized  cell  junctions. 
Formation  of  appropriate  cell-cell  contacts  and 
cell  junctions  is  an  essential  event  in  embryogen- 
esis  and  is  one  of  the  processes  that  is  disturbed  in 
diseases  such  as  cancer.  We  are  excited  by  the 
possibility  that  adducin  and  spectrin  may  partici- 
pate in  such  a  fundamental  activity  of  cells.  We 
have  determined  the  complete  protein  sequence 
of  both  adducin  subunits.  Future  experiments 
will  evaluate  the  role  of  adducin  and  spectrin  in 
formation  of  junctions  between  cells. 
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THIS  laboratory  studies  basic  mechanisms  that 
lead  to  septic  shock,  a  serious  condition  aris- 
ing as  a  result  of  many  types  of  infection.  We  have 
learned  that  the  final  common  pathway  leading  to 
shock  involves  the  production  of  certain  cyto- 
kines, particularly  tumor  necrosis  factor  (TNF), 
by  host  cells  known  as  macrophages.  These  cells, 
originally  derived  from  a  white  blood  cell,  the 
monocyte,  exist  in  many  tissues. 

Once  TNF  has  been  produced,  it  alters  the  me- 
tabolism of  cells  throughout  the  body,  triggering 
a  breakdown  of  protein  and  fat  stores.  If  TNF  is 
chronically  secreted  at  low  levels,  a  state  of  wast- 
ing called  cachexia  will  develop.  This  condition 
is  seen  in  cancer  and  many  other  forms  of  chronic 
illness.  On  the  other  hand,  if  massive  quantities 
of  TNF  are  released  over  a  short  time,  as  in  wide- 
spread injury,  the  protein  activates  neutrophils 
and  endothelial  cells  in  such  a  way  that  shock 
occurs. 

Because  TNF  is  a  critically  important  molecule 
in  various  human  disease  processes,  we  have 
sought  to  understand  how  its  biosynthesis  is  con- 
trolled. Probably  the  most  potent  stimulus  for 
TNF  release  is  a  molecule  known  as  lipopolysac- 
charide,  or  endotoxin.  This  molecule  is  pro- 
duced by  gram-negative  bacteria,  which  have  a 
remarkable  tendency  to  cause  shock.  In  the 
course  of  a  gram-negative  infection,  endotoxin  is 
released  into  the  bloodstream.  It  is  harmless  to 
most  cells,  but  is  a  powerful  activator  of  mono- 
cytes and  macrophages,  triggering  their  release  of 
TNF  with  all  of  its  attendant  consequences. 

By  studying  different  portions  of  the  TNF  gene, 
we  have  shown  that  endotoxin  causes  two  sepa- 
rate responses  within  the  macrophage.  One,  it 
causes  increased  transcription  of  the  TNF  gene, 
leading  to  a  marked  accumulation  of  TNF  mRNA 
within  macrophages.  Two,  it  causes  far  more  effi- 
cient translation  of  the  mRNA — i.e.,  increases  the 
speed  with  which  the  mRNA  is  read  to  produce 
TNF  protein.  Acting  in  concert,  these  two  effects 
are  responsible  for  a  10,000-fold  increase  in  the 
rate  of  TNF  biosynthesis  and  thus  a  massive  net 
effect. 

For  a  number  of  technical  reasons,  it  has  been 


very  difficult  to  demonstrate  the  major  sources  of 
TNF  in  living  animals.  It  is  not  clear,  for  example, 
whether  it  is  made  by  normal  tissues  in  healthy 
animals  or  whether  such  "baseline"  production 
is  important  for  maintenance  of  physiological  or 
immunological  processes.  Similarly,  it  is  not 
clear  whether  the  TNF  that  arises  in  cancer  is  de- 
rived from  cells  of  the  tumor  or  from  host  cells 
that  act  in  response  to  the  tumor.  TNF  is  believed 
to  be  made  in  a  variety  of  autoimmune  or  allergic 
diseases,  but  again,  the  principal  source  of  the 
protein  remains  uncertain. 

To  address  these  questions,  our  laboratory  has 
produced  transgenic  mice  that  express  a  reporter 
construct,  in  which  an  easily  measurable  enzyme 
(chloramphenicol  acetyltransferase,  or  CAT)  is 
employed  as  a  marker  for  TNF.  In  cells  that  pro- 
duce TNF,  in  other  words,  CAT  synthesis  also  oc- 
curs. CAT  remains  confined,  however,  to  the  cell 
of  origin,  whereas  TNF  is  secreted  and  becomes 
widely  dispersed  in  the  organism.  Using  these  an- 
imals, we  have  found  that  during  normal  develop- 
ment TNF  is  made  by  cells  of  the  thymus.  Other 
investigators  have  further  reported  that  thymic 
production  of  TNF  is  essential  for  normal  devel- 
opment. While  the  protein  does  not  appear  to  be 
produced  elsewhere  in  healthy  animals,  it  is 
readily  induced  by  administration  of  lipopolysac- 
charide  (LPS)  or  by  various  authentic  infections. 

Our  laboratory  has  also  made  progress  in  un- 
derstanding the  mechanism  of  action  of  drugs 
that  inhibit  TNF  biosynthesis  and  in  devising  mol- 
ecules that  block  the  action  of  TNF  once  it  has 
been  released.  These  studies  might  lead  to  better 
therapies  for  shock  and  other  disorders.  Gluco- 
conicoid  hormones  (e.g.,  prednisone,  dexameth- 
asone,  and  Cortisol)  have  long  been  used  as  anti- 
inflammatory drugs.  One  of  their  principal 
effects  appears  to  be  a  blocking  of  TNF  biosynthe- 
sis, which  depends  upon  inhibition  of  both  TNF 
gene  transcription  and  mRNA  translation.  Other 
drugs  of  a  class  known  as  phosphodiesterase  in- 
hibitors (e.g.,  theophylline,  caffeine,  and  pen- 
toxifylline) also  block  TNF  biosynthesis,  achiev- 
ing their  effect  by  preventing  TNF  mRNA 
accumulation.  They  appear  to  function  at  a  dif- 
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ferent  locus  from  that  of  glucocorticoids.  We 
have  shown  that  the  two  classes  of  drugs  when 
combined  assert  a  synergistic  effect. 

Recently  the  cell-surface  receptor  for  TNF  was 
cloned  in  a  number  of  laboratories.  We  have  engi- 
neered a  recombinant  molecule  in  which  the  TNF 
receptor  is  attached  to  a  portion  of  a  normal  anti- 
body, yielding  a  new  protein  molecule  in  which 


two  TNF-binding  sites  are  expressed.  This  biva- 
lent TNF-binding  protein  strongly  inhibits  the  bi- 
ological effects  of  TNF,  is  highly  stable  in  vivo, 
and  may  be  produced  in  large  quantities  by  re- 
combinant techniques.  We  anticipate  that  this 
type  of  molecule  will  allow  a  through  investiga- 
tion of  the  many  effects  of  TNF  in  health  and  dis- 
ease and  may  also  be  useful  as  a  therapeutic  tool. 


42 


T  Lymphocyte  Ontogeny  and  Specificity 


Michael  J.  Sevan,  Ph.D. — Investigator 

Dr.  Bevan  is  also  Professor  in  the  Department  of  Immunology  at  the  University  of  Washington,  Seattle.  He 
received  his  Ph.D.  degree  for  work  performed  at  the  National  Institute  of  Medical  Research,  Mill  Hill,  Ion- 
don,  England.  He  did  his  postdoctoral  work  at  the  Salk  Institute  in  the  laboratory  of  Melvin  Cohn,  after 
which  he  was  Assistant  and  Associate  Professor  of  Biology  in  the  Center  for  Cancer  Research  and  the  De- 
partment of  Biology  at  the  Massachusetts  Institute  of  Technology.  He  was  a  member  of  the  Department  of 
Immunology  at  the  Research  Institute  of  Scripps  Clinic  before  moving  to  the  University  of  Washington. 
Dr.  Bevan  was  recently  elected  Fellow  of  the  Royal  Society. 


T lymphocytes  mature  in  the  thymus  from 
bone  marrow-derived  hematopoietic  stem 
cells.  A  key  part  of  that  process  is  the  rearrange- 
ment of  T  cell  receptor  genes,  the  products  of 
which  will  be  expressed  on  the  surface  to  recog- 
nize foreign  antigens.  The  receptors,  while 
highly  varied,  are  expressed  clonally:  each  T  cell 
expresses  only  one  kind  of  receptor. 

Two  functionally  different  types  of  T  lympho- 
cytes emerge  from  the  thymus  to  populate  the 
peripheral  lymphoid  organs.  These  are  usually 
called  helper  and  cytotoxic.  In  response  to  recog- 
nizing foreign  antigen,  helper  T  lymphocytes  re- 
lease lymphokines,  which  augment  the  response 
of  other  lymphocytes,  including  B  cells,  to  the 
antigen.  Cytotoxic  T  lymphocytes  specifically 
recognize  cells  that  have  become  infected  with 
intracellular  pathogens  and  kill  them  efficiently. 

Although  the  same  pool  of  receptor  genes  is 
used  by  both  types  of  T  lymphocytes,  they  are 
able  to  respond  to  quite  different  types  of  antigen 
by  virtue  of  the  different  classes  of  MHC  (major 
histocompatibility  complex)  molecules  engaged 
in  antigen  presentation.  The  MHC  encodes  two 
types  of  surface  glycoproteins:  class  I  and  class  II. 
Killer  cells  recognize  peptide  components  of  for- 
eign proteins  bound  in  the  groove  of  class  I  MHC 
molecules,  while  helper  cells  recognize  peptide 
antigens  presented  in  the  groove  of  class  II 
molecules. 

Class  I  MHC  glycoproteins  are  assembled  with 
(82-microglobulin  and  a  foreign  peptide  fragment 
shortly  after  synthesis  in  the  endoplasmic  reticu- 
lum. The  peptides  that  become  part  of  the  class  I 
complex  are  derived  from  intracellular  self  and 
foreign  (viral)  proteins.  Class  I  MHC  molecules 
do  not  associate  with  antigens  that  are  taken  up 
from  the  extracellular  fluid.  This  is  the  world  of 
class  II  presentation.  Soluble  extracellular  anti- 
gens endocytosed  by  antigen-presenting  cells  are 
digested,  then  bound  by  class  II  molecules  in  en- 
dosomes.  Because  of  this  dichotomy  in  antigen 
presentation,  the  body  mounts  a  helper  T  cell  re- 
sponse to  noninfectious  extracellular  antigens, 
and  it  mounts  a  cytotoxic  T  cell  response  to  cells 
infected  with  viruses  or  bacteria. 


The  generation  of  appropriate  T  cell  receptors 
by  gene  rearrangement  is  not  left  to  chance.  Im- 
mature T  cells  expressing  new  receptors  on  their 
surface  are  only  allowed  to  mature  if  the  receptor 
has  an  appropriate  affinity  for  class  I  or  class  II 
self  MHC  molecules.  This  process,  referred  to  as 
"positive  selection  of  the  T  cell  receptor  reper- 
toire," is  quite  stringent.  Ninety  percent  of  im- 
mature T  cells  expressing  new  T  cell  receptors 
fail  this  selection  step  and  die  in  the  thymus. 

Selecting  the  T  Cell  Repertoire 

T  cells  recognize  fragments  of  foreign  protein 
antigens  bound  in  the  groove  of  MHC-encoded 
molecules.  The  peptide-binding  groove  of  the 
MHC  molecule  is  extremely  polymorphic,  so  that 
different  alleles  bind  a  different  range  of  pep- 
tides. We  know  the  crystal  structure  of  a  class  I 
MHC  molecule,  and  we  have  a  good  guess  as  to 
how  peptide  lies  within  it.  Although  we  do  not 
know  the  structure  of  the  T  cell  receptor,  a  num- 
ber of  groups  have  suggested  that  the  receptor 
recognizes  self  MHC  plus  foreign  peptide  by  con- 
tacting their  residues  on  the  face  of  the  complex. 

Different  alleles  of  MHC  molecules  expressed 
in  the  thymus  select  quite  different  T  cells  for 
maturation  based  on  their  receptor  specificity. 
Since  positive  selection  of  the  T  cell  repertoire  is 
done  in  the  absence  of  foreign  peptides,  it  was 
commonly  supposed  that  positive  selection 
would  be  based  on  the  affinity  of  the  T  cell  recep- 
tor for  the  exposed  outer  faces  of  the  MHC.  We 
were  able  to  show,  however,  by  comparing  the  T 
cell  receptor  repertoire  selected  by  variant  class  I 
molecules,  that  residue  changes  within  the  pep- 
tide-binding groove — i.e.,  not  exposed  on  the 
outside  of  the  molecule — determine  the  specific- 
ity of  selection.  The  immediate  interpretation  is 
that  the  MHC  class  I  groove  is  occupied  by  a  self 
peptide  during  positive  selection  in  the  thymus, 
and  the  type  of  peptide  is  all  important  for  the 
result.  Our  studies  also  suggest  that  a  large  variety 
of  self  peptides  presented  by  the  same  class  I  mol- 
ecule select  for  different  portions  of  the  T  cell 
repertoire.  These  unknown  self  peptides  are 
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therefore  set  up  as  mimics  of  the  foreign  anti- 
genic world,  predicting  the  antigenic  universe 
that  the  immune  system  will  encounter  after  thy- 
mic maturation. 

Class  I  MHC  Antigen  Presentation 

The  class  I  heavy  and  light  chains  have  amino- 
terminal  signal  sequences  so  that  they  are  made 
on  membrane-bound  ribosomes  and  assemble  in 
the  endoplasmic  reticulum.  Many  of  the  peptide 
antigens  that  they  complex  with  and  present  to 
cytotoxic  T  cells,  however,  derive  from  internal 
viral  proteins  that  are  made  in  the  cytoplasm  on 
free  ribosomes.  It  seems  likely  that  there  is  a  pep- 
tide transporter  located  in  a  pre-Golgi  membrane 
that  allows  the  transfer  of  unfolded  molecules  or 
peptide  fragments  from  the  cytoplasm  into  the 
compartment  occupied  by  nascent  class  I  mole- 
cules. Because  of  the  nature  of  class  I-restricted 
antigen  presentation,  the  readily  and  cheaply 
available  protein  antigens  such  as  egg  proteins 


and  serum  proteins  could  not  be  used  to  stimu- 
late a  cytotoxic  response.  We  overcame  this 
problem  by  using  the  genes  for  chicken  ovalbu- 
min and  (8-galactosidase  transfected  into  mouse 
cells  to  generate  such  class  I-restricted  re- 
sponses. By  this  entirely  artificial  procedure,  we 
converted  these  antigens  into  endogenously  syn- 
thesized mouse  proteins  and  were  readily  able  to 
stimulate  a  class  I-restricted  cytotoxic  response. 

This  system  has  been  used  to  study  how  antigen 
gains  access  to  the  class  I  pathway  of  antigen  pro- 
cessing and  presentation.  Soluble  protein  antigen 
can  be  introduced  into  the  cytoplasm  and  then 
into  the  class  I  antigen-processing  pathway  by  the 
osmotic  lysis  of  pinosomes  or  by  the  uptake  of 
acid-sensitive  liposomes  that  fuse  with  endoso- 
mal  membranes.  We  have  used  these  systems  to 
study  the  kinetics  of  antigen  breakdown  and  pre- 
sentation and  to  ask  what  degradation  systems  are 
involved  in  antigen  processing  for  class  I- 
restricted  presentation. 
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THE  technique  of  x-ray  crystallography  allows 
visualization  of  the  three-dimensional  struc- 
tures of  proteins  in  atomic  detail.  In  other  words, 
we  get  a  picture  of  the  protein  that  shows  the 
location  of  all  the  atoms  and  how  they  interact. 
The  shape  of  a  protein  and  the  location  of  individ- 
ual atoms  with  respect  to  one  another  are  impor- 
tant for  determining  how  the  protein  functions. 
With  such  knowledge  it  is  often  possible  to  de- 
sign compounds  that  modify  the  protein  for  medi- 
cal intervention. 

The  proteins  that  we  are  studying  structurally 
are  those  that  mediate  the  immune  response 
against  viruses  and  other  pathogens.  The  immune 
system  has  evolved  so  that  highly  specific  mol- 
ecules on  the  surfaces  of  lymphocytes  can  recog- 
nize a  virally  infected  cell.  In  the  infected  cell, 
pieces  of  viral  proteins  are  fragmented  and  bound 
to  a  cellular  protein  called  a  histocompatibility 
molecule.  If  the  complex  formed  between  the 
histocompatibility  molecule  and  the  viral  frag- 
ment is  recognized  by  a  protein  on  the  lympho- 
cyte, the  infected  cell  is  destroyed.  The  lympho- 
cytes that  bear  the  recognizing  proteins  are  T 
cells,  and  the  proteins  on  their  surface  are  T  cell 
receptors. 

My  previous  work  in  the  laboratory  of  Don  Wi- 
ley (HHMI,  Harvard  University)  used  x-ray  crystal- 
lography to  determine  the  structure  of  a  histo- 
compatibility molecule.  We  learned  how  and 
where  viral  molecules  bind  to  histocompatibility 
protein  and  how  T  cell  receptors  might  bind  to 
this  complex.  My  laboratory  now  seeks  to  deter- 
mine a  three-dimensional  structure  for  a  T  cell 
receptor,  in  order  to  understand  the  atomic  de- 
tails of  its  interaction  with  the  surface  of  an 
infected  cell.  (This  work  is  being  done  in  collabo- 
ration with  Mark  Davis,  HHMI,  Stanford  Univer- 
sity.) We  will  also  try  to  make  crystals  of  a 
complex  between  a  T  cell  receptor  and  histocom- 
patibility molecule  to  be  used  in  a  three-dimen- 
sional structure  determination.  An  understanding 
of  how  T  cell  receptors  interact  with  histocompat- 
ibility molecules  complexed  to  viral  fragments 
should  increase  our  understanding  of  how  the 


immune  system  distinguishes  normal,  healthy 
cells  from  virally  infected  cells  that  need  to  be 
destroyed. 

Our  laboratory  is  also  using  protein  expression 
systems  in  mammalian  and  bacterial  cells  to  pro- 
duce the  large  quantities  of  proteins  necessary  for 
crystallization  and  structural  studies.  Using  mo- 
lecular biological  techniques,  it  is  possible  to 
transfect  a  protein-encoding  gene  into  a  cell  in 
which  it  is  not  normally  found,  thus  persuading 
the  cell  to  manufacture  that  protein.  Many  pro- 
teins normally  occur  in  small  quantities;  the  use 
of  such  protein  expression  systems  allows  the 
isolation  of  much  more  of  these  proteins  than 
would  be  otherwise  possible.  We  have  expressed 
fragments  of  histocompatibility  molecules  in  bac- 
teria in  preparation  for  making  complexes  be- 
tween these  proteins  and  antigenic  peptides.  We 
are  also  working  on  protein  expression  in  mam- 
malian cells. 

In  collaboration  with  Kai  Zinn  at  Caltech,  we 
have  expressed  in  a  hamster  fibroblast  cell  line 
large  quantities  of  a  cell  surface  protein  found  in 
the  grasshopper  and  fruit  fly  nervous  systems.  We 
are  interested  in  the  structure  of  this  molecule, 
because  many  molecules  involved  in  cell  surface 
recognition  in  the  immune  and  nervous  systems 
are  related  to  each  other.  This  protein  has  now 
been  purified  and  crystallized.  We  are  continuing 
to  investigate  ways  to  improve  the  quality  of  the 
crystals,  as  the  current  crystals  do  not  diffract  to 
sufficient  resolution  to  allow  a  crystal  structure 
determination. 

We  have  also  used  a  similar  protein  expression 
system  to  make  an  Fc  receptor,  a  protein  that 
binds  to  the  Fc  portion  of  antibody  molecules. 
(This  work  is  in  collaboration  with  Neil  Simister, 
Brandeis  University.)  This  particular  Fc  receptor 
is  found  in  the  intestine  of  newborn  mammals 
and  binds  maternal  immunoglobulin  in  milk, 
thus  transferring  partial  immunity  from  mother  to 
infant.  The  amino  acid  sequence  and  structural 
organization  of  this  molecule  show  similarities  to 
histocompatibility  molecules.  We  have  made  a 
soluble  (non-membrane-bound)  version  of  this 
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molecule,  which  is  still  capable  of  binding  to  Fc 
molecules.  We  are  currently  working  out  a  purifi- 
cation scheme  for  this  protein  and  will  try  to  crys- 
tallize the  protein  in  preparation  for  an  x-ray 


structure  determination.  We  also  have  crystals  of 
a  structurally  unrelated  Fc  receptor  and  will 
compare  the  binding  of  these  two  receptors  to 
their  common  ligand. 
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OUR  laboratory  has  been  studying  the  molecu- 
lar mechanisms  of  action  of  insulin  and 
other  polypeptide  growth  factors.  The  goal  of 
these  studies  is  to  understand  the  biochemical 
steps  between  the  binding  of  insulin  to  its  recep- 
tors on  the  surface  of  its  target  cells  and  the  ulti- 
mate stimulation  of  changes  in  enzyme  activity, 
protein  synthesis,  and  gene  expression.  An  under- 
standing of  the  molecular  details  of  insulin's  ac- 
tions may  lead  to  the  biochemical  characteriza- 
tion of  the  insulin  resistance  seen  in  certain 
common  disorders,  such  as  obesity  and  type  II 
diabetes,  and  possibly  to  novel  therapies  for  these 
disorders. 

We  are  studying  the  intracellular  actions  of  in- 
sulin and  related  polypeptide  growth  factors  at 
several  levels.  One  of  the  first  effects  of  insulin 
on  cells  is  the  activation  of  an  enzyme  activity 
intrinsic  to  those  receptors,  a  protein  kinase  activ- 
ity that  transfers  a  phosphate  group  from  the  cel- 
lular energy  source  ATP  to  a  specific  amino  acid, 
tyrosine,  in  target  proteins.  Later  effects  of  insu- 
lin include  the  activation  of  other  cellular  pro- 
tein kinases  that  catalyze  the  phosphorylation  of 
various  cellular  proteins  on  serine  and  threonine 
residues.  In  the  past  year  we  identified  one  of 
these  insulin-activated  kinases  as  the  Raf-1  pro- 
tein kinase,  a  proto-oncogene  product  that  is  re- 
lated to  the  transforming  oncogene  of  a  murine 
sarcoma  virus.  The  activation  of  this  protein  ki- 
nase was  accompanied  by  its  phosphorylation  on 
serine  and  threonine  residues.  In  our  laboratory 
Ruey-Min  Lee  recently  identified  two  insulin- 
activated  protein  kinases  that  in  turn  phosphory- 
late  the  Raf-1  protein;  these  kinases  are  activated 
within  two  minutes  of  cellular  exposure  to  insu- 
lin and  may  represent  one  of  the  missing  links 
between  the  tyrosine  kinase  activity  of  the  insulin 
receptor  and  the  serine  kinase  activity  of  the 
Raf-1  protein  kinase. 

Another  well-known  action  of  insulin  is  the 
stimulation  of  protein  synthesis  in  its  target  cells. 
The  synthesis  of  many  proteins  is  increased  as 
part  of  insulin's  general  anabolic  effect.  How- 
ever, insulin  also  stimulates  the  synthesis  of  sev- 
eral proteins  to  a  much  greater  extent  than  the 


overall  average  of  cellular  proteins;  the  synthesis 
of  many  of  these  proteins  appears  to  be  stimu- 
lated at  the  level  of  messenger  RNA  translation. 
Joyce  Manzella  is  currently  studying  the  transla- 
tion of  the  mRNA  for  the  enzyme  ornithine  decar- 
boxylase as  an  example  of  insulin-stimulated 
mRNA  translation.  She  recently  found  a  direct 
correlation  between  the  ability  of  insulin  to  stim- 
ulate mRNA  translation  and  the  extent  of  pre- 
dicted intrachain  folding  or  secondary  structure 
for  that  mRNA.  In  searching  for  a  mechanism  for 
this  effect,  she  found  that  insulin  stimulated  the 
phosphorylation — and  presumably  the  activa- 
tion— of  two  translation  initiation  factors  whose 
actions  involve  unwinding  or  "melting"  those 
secondary  mRNA  structures.  These  studies  have 
led  her  to  propose  a  general  model  for  insulin- 
stimulated  mRNA  translation  based  on  the  insulin- 
stimulated  unwinding  of  mRNA  secondary 
structures. 

Insulin  also  stimulates  the  transcription  of 
many  genes  shortly  after  its  addition  to  sensitive 
target  cells.  One  such  gene,  the  c-fos  proto-onco- 
gene, has  been  studied  in  detail.  The  transcrip- 
tion of  this  gene  is  stimulated  by  insulin  within 
approximately  5-10  min  of  cellular  exposure. 
Recent  studies  by  Rajesh  Malik  and  Michael  Roe 
have  identified  two  cellular  proteins  that  appear 
to  be  modified  by  insulin  and  growth  factor  treat- 
ment, so  that  their  binding  to  the  regulatory  re- 
gion of  the  c-fos  gene  is  increased.  They  are 
currently  attempting  to  purify  and  clone  these 
proteins,  with  the  goal  of  determining  how  insu- 
lin stimulates  their  interaction  with  the  c-fos 
gene  promoter.  Wi-Sheung  Lai  has  recently 
cloned  another  gene  whose  rapid  transcription  is 
stimulated  by  insulin,  the  tris-tetraproline  (TTP) 
gene;  similar  studies  of  its  regulatory  elements 
are  under  way.  The  protein  encoded  by  this  rap- 
idly turned  on  gene  appears  to  belong  to  a  novel 
class  of  zinc  finger-containing  proteins  that  are 
probably  DNA-binding  proteins;  efforts  to  deter- 
mine the  DNA  targets  of  this  protein  are  ongoing. 

Other  protein  kinases  are  involved  in  the  intra- 
cellular responses  to  a  variety  of  polypeptide 
hormones  and  growth  factors  other  than  insulin. 
For  example,  some  of  these  growth  factors  also 
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stimulate  the  rapid  breakdown  of  membrane  lip- 
ids, generating  intracellular  signals  that  activate 
an  intracellular  serine/threonine  kinase  known 
as  protein  kinase  C,  and  increase  concentrations 
of  intracellular  calcium.  Activation  of  protein  ki- 
nase C  leads  to  the  phosphorylation  of  many  in- 
tracellular proteins  and  subsequent  metabolic 
processes,  including  cell  growth,  hormone  bio- 
synthesis, and  neurotransmission.  We  have  been 
studying  one  prominent  substrate  for  protein  ki- 
nase C,  the  myristoylated  alanine-rich  C  kinase 


substrate  (MARCKS)  protein,  which  is  phosphor- 
ylated  within  seconds  of  protein  kinase  C  activa- 
tion in  intact  cells.  Recent  studies  have  suggested 
a  role  for  this  protein  in  regulation  of  the  cellular 
availability  of  calmodulin,  a  calcium-binding 
protein  that  can  activate  a  number  of  important 
enzymes  that  respond  to  changes  in  intracellular 
calcium.  These  studies  may  lead  to  the  biochemi- 
cal characterization  of  a  link  between  the  protein 
kinase  C  and  calcium  pathways  activated  by  sev- 
eral polypeptide  growth  factors. 
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NUMEROUS  structurally  and  functionally  di- 
verse proteins  can  be  translocated  across  a 
few  distinct  cellular  membranes.  It  is  now  estab- 
lished that  targeting  to  these  membranes  and 
translocation  across  them  is  specified  by  a  mem- 
brane-specific "signal"  sequence.  This  is  a  part, 
transient  (or  permanent),  of  the  protein  to  be 
translocated.  The  primary  structure  of  numerous 
representatives  for  such  a  sequence  has  been  es- 
tablished in  our  laboratory  and  others.  Present 
work  focuses  on  the  identification  and  character- 
ization of  machinery  involved  in  the  recognition 
of  a  signal  sequence,  in  its  targeting  to  the  proper 
membrane,  and  in  protein  translocation.  Collec- 
tively, this  machinery  comprises  what  has  been 
termed  a  protein  translocon. 

Our  laboratory  is  working  on  translocons  for 
four  distinct  cellular  entities,  namely  1)  the  en- 
doplasmic reticulum  (ER)  of  animal  (and  yeast) 
cells,  which  is  able  to  translocate  proteins  from 
the  cytosol  to  the  ER  lumen;  2)  bacterial  plasma 
membranes  (gram-negative  bacteria),  able  to 
translocate  proteins  from  the  cytoplasm  to  the 
periplasmic  space;  3)  yeast  mitochondria,  able  to 
translocate  protein  from  the  cytoplasm  to  the  mi- 
tochondrial interior  ("matrix")  across  outer  and 
inner  mitochondrial  membranes;  and  4)  plant 
cell  (pea  and  spinach)  chloroplasts,  able  to  trans- 
locate protein  from  the  cytoplasm  to  the  chloro- 
plast  interior  ("stroma")  across  outer  and  inner 
chloroplast  membranes. 

From  studies  on  these  four  translocons  so  far, 
but  especially  on  the  ER  one,  it  is  likely  that  these 
and  others  are  composed  of  at  least  four  entities: 
1)  a  soluble  signal  recognition  factor  (SRF),  2) 
a  homing  receptor,  3)  a  signal  sequence-gated 
protein- conducting  channel,  and  4)  a  signal-re- 
moving peptidase.  SRF  has  as  its  functions  «)  rec- 
ognition of  the  signal  sequence  and  b)  targeting 
to  the  homing  receptor,  which  is  restricted  in  its 
localization  to  a  translocon-specific  membrane. 
Interaction  of  the  signal  sequence-SRF  complex 
with  the  homing  receptor  leads  to  a)  dissociation 
of  the  signal  sequence  and  b)  its  presentation  to  a 
signal  sequence  receptor,  which  might  be  a  sub- 
unit  of  the  protein-conducting  channel.  This 


channel  would  close  immediately  upon  comple- 
tion of  translocation,  only  to  open  again  after  pre- 
sentation of  another  signal  sequence.  The  pepti- 
dase would,  in  most  cases,  remove  the  signal 
sequence  either  during  translocation  or  shortly 
thereafter. 

In  the  case  of  the  ER,  the  SRF  was  isolated  and 
shown  to  be  a  ribonucleoprotein  particle.  Re- 
ferred to  as  a  signal  recognition  particle  (SRP), 
this  consists  of  one  7S  RNA  molecule  and  six  dif- 
ferent proteins.  Likewise,  a  homing  receptor  (re- 
ferred to  as  SRP  receptor)  and  a  signal-removing 
peptidase,  a  complex  of  five  proteins,  were  iso- 
lated. More  recently  we  were  able  to  demonstrate 
the  existence  of  a  protein-conducting  channel, 
using  electrophysiological  methods.  The  protein 
constituents  of  this  channel  remain  to  be  identi- 
fied. We  have  been  able  to  solubilize  the  ER 
membranes  by  detergent  and  to  reconstitute 
translocation-competent  vesicles.  Using  this 
method,  it  should  be  possible  to  identify  the 
channel  proteins. 

We  recently  identified  components  of  other 
translocons.  An  SRF  was  isolated  for  signal  se- 
quence targeted  to  the  bacterial  plasma  mem- 
brane. Moreover,  we  identified  signal  sequence- 
binding  subunits  for  the  mitochondrial  and 
chloroplast  translocons.  These  integral  mem- 
brane proteins  are  located  in  contact  sites  be- 
tween outer  and  inner  organelle  membranes  and 
are  candidates  for  subunits  of  a  protein-conduct- 
ing channel  in  the  outer  organelle  membrane 
linked  to  a  protein-conducting  channel  in  the  in- 
ner membrane. 

Our  other  major  research  effort  focuses  on  the 
organelles  associated  with  the  cell's  nuclear  en- 
velope membranes.  These  organelles  are  thought 
to  organize  the  large  amount  of  information  in 
the  linear  structure  of  the  DNA  into  numerous 
structurally  and  functionally  distinct  three- 
dimensional  superstructures,  allowing  only  a  lim- 
ited amount  of  that  information  to  be  expressed. 
Characterization  of  these  organelles  should  ad- 
vance understanding  of  such  fundamental  pro- 
cesses as  differential  gene  expression,  cell  differ- 
entiation, and  development. 
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Our  efforts  focus  on  the  structural  and  func- 
tional characterization  of  two  morphologically 
distinct  structures.  One,  the  nuclear  pore  com- 
plex, is  located  in  the  nuclear  envelope.  We  spec- 
ulate that  this  organelle  is  involved  in  "gene  gat- 
ing"— i.e.,  each  pore  complex  is  attached  to  a  set 
of  actively  transcribed  genes.  We  have  identified 
and  isolated  several  proteins  of  the  pore  complex 
and  have  recently  established  the  primary  struc- 
ture of  a  large  membrane  protein  that  is  specifi- 
cally located  in  the  pore  membrane  domain  of 
the  nuclear  envelope  membrane.  We  are  also  us- 
ing cell-free  systems  for  protein  uptake  into  the 
nucleus  to  isolate  the  hypothetical  SRF,  the  hom- 
ing receptor,  and  the  so-called  transporter  of  the 
pore  complex. 

The  other  morphologically  distinct  structure 
associated  with  the  nuclear  envelope  that  we  are 
studying  is  the  nuclear  lamina,  a  fibrous  mesh- 
work  associated  with  the  nuclear  side  of  the  inner 
nuclear  envelope  membrane.  The  lamina  consists 
of  three  proteins,  which  we  have  termed  lamins 
A,  B,  and  C.  We  have  speculated  that  the  nuclear 
lamina  is  involved  in  the  three-dimensional  orga- 
nization of  nontranscribed  chromatin. 

By  molecular  cloning  and  cDNA  sequencing  of 
the  three  lamins,  we  and  others  showed  recently 


that  these  proteins  are  members  of  the  interme- 
diate filament  protein  family.  We  demonstrated 
that  lamin  B  binds  to  the  carboxyl-terminal  por- 
tion of  cytoplasmic  intermediate  filament  pro- 
teins and  that  ankyrin,  a  protein  associated  with 
the  plasma  membrane,  binds  to  the  amino-termi- 
nal  portion  of  cytoplasmic  intermediate  filament 
proteins.  Recently,  we  have  investigated  the  in- 
teraction of  such  a  protein  with  nuclear  lamin  B 
in  more  detail  and  have  localized  the  interacting 
region  to  the  near  carboxyl-terminal  segment  of 
these  proteins.  Interestingly,  a  synthetic  peptide, 
representing  this  lamin  B-binding  site  of  the  cy- 
toplasmic intermediate  filament  protein,  when 
microinjected  into  cells,  led  to  the  detachment  of 
intermediate  filaments  from  the  nucleus. 

These  data  indicate  a  direct  connection  be- 
tween the  plasma  membrane  skeleton  (ankyrin) , 
the  cytoskeleton  (intermediate  filaments),  and 
the  peripheral  nucleoskeleton  (lamina)  through 
the  nuclear  pore.  Moreover,  we  have  recently 
identified  a  lamin  B  receptor  in  the  inner  nuclear 
membrane.  Its  primary  structure  has  been  deter- 
mined by  molecular  cloning  and  cDNA  sequenc- 
ing. It  has  a  number  of  interesting  sites,  suggest- 
ing that  it  may  not  only  interact  with  lamin  B  but 
also  with  DNA. 
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THE  commitment  of  our  laboratory  is  to  inves- 
tigate basic  scientific  problems  that  have  par- 
ticular relevance  for  health  in  the  Third  World. 
Three-quarters  of  the  world's  population  lives  in 
the  Third  World,  and  one-fourth  of  that  popula- 
tion suffers  from  malnutrition  and  disease.  The 
premises  of  our  research  are  that  the  advances  in 
molecular  biology  and  immunology  have  a  great 
deal  to  offer  for  understanding  infectious  diseases 
afflicting  people  in  developing  countries  and,  re- 
ciprocally, that  the  study  of  some  of  those  dis- 
eases can  provide  unique  insights  into  fundamen- 
tal immunological  and  pathogenetic  mechanisms 
of  relevance  to  people  in  the  industrialized 
world. 

The  Importance  of  Mycobacterial  Diseases 

Tuberculosis  and  leprosy  are  both  caused  by 
mycobacteria.  Leprosy  afflicts  10-12  million 
people  in  the  world  and  produces  deformity  in 
30  percent  if  untreated.  Throughout  time  and  in 
all  cultures,  leprosy  has  engendered  a  unique  fear 
and  stigma.  Although  Mycobacterium  leprae  W2is 
the  first  major  bacterial  pathogen  of  humans  ever 
described,  it  remains  one  of  the  very  few  that  has 
never  been  cultivated  in  the  test  tube.  Yet  its  anti- 
gens can  be  produced  and  studied  vicariously  in 
Escherichia  coli  by  means  of  recombinant  DNA 
technology. 

Tuberculosis  is  the  major  infectious  cause  of 
death  in  the  world  today.  Each  year  there  are  8 
million  new  cases  of  tuberculosis  and  3  million 
deaths,  afilicting  primarily  the  most  productive 
element  of  society — young  adults.  AIDS  (ac- 
quired immune  deficiency  syndrome)  causes  a 
breakdown  of  resistance  to  tuberculosis  and  has 
produced  a  grave  increase  in  the  disease,  both  in 
the  developing  and  the  industrialized  countries. 

Immunologic  Unresponsiveness 
and  Leprosy 

One  of  the  fundamental  issues  in  immunology 
is  the  nature  of  immunological  tolerance — i.e., 
the  mechanisms  by  which  cells  in  the  immune 
system  discriminate  between  foreign  antigens 
and  self  antigens,  and  prevent  responses  to  self.  A 


failure  of  tolerance  to  self  antigens  leads  to  au- 
toimmune diseases,  such  as  rheumatoid  arthritis, 
juvenile  diabetes,  and  perhaps  multiple  sclerosis. 
The  principal  mechanism  for  developing  toler- 
ance is  thought  to  be  deletion  of  clones  of  poten- 
tially autoreactive  T  cells  in  the  thymus  during 
neonatal  life.  It  is  becoming  increasingly  clear, 
however,  that  not  all  such  clones  can  be  deleted 
in  the  thymus  and  that  there  must  be  additional 
mechanisms  by  which  self-reactive  cells  can  be 
rendered  unresponsive  after  birth. 

Leprosy  provides  a  unique  model  with  which 
to  study  immunoregulation  and  unresponsive- 
ness in  humans.  The  disease  comprises  a  spec- 
trum of  clinical  entities.  In  the  tuberculoid  form, 
strong  cell-mediated  immunity  kills  the  organ- 
ism, but  damages  nerves  in  the  process.  In  the 
lepromatous  form,  at  the  other  end  of  the  spec- 
trum, patients  are  unable  to  respond  immunologi- 
cally to  M.  leprae  antigens.  Since  infection  oc- 
curs after  birth,  there  is  little  evidence  of  clonal 
deletion  of  T  cells  capable  of  reacting  to  the  or- 
ganism. Therefore,  understanding  the  mecha- 
nisms of  that  unresponsiveness  is  relevant  to  pre- 
venting autoimmune  disease  and  increasing 
transplant  survival. 

We  have  learned  that  the  unresponsiveness  in 
leprosy  is  antigen-specific,  in  that  lepromatous 
leprosy  patients  unable  to  respond  to  antigens  of 
M.  leprae  usually  respond  to  those  of  M.  tubercu- 
losis, which  is  a  very  closely  related  mycobacte- 
rium.  How  is  it  possible  for  T  cells  to  recognize 
antigens  in  the  tubercle  bacillus  and  yet  be  un- 
able to  recognize  the  same  or  closely  related  anti- 
gens associated  with  the  leprosy  bacillus?  We 
proposed  that  there  might  be  one  or  a  few  unique 
antigens  associated  with  M.  leprae  that  induce 
active  T  cell  suppression  of  potentially  reactive  T 
cell  clones.  Suppressor  cells  in  immunology  have 
been  a  very  controversial  subject,  but  the  idea 
that  some  T  cells  can  down-regulate  immune  re- 
sponses, particularly  self-destructive  ones,  is 
compelling. 

Our  studies  in  vitro  have  shown  that  about  85 
percent  of  responsive  patients  with  lepromatous 
leprosy  have  a  subset  of  T  cells  capable  of  being 
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triggered  specifically  by  leprosy  antigens  to  sup- 
press responses  of  immune  T  cells.  Although  they 
represent  a  minor  subset  of  T  cells  in  the  blood, 
they  are  the  major  lymphocyte  subset  in  leproma- 
tous  lesions.  By  establishing  long-term  T  cell 
clones  directly  from  the  lesions,  we  found  the 
suppressor  cells  to  have  a  unique  pattern  of  anti- 
gen recognition  different  from  other  cytotoxic  or 
lymphokine-producing  T  cells.  They  carry  a  par- 
ticular surface  marker,  CDS,  and  recognize  for- 
eign antigens  in  association  with  a  particular 
region,  HIA-DQ,  of  the  human  major  histocom- 
patibility complex  (MHC),  class  II.  We  speculate 
that  presentation  of  antigens  by  this  MHC  subset 
predisposes  the  immune  responses  toward  nega- 
tive rather  than  positive  responses.  We  hope  to 
understand  the  molecular  basis  of  T  cell 
suppression. 

New  Vaccines  from  Old — Recombinant  BCG 
as  a  Multivaccine  Vehicle 

Vaccines  represent  the  most  cost-effective  med- 
ical intervention.  Yet  three  general  problems 
limit  the  use  of  current  vaccines:  1)  they  require 
multiple  booster  shots  to  be  effective;  2)  they 
cannot  be  given  for  6-12  months  after  birth,  be- 
cause of  transfer  of  maternal  antibodies  that  inac- 
tivate them;  and  3)  the  cost.  BCG  (bacille  Cal- 
mette-Guerin)  is  the  most  widely  used  vaccine  in 
the  world.  It  is  a  live,  attenuated  bovine  tubercle 
bacillus  given  to  protect  children  against  tuber- 
culosis. Studies  by  our  collaborator,  Jacinto  Con- 
vit,  in  Venezuela,  revealed  that  immunologically 
unresponsive  patients  with  lepromatous  leprosy 
could  be  converted  to  positive  cell-mediated  im- 
munity and  often  self-cure  by  injection  of  a  mix- 
ture of  killed  M.  leprae  together  with  BCG  vac- 
cine. This  represents  the  first  instance  of  vaccines 
being  used  therapeutically.  It  is  also  the  first  ex- 
ample of  patients  overcoming  a  specific  immuno- 
logic tolerance,  and  indicates  that  BCG  is  a  potent 
human  immune  enhancer,  or  "adjuvant." 

BCG  has  other  unique  attributes  as  well.  It  has 


been  given  to  over  2.5  billion  people  and  has  a 
very  low  incidence  of  serious  side  effects.  It  is 
one  of  the  few  childhood  vaccines  that  can  be 
given  at  birth,  or  any  time  thereafter.  It  is  a  single- 
shot  vaccine  that  engenders  long-lasting  cellular 
immunity  and  costs  only  six  cents  a  dose. 

The  unique  attributes  of  BCG  suggested  to  us 
that,  if  it  could  be  genetically  engineered  to  ex- 
press a  variety  of  foreign  antigens  protective  for 
different  pathogens,  a  single  immunization  might 
be  capable  of  engendering  protective  responses 
to  multiple  pathogens  simultaneously.  One  prob- 
lem, however,  was  the  paucity  of  molecular  ge- 
netic information  about  the  Mycobacteria.  In  col- 
laboration with  William  Jacobs  (HHMI,  Albert 
Einstein  College  of  Medicine),  we  undertook  to 
develop  new  genetic  systems  for  introducing  for- 
eign genes  into  mycobacteria,  particularly  BCG 
strains.  We  developed  a  shuttle  strategy  in  which 
mycobacterial  DNA  could  be  genetically  cloned 
and  manipulated  in  E.  coli,  then  transferred  into 
mycobacteria.  Our  first  approach  was  to  use  my- 
cobacteriophages  (viruses  that  infect  bacteria)  as 
vectors  to  target  foreign  genes  to  a  specific  site  in 
the  bacterial  chromosome.  This  enabled  us  to  in- 
troduce foreign  DNA  into  BCG  for  the  first  time. 
Recently  we  have  developed  shuttle  plasmid  vec- 
tors that  are  able  to  produce  many  copies  of  for- 
eign genes  in  BCG. 

With  several  collaborators  at  Medlmmune, 
Inc.,  and  the  University  of  Pittsburgh,  we  have 
developed  the  first  experimental  recombinant 
BCG  vaccines.  These  express  protective  antigens 
from  M.  leprae,  E.  coli,  schistosomes,  malaria, 
leishmania,  and  HIV  (human  immunodeficiency 
virus).  Initial  experiments  in  mice  indicate  that 
three  major  types  of  protective  immune  re- 
sponses can  be  generated  in  vivo — namely,  im- 
munoglobulin antibodies,  T  cell  lymphokines, 
and  cytotoxic  T  lymphocytes.  Continuing  efforts 
will  be  made  to  define  antigens  that  will  engen- 
der in  recombinant  BCG  protective  immunity 
against  a  variety  of  viral,  bacterial,  and  parasitic 
pathogens. 
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THE  long-term  goal  of  our  research  is  to  under- 
stand how  proteins  of  the  extracellular  ma- 
trix contribute  to  tissue  structure  and  function. 
Quantitative  and  qualitative  changes  in  these 
proteins  occur  during  morphogenesis  and  as  part 
of  the  wound  healing  process.  These  observations 
suggest  that  both  the  organization  and  protein 
composition  of  the  matrix  are  precisely  regu- 
lated. It  is  clear  that  this  regulation  occurs  in  part 
at  the  level  of  gene  expression  and  in  part  at  the 
level  of  the  assembly  of  proteins  into  a  matrix- 
like configuration. 

I  have  chosen  to  focus  for  the  most  part  on  the 
matrix  molecule  type  I  collagen.  This  is  a  poly- 
mer of  two  related  proteins  whose  sequence  has 
been  determined.  Moreover,  the  multidomain 
structure  of  the  molecule  and  a  general  outline  of 
collagen  biosynthesis  are  known,  and  the  mole- 
cule is  recognized  to  be  distributed  widely 
within  tissues  such  as  bone,  tendon,  ligament, 
tooth,  dermis,  and  sclera.  Previous  studies  have 
implied  that  type  I  collagen  makes  an  important 
contribution  to  the  structure,  integrity,  and  nor- 
mal homeostasis  of  these  tissues.  Over  the  last 
year  we  have  continued  our  work  to  establish 
model  systems  that  would  allow  us  to  study  this 
contribution  at  the  molecular  level. 

One  system  is  designed  to  investigate  the  intra- 
cellular assembly  of  the  collagen  molecule.  In 
general,  this  work  involves  site-specific  mutagen- 
esis and  assays  that  quantify  the  effects  of  muta- 
tion on  the  assembly  process.  These  effects  are 
studied  at  two  levels.  First,  we  have  established 
conditions  that  allow  synthetic  peptides  to  fold 
into  a  collagen-like  triple  helix.  Peptide  folding 
is  slow  enough  that  the  process  can  be  character- 
ized by  methods  such  as  circular  dichroism,  and 
the  triple  helix  formed  in  vitro  is  stable  enough 
that  its  structure  can  be  characterized  by  nuclear 
magnetic  resonance  (NMR) .  Therefore  the  effect 
of  a  given  mutation  can  be  quantified  by  directly 
comparing  the  behavior  of  a  normal  peptide  with 
that  of  a  mutant  one.  Also,  cellular  transfection 
methods  have  been  developed  to  express  and  as- 
semble collagen  molecules  in  vitro;  and  again, 
the  effect  of  mutation  on  the  assembly  process 


can  be  quantified  by  directly  comparing  the  be- 
havior of  normal  and  mutant  molecules. 

In  our  initial  mutagenesis  experiments,  we 
characterized  a  highly  conserved  region  of  the 
triple  helical  domain  and  demonstrated  that  it 
made  an  important  contribution  to  the  assembly 
of  collagen  molecules  into  a  thermodynamically 
stable  conformation.  We  speculate  that  this  re- 
gion was  conserved  during  evolution  because  it 
plays  an  important  role  in  collagen  biosynthesis 
— i.e.,  in  folding  the  collagen  molecule  into  its 
correct  conformation.  In  the  future,  we  hope  to 
use  this  model  system  to  define  further  the  nor- 
mal contribution  made  by  other  collagen  do- 
mains to  the  assembly  process.  In  addition,  we 
are  interested  in  characterizing  those  regions  of 
the  molecule  that  mediate  interactions  between 
collagen  and  other  matrix  molecules  such  as  fi- 
bronectin,  heparin  sulfate  proteoglycan,  and  in- 
tegrins.  These  interactions  are  important  because 
they  represent  a  molecular  basis  for  the  assembly 
of  collagen  within  the  matrix. 

A  second  system  is  designed  to  investigate  the 
function  of  type  I  collagen  at  the  level  of  connec- 
tive tissue.  Our  initial  set  of  experiments  utilized 
a  transgenic  mouse  strain  that  expressed  only  half 
the  normal  amount  of  type  I  collagen.  We  demon- 
strated that  the  mutation  adversely  affected  the 
connective  tissue  of  bone  and  skin  dermis.  In  ad- 
dition, the  mutant  mice  were  profoundly  deaf. 
We  utilized  biomechanical  tests  to  quantify  the 
effect  of  the  collagen  deficiency  at  the  tissue 
level,  and  these  studies  demonstrated  that  the 
major  role  of  type  I  collagen  is  to  provide  con- 
nective tissue  with  a  high  degree  of  resiliency. 

More  recently,  we  also  demonstrated  that  the 
skeleton  of  these  transgenic  mice  is  able  to  adapt 
to  the  inherited  collagen  deficiency.  This  adapta- 
tion involves  a  thickening  of  cortical  bone  and 
results  from  the  synthesis  of  new  bone  matrix. 
This  suggests  a  signal  transduction  pathway  for 
bone  in  which  the  mechanical  environment 
(e.g.,  strain)  influences  the  pattern  of  osteoblast 
gene  expression.  Particularly  intriguing  was  our 
observation  that  the  adaptation  was  associated 
with  a  significant  improvement  in  bone  strength. 

Experiments  utilizing  other  transgenic  mouse 
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strains  have  been  designed  to  test  the  hypothesis  strategy  to  treat  medical  conditions  characterized 
that  adaptations  of  this  type  represent  a  general    by  abnormal  bone  fragility. 
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THE  cellular  events  that  accompany  lymphoid 
differentiation  are  highly  regulated.  This  reg- 
ulation of  lymphoid  gene  expression  is  central  to 
the  development  of  T  and  B  cells  and  the  ultimate 
function  of  mature  lymphoid  cells.  We  are  inves- 
tigating several  experimental  systems: 

•  The  involvement  of  Ly-6  antigens  in  the  de- 
velopment of  T  cell  responsiveness,  with  a  focus 
on  such  events  as  recognition  of  foreign  antigen 
and  processes  associated  with  signal  transduc- 
tion. 

•  The  potential  effects  of  defective  Ly-6  gene 
expression  on  development  of  autoimmunity. 

•  T  cell  activation,  which  occurs  as  a  result  of 
specific  recognition  by  the  antigen-specific  re- 
ceptor. We  have  developed  an  expression  system 
that  results  in  production  of  high  levels  of  solu- 
ble T  cell  receptors  (TCRs)  for  analysis  of  func- 
tion and  biophysical  properties. 

•  Regulation  of  lymphoid-specific  genes  by 
RNA  splicing.  A  factor  that  may  regulate  this  pro- 
cess has  been  molecularly  cloned  and  is  being 
studied  functionally. 

•  Formation  and  development  of  memory  B 
cells  and  plasma  cell  lineages.  This  is  studied 
with  an  emphasis  on  commitment  to  a  lineage 
and  its  development. 

The  Ly-6  antigens  were  initially  characterized 
as  associated  with  activated  T  cells.  The  Ly-6A 
antigen  is  expressed  on  these  cells  and  hemato- 
poietic stem  cells  as  well.  The  antigen  can  also  be 
used  to  induce  activation,  as  monoclonal  antibod- 
ies directed  against  Ly-6A  activate  T  cells  when 
crosslinked.  Activation  requires  the  presence  of 
the  TCR  complex  on  the  cell  surface. 

To  analyze  this  process  functionally,  we  are  us- 
ing site-directed  mutagenesis.  The  protein  is  an- 
chored in  the  membrane  not  by  a  conventional 
transmembrane  segment  but  by  a  glycosyl  phos- 
phatidylinositol  (GPI)  lipid  anchor.  Mutants 
have  been  made  with  an  altered  anchor,  and  a 
transmembrane  form  has  been  created.  These  con- 
structs have  shown  that  the  GPI  anchor  is  re- 
quired for  function. 

Another  form  of  mutation  is  the  functional  de- 
letion of  the  gene  product.  When  the  expression 


of  Ly-6A  is  inhibited  in  a  T  cell  clone  by  antisense 
RNA,  the  antigen  recognition  function  of  the  cell 
is  destroyed  and  survival  of  the  clone  in  culture  is 
dependent  on  certain  lymphokines.  These  exper- 
iments reveal  the  importance  of  Ly-6  antigens  to 
T  cell  function  and  have  opened  up  new  ap- 
proaches to  the  study  of  Ly-6  function.  We  have 
begun  experiments  to  disrupt  the  endogenous 
gene  in  embryonic  stem  cells  by  homologous  re- 
combination. These  cells  can  be  used  to  generate 
a  mouse  that  can  transmit  the  mutation  in  the 
germline. 

All  inbred  strains  of  mice  tested  display  a  com- 
mon pattern  of  Ly-6C  expression  except  NOD, 
NZB,  and  ST.  In  these  three  strains,  expression  of 
Ly-6C  is  greatly  reduced  in  the  bone  marrow  and 
cannot  be  detected  in  the  spleen  or  lymph  nodes. 
NOD  and  NZB  mice  are  well-studied  models  of 
autoimmune  disease  (insulin-dependent  diabetes 
in  the  former  case,  systemic  lupus  erythematosus 
in  the  latter).  These  two  strains  of  mice  also  ex- 
hibit a  depressed  syngeneic  mixed-lymphocyte 
reaction  (SMLR) ,  which,  as  a  measure  of  the  pro- 
liferative response  of  T  cells  to  self-MHC  class  II 
determinants,  represents  an  immunoregulatory 
cellular  interaction.  Analysis  of  recombinant 
inbred  lines  derived  from  NZB  indicates  cosegre- 
gation  of  the  defect  in  Ly-6C  expression  and  de- 
creased responsiveness  in  the  SMLR. 

Cloning  and  sequencing  of  the  affected  Ly-6C 
gene  segment  from  the  NOD  mouse  indicates  the 
presence  of  an  interruption  by  sequences  charac- 
teristic of  retroposon  insertion  in  the  5'  flanking 
region  of  the  Ly-6C  gene.  This  disruption  of  the 
normal  Ly-6C  gene  allows  only  a  minute  level  of 
Ly-6C  to  be  expressed,  raising  the  possibility  that 
the  antigen  is  directly  involved  in  regulatory  T 
cell  responses.  Ly-6C  might  be  necessary  for 
some  aspects  of  the  normal  development  of  the 
TCR  repertoire.  We  have  identified  defects  in  he- 
matopoietic stem  cell  populations  and,  using  a 
panel  of  antibodies,  are  attempting  to  identify 
and  purify  the  cells  that  can  transfer  disease. 

The  Ly-6C  and  Ly-6A  genes  are  very  inducible 
with  interferons.  Regions  in  the  promoters  of 
both  that  confer  responsiveness  to  interferon 
have  been  identified  but  lack  known  consensus 
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interferon-responsive  elements.  We  are  examin- 
ing deletion  and  point  mutation  analysis  of  these 
promoters,  using  transient  and  stable  transfec- 
tants.  The  segments  of  DNA  that  control  differ- 
ences in  tissue-specific  expression  are  being 
defined  in  transfected  cell  lines,  and  the  DNA- 
binding  proteins  that  control  the  expression  of 
these  genes  are  also  under  study. 

One  of  the  most  fundamental  interactions  in 
the  immune  system  is  that  of  the  TCR  with  MHC 
plus  ligand.  We  have  produced  large  amounts  of 
soluble  MHC  class  I-  and  class  Il-restricted  TCRs 
to  study  the  molecular  interactions.  We  have 
been  able  to  inhibit  the  functional  activity  of  T 
cell  clones  by  the  addition  of  TCR  molecules  to 
block  the  interaction  of  T  cells  and  antigen-pre- 
senting cells. 

Regulation  of  tissue-specific  expression  of 
exons  is  a  major  form  of  gene  regulation.  We  have 
molecularly  cloned  the  protein  that  binds  the 
polypyrimidine  regions  at  3'  splice  junctions. 
This  protein  may  be  one  of  the  more  significant 


factors  that  determine  the  splicing  pattern  of  a 
given  gene.  The  region  in  the  protein  that  con- 
tacts RNA  contains  a  novel  RNA  recognition  motif 
(RRM) .  We  are  characterizing  this  RRM,  as  well  as 
modifying  the  expression  of  the  gene  product  in 
T  cell  clones,  to  determine  whether  this  protein 
in  a  cell  can  alter  the  splicing  pattern  of  genes. 

A  primary  response  generates  antibodies  and 
immunologic  memory.  We  are  studying  the  mo- 
lecular mechanisms  that  generate  B  cell  memory 
and  the  relationship  of  somatic  mutation  to  that 
process.  Studies  of  a  primary  immune  response 
have  indicated  several  new  aspects  of  early 
events.  The  early  response  has  an  oligoclonal 
structure,  and  hypermutation  begins  earlier  than 
previously  recognized.  The  properties  of  newly 
acquired  somatic  mutations  indicate  the  exis- 
tence of  a  difi'erentiation  pathway  in  which  muta- 
tion is  inactive.  Once  the  memory  lineages  are 
generated,  they  continue  to  mutate.  The  second- 
ary immunization  triggers  a  new  round  of  prolifer- 
ation and  mutation. 
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LYMPHOCYTE  interactions  during  an  immune 
response  are  necessary  for  the  induction  of 
antigen-specific  lymphocytes.  The  resulting  ef- 
fector phase  of  an  immune  response  is  described 
as  either  humoral  or  cell  mediated:  each  of  these 
phases  combats  different  types  of  microorgan- 
isms. Although  these  effector  mechanisms  are 
well  characterized,  the  precise  mechanism  by 
which  the  response  to  a  given  antigen  or  in- 
fectious agent  is  directed  into  the  humoral  or 
cell-mediated  mode  is  not  known.  What  is  clear, 
however,  is  that  both  types  of  immunity  depend 
on  the  activation  of  CD4-bearing  T  lymphocytes, 
which  in  turn  induce  other  cell  types  to  respond 
to  the  foreign  antigen.  These  responses  include 
activation  of  B  cells  to  proliferate  and  to  secrete 
antibody,  induction  of  delayed-type  hypersensi- 
tivity reactions,  activation  of  CDS  cytolytic  T 
cells,  and  activation  of  macrophages. 

Two  questions  were  then  asked.  1)  Could  the 
same  CD4  T  cell  activate  all  these  target  cells, 
therefore  mediating  both  humoral  and  cell- 
mediated  responses?  2)  Is  the  CD4  T  cell  popula- 
tion functionally  heterogeneous;  i.e.,  do  some 
CD4-bearing  T  cells  activate  B  cells  and  play  a 
primary  role  in  the  induction  of  humoral  immu- 
nity, and  do  some  activate  macrophages  and  CD8- 
bearing  T  cells  and  play  a  primary  role  in  cell-me- 
diated immunity? 

For  several  years  we  have  focused  on  the  hetero- 
geneity in  CD4  T  cell  function  and  the  activation 
conditions  that  lead  to  it.  These  studies  have  suc- 
cessfully shown  that  monoclonal  CD4  T  lympho- 
cytes, obtained  by  T  cell  cloning  and  expansion 
in  tissue  culture,  belong  to  two  distinct  subsets  of 
T  cells.  One  subset  can  help  B  cell  antibody  se- 
cretion but  cannot  activate  macrophages;  the 
other  set  activates  macrophages  but  is  a  poor  B 
cell  activator.  To  reflect  their  main  functions, 
these  two  subsets  are  called  helper  T  cells  (Th2) 
and  inflammatory  T  cells  (Thl).  The  distinct 
functional  abilities  of  Th2  and  Thl  subsets  is  re- 
flected in  their  release,  upon  activation,  of  dis- 
tinct panels  of  cytokines.  Several  cytokines  are 
produced  selectively  by  one  or  the  other  subset. 
In  particular,  interleukin-2,  interferon-7,  and 


lymphotoxin  are  produced  by  the  Thl  but  not 
Th2  cell  subset.  Interleukin-4  is  produced  by  the 
Th2  but  not  Thl  cell  subset. 

Thus  there  is  a  correlation  between  cytokine 
production  and  function.  Interleukin-4  is  a  po- 
tent B  cell  activator  involved  in  B  cell  prolifera- 
tion and  secretion  of  immunoglobulins  IgGl  and 
IgE.  By  contrast,  interferon-7,  lymphotoxin,  and 
interleukin-2  are  associated  with  responses  in- 
volving the  activation  of  macrophages,  lysis  of 
target  cells,  and  induction  of  cytolytic  T  cells, 
which  is  consistent  with  the  known  function  of 
these  cytokines.  One  can  conclude  from  these 
studies  that  T  cells  are  committed  to  the  release 
of  a  distinct  panel  of  lymphokines  when  acti- 
vated, with  the  released  effector  molecules  de- 
termining their  effector  function. 

Recent  studies  in  this  laboratory  have  focused 
on  determining  whether  functionally  distinct 
subsets  of  CD4  T  cells  exist  in  vivo  and  whether 
the  selective  activation  of  one  subset  or  the  other 
has  the  expected  functional  consequences.  Nu- 
merous clinical  studies  have  suggested  that 
various  immunization  schemes  induce  primarily 
a  humoral  or  cell-mediated  immune  response.  It 
is  critically  important  to  determine  if  such  major 
differences  in  protective  immunity  reflect  differ- 
ences in  the  proportion  of  CD4  T  cell  subsets 
activated.  If  this  is  true,  one  might  propose  that 
the  form  of  the  antigen  or  the  antigen  presenta- 
tion must  somehow  direct  which  CD4  T  cell  will 
be  preferentially  activated.  To  test  this  possibil- 
ity, several  questions  about  normal  CD4  T  cells 
have  been  asked. 

First,  is  there  a  separation  of  resting  CD4  T  cells 
into  subsets?  To  answer  this,  an  analysis  of  the 
responses  to  antigens  that  give  rise  to  primarily 
humoral  or  cell-mediated  immunity  has  been 
performed.  When  humoral  immunity  dominates, 
Th2-like  CD4  T  cells  are  activated.  By  contrast, 
when  cell-mediated  immunity  dominates,  Thl- 
like  CD4  T  cells  are  activated.  Thus,  during  the 
course  of  an  immune  response,  CD4  T  cells  may 
become  specialized  in  their  functional  capabili- 
ties, and  these  T  cells  in  vivo  resemble,  in  their 
activities,  Thl  and  Th2  cloned  lines. 

Second,  when  during  development  or  activa- 
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tion  do  subsets  of  CD4-bearing  T  cells  arise?  A 
particularly  interesting  question  was  whether  the 
commitment  of  CD4-bearing  T  cells  to  the  Thl 
and  Th2  subsets  was  a  consequence  of  antigen 
priming.  Analysis  of  memory  and  virgin  CD4- 
bearing  T  cells  indicates  that  the  functional  spe- 
cialization characteristic  of  Thl  and  Th2  subsets 
resides  mainly  in  the  memory  population.  This 
suggests  that  contact  with  foreign  antigen  initi- 
ates a  commitment  of  CD4-bearing  T  cells  to  a 
particular  effector  response. 

The  form  an  immune  response  takes  must  be 
appropriate  to  the  microorganism  that  is  causing 


the  disease.  This  is  seen  clearly  in  leprosy,  where 
one  response  produces  abundant  antibody  and,  at 
the  same  time,  overwhelming  growth  of  the  lep- 
rosy bacterium  in  macrophages.  This  is  probably 
an  example  of  selective  activation  of  the  helper 
(Th2),  rather  than  the  inflammatory  (Thl),  sub- 
set of  CD4  T  cells.  What  controls  the  form  such  a 
response  will  take?  Model  systems  to  explore 
these  questions  are  needed,  so  that  an  inappropri- 
ate immune  response  can  be  redirected  to  be- 
come curative.  Future  research  in  this  laboratory 
will  focus  on  this  question  of  the  control  of  CD4 
T  cell  subset  activation  and  regulation. 
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Tracking  the  intracellular  location  of  retroviral  DNA.  To  determine  how  intermediates  in  the 
replication  of  retroviruses  move  to  the  nucleus,  we  can  use  hybridization  probes  that  specifically 
recognize  the  viral  DNA  and  provide  a  fluorescent  signal.  This  photomicrograph  shows  cultured 
rat  fibroblasts  that  have  each  been  infected  with  about  10  copies  of  the  Moloney  murine  leukemia 
virus.  The  green  spots  indicate  the  locations  of  viral  DNA  molecules.  The  nucleus  is  visualized  by 
staining  with  a  blue  fluorescent  dye  that  binds  nonspeciflcally  to  DNA.  An  antibody  to  microtu- 
bules, labeled  with  a  red  fluorescent  dye,  indicates  the  contours  of  the  cytoplasm.  By  carrying  out 
this  kind  of  analysis  at  intervals  after  viral  infection,  one  can  track  the  movement  of  viral  DNA  to 
the  cell's  nucleus. 
Research  of  Patrick  O.  Brown. 
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Medicine.  He  received  his  B.A.  degree  in  chemistry  from  the  University  of  Chicago.  His  graduate  work  with 
Nicholas  Cozzarelli  at  the  University  of  Chicago  was  focused  on  the  mechanisms  ofDNA  topoisomerases. 
He  received  his  Ph.D.  and  M.D.  degrees  from  the  University  of  Chicago,  completed  a  pediatrics  residency 
at  Children's  Memorial  Hospital  in  Chicago,  and  then  joined  J.  Michael  Bishop's  laboratory  at  the  Univer- 
sity of  California,  San  Francisco.  There  he  began  to  investigate  the  mechanism  of  retroviral  integration, 
which  has  continued  to  be  the  major  focus  of  his  research. 


RETROVIRUSES  are  an  important  cause  of  dis- 
ease in  most  vertebrate  species.  In  humans, 
retroviral  infection  can  lead  to  AIDS  (acquired 
immune  deficiency  syndrome),  leukemia,  lym- 
phoma, or  degenerative  diseases  of  the  central 
nervous  system.  Millions  of  people  throughout 
the  world  are  infected  with  the  human  immuno- 
deficiency virus,  HIV,  and  will  likely  succumb  to 
AIDS  unless  an  effective  treatment  is  developed. 

Retroviral  Replication 

The  retroviral  genes  are  carried  in  the  virus 
particle  as  RNA  molecules.  When  the  virus  infects 
a  cell,  it  transcribes  these  molecules,  its  RNA  ge- 
nome, into  a  double-stranded  DNA  molecule  and 
inserts  this  into  a  chromosome  of  the  host  cell. 
Thus  the  viral  genome,  then  called  a  provirus, 
becomes  an  integral  part  of  the  cell's  DNA.  Inte- 
gration of  a  provirus  into  its  host  cell's  genome  is 
essential  for  retroviral  reproduction.  This  dis- 
tinctive feature  of  the  life  cycle  of  the  retrovi- 
ruses accounts  for  many  of  the  characteristics  as- 
sociated with  retroviral  infection,  including 
insertional  mutagenesis,  induction  of  tumors, 
and  viral  latency  and  persistence.  Moreover,  the 
fact  that  retroviruses  are  designed  to  intro- 
duce foreign  genes  into  cellular  DNA  makes 
them  exceptionally  useful  as  tools  for  genetic 
engineering. 

The  question  that  propels  much  of  the  work  in 
our  laboratory  is.  How  does  a  retrovirus  get  its 
DNA  into  a  cell's  nucleus  and  integrate  it  into  the 
cell's  DNA?  To  address  this  question,  we  are  us- 
ing genetic,  biochemical,  and  cytological  ap- 
proaches. For  example,  fluorescent  probes  that 
bind  to  viral  DNA  enable  us  to  track  individual 
viral  particles  as  they  infect  a  cell.  This  allows  us 
to  investigate  the  influence  of  such  factors  as  the 
cell  division  cycle  and  the  role  of  specific  cyto- 
skeletal  elements  on  the  entry  of  viral  particles 
into  the  nucleus  of  cells.  The  methods  we  have 
developed  to  study  such  nuclear  entry  will  also 
be  applied  in  exploring  the  mechanisms  of  sub- 


cellular localization  and  the  intracellular  traf- 
ficking of  intermediates  in  viral  replication. 

Disassembly  of  the  nuclear  envelope  at  mitosis 
provides  one  possible  route  for  nuclear  entry  of 
viral  intermediates.  Hence  we  are  studying  the 
role  of  mitosis  in  the  viral  life  cycle.  It  has  been 
recognized  for  many  years  that  establishment  of  a 
retroviral  provirus  proceeds  much  more  readily 
in  actively  dividing  cells  than  in  their  resting 
counterparts,  but  the  basis  for  this  phenomenon 
remains  unknown.  To  bring  this  phenomenon 
into  clearer  focus,  we  are  investigating  the  de- 
pendence of  specific  steps  in  the  life  cycle  of  the 
murine  leukemia  virus  (MLV)  on  the  host  cell's 
stage  in  its  own  division  cycle.  Understanding 
how  cellular  functions  can  determine  the  fate  of 
an  infecting  retrovirus  may  lead  us  to  new  ap- 
proaches to  antiviral  therapy  and  to  improve- 
ments in  the  use  of  retroviruses  as  vectors  for 
gene  therapy. 

To  investigate  the  molecular  mechanism  by 
which  a  retrovirus  inserts  its  DNA  into  that  of  the 
infected  cell,  we  have  developed  a  variety  of 
methods  for  studying  the  retroviral  integration  in 
a  test  tube.  We  have  used  this  approach  to  define 
several  discrete  steps  in  the  joining  of  viral  to 
cellular  DNA  and  to  determine  the  requirements 
for  the  reaction.  The  enzymatic  machinery  that 
carries  out  integration  can  be  isolated  from  in- 
fected cells  in  a  stable  complex  with  the  uninte- 
grated  viral  DNA  molecule.  This  complex  was  the 
focus  of  our  initial  studies  of  integration,  and  we 
are  continuing  to  investigate  its  structure  and 
composition. 

Our  ability,  however,  to  study  the  enzymology 
of  integrase,  the  viral  protein  that  actually  cata- 
lyzes integration,  was  limited  by  the  paucity  of 
integrative  complexes  in  infected  cells.  In  the 
past  year,  we  have  constructed  bacterial  strains 
genetically  engineered  to  produce  abundant 
quantities  of  the  integrase  proteins  from  HIV  and 
MLV  and  have  developed  simple  purifications  of 
these  proteins.  Using  model  substrates,  we  can 
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now  readily  study  the  catalytic  activities,  which 
include  the  sequence-specific  processing  of  the 
viral  DNA  ends  and  the  joining  of  these  ends  to  a 
target  DNA  molecule. 

Studies  are  under  way  to  define  the  structure  of 
the  integrase  proteins,  the  molecular  mechanism 
by  which  they  carry  out  their  catalytic  functions, 
and  the  means  by  which  they  recognize  and  bind 
their  two  distinct  classes  of  DNA  substrates.  The 
ease  with  which  genetic  manipulations  of  inte- 
grase can  be  accomplished  in  the  bacterial  ex- 
pression system  will  enable  us  to  carry  out  a  ge- 
netic dissection  of  the  many  activities  of  the  HIV 
and  MLV  integrases.  Moreover,  this  system  should 
facilitate  our  efforts  to  develop  genetically  al- 
tered integrases  with  properties  more  favorable 
for  therapeutic  applications.  For  example,  we 
would  like  to  develop  an  integrase  that  can  selec- 
tively integrate  the  viral  DNA  into  predetermined 
sites  in  the  target  DNA. 

The  ultimate  goal  of  our  work  on  integration  is 
to  understand  in  molecular  detail  how  the  pro- 
teins of  the  integration  machinery  recognize  the 
viral  DNA,  form  an  active  complex,  recognize  the 
target  DNA,  and  finally  catalyze  the  DNA  breakage 
and  joining  reactions  that  lead  to  integration  of 
the  provirus.  It  is  hoped  that  this  understanding 
will  lead  to  the  development  of  new  agents  for 
inhibiting  the  replication  of  pathogenic  retrovi- 
ruses, and  to  improved  systems  for  the  therapeu- 
tic introduction  of  genes  into  mammalian  cells. 


New  Methods  for  Linkage  Mapping  in 
Complex  Genomes 

A  major  impediment  to  defining  and  character- 
izing the  genes  that  influence  complex  human 
traits  has  been  the  difficulty  of  collecting  suitable 
large  families  in  which  the  trait  segregates.  Such 
families  are  generally  needed  to  find  the  genes  by 
conventional  linkage  mapping.  The  same  genes 
could  in  principle  be  mapped  by  an  alternative 
strategy  that  involves  collecting  and  analyzing 
pairs  of  relatives  that  share  a  trait  of  interest.  In 
the  case  of  rare  recessive  traits,  just  a  few  affected 
inbred  individuals  could  suffice,  and  such  small 
sets  of  affected  relatives  are  generally  easier  to 
collect  than  large  pedigrees.  However,  linkage 
mapping  with  small  sets  of  relatives  generally  re- 
quires analysis  of  a  large  number  of  closely 
spaced,  highly  polymorphic  genetic  markers, 
which  makes  this  strategy  impractical  with 
current  technology. 

We  are  developing  a  new  set  of  genetic  tools 
that  should  facilitate  widespread  application  of 
these  highly  efficient  linkage  mapping  methods. 
Experiments  are  in  progress  to  test  our  new  map- 
ping procedure,  using  as  test  systems  the  nema- 
tode Caenorhabditis  elegans  and  some  well- 
characterized  human  families.  We  hope  to  be 
able  soon  to  apply  this  technology  to  map  genes 
for  a  number  of  complex  human  traits  that  have 
resisted  conventional  approaches. 


62 


Normal  and  Oncogenic  Forms  of 
thestc  Gene  Product 


Joan  S.  Brugge,  Ph.D. — Investigator 

Dr.  Brugge  is  also  Professor  of  Microbiology  at  the  University  of  Pennsylvania  School  of  Medicine.  She  re- 
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IN  1 9 1 1 ,  Peyton  Rous  at  the  Rockefeller  Univer- 
sity first  demonstrated  that  tumors  could  be  in- 
duced by  a  transmissible  agent  isolated  from  a 
chicken  sarcoma.  This  agent,  later  shown  to  be  a 
retrovirus,  was  designated  Rous  sarcoma  virus 
(RSV).  Since  then,  RSV  has  served  as  a  model  sys- 
tem to  examine  tumorigenesis.  One  of  the  four 
genes  that  RSV  carries  is  both  necessary  and  suffi- 
cient to  cause  tumor  formation  in  chickens.  This 
oncogene,  v-src,  is  derived  from  a  normal  cellu- 
lar gene  (c-src)  that  was  captured  by  RSV  during 
infection  of  chicken  cells.  The  c-src  gene  is  pres- 
ent in  normal  chromosomal  DNA  from  humans 
and  from  lower  eukaryotic  species,  such  as  fruit 
flies  and  sponges.  The  high  degree  of  conserva- 
tion of  this  gene  throughout  evolution  suggests 
that  it  plays  a  crucial  role  in  normal  cell  pro- 
cesses. The  Y-src  gene  from  RSV  contains  many 
mutations  that  distinguish  it  from  c-src.  The  src 
gene  codes  for  a  60-kDa  protein  that  is  located  on 
the  cytoplasmic  side  of  the  plasma  membrane. 
The  src  protein,  an  enzyme  referred  to  as  a  pro- 
tein kinase,  catalyzes  the  transfer  of  a  phosphate 
group  from  ATP  to  tyrosine  residues  on  other  cel- 
lular proteins.  The  addition  of  phosphate  to  pro- 
teins is  commonly  employed  by  cells  to  regulate 
the  activity  of  cellular  proteins.  It  is  believed  that 
the  mutated  RSV  src  protein  induces  tumor  for- 
mation by  the  inappropriate  phosphorylation  of 
cellular  proteins  that  are  involved  in  regulating 
cellular  growth.  Our  major  objectives  are  1)  to 
determine  the  function  of  the  src  protein  in  nor- 
mal cells,  2)  to  understand  how  this  protein  is 
regulated  in  normal  cells,  3)  to  determine  how 
mutations  in  the  structure  of  the  src  protein  acti- 
vate the  ability  of  this  protein  to  convert  a  normal 
cell  to  a  tumor  cell,  and  4)  to  elucidate  the  mech- 
anism whereby  these  mutated  forms  induce  tu- 
mor formation. 

We  have  found  that  two  types  of  normal  cells 
express  high  levels  of  the  src  protein:  1)  neurons 
from  embryonic  and  adult  neural  tissues  and  2) 
peripheral  blood  platelets.  To  investigate  the 
function  of  the  src  protein  in  neurons,  we  have 
microinjected  antibodies  directed  against  the  src 
protein  into  PC  12  cells,  a  neural  tumor  cell  line 


that  differentiates  into  neuron-like  cells  after 
treatment  with  nerve  growth  factor  (NGF) .  These 
studies  were  performed  in  collaboration  with 
Simon  Halegoua  (State  University  of  New  York  at 
Stony  Brook) . 

Introduction  of  purified  monoclonal  antibod- 
ies to  the  src  protein  inhibited  the  production  of 
neurite  extensions,  the  characteristic  morpholog- 
ical marker  of  neuron-like  differentiation.  Using 
this  antibody,  we  have  begun  to  examine  how  src 
communicates  with  other  regulatory  proteins  be- 
lieved to  participate  in  NGF-induced  neurite  out- 
growth. We  have  provided  evidence  that  src  col- 
laborates with  another  proto-oncogene,  ras,  to 
provide  essential  functions  that  act  on  a  single 
pathway  involved  in  NGF-induced  neurite  out- 
growth. We  have  also  found  that  the  oncogenic 
forms  of  several  tyrosine  kinases,  in  addition  to 
v-src  (as  shown  by  Alema  and  co-workers),  are 
able  to  induce  neurite  outgrowth  in  PCI 2  cells. 
These  studies  strongly  support  the  possibility  that 
tyrosine  phosphorylation  is  important  in  the 
regulation  of  critical  events  in  neural  cell 
development. 

We  have  characterized  a  uniquely  modified 
form  of  the  src  protein  that  is  specifically  pro- 
duced in  central  nervous  system  neurons.  This 
variant  form  of  the  src  protein  {src*) ,  which  con- 
tains six  additional  amino  acids  that  are  not  found 
in  the  src  protein  expressed  in  any  other  cell 
type,  is  generated  by  an  alternative  processing  of 
the  src-specific  messenger  RNA  in  neurons.  The 
src*  protein  displays  an  elevated  level  of  kinase 
activity.  This  unique  modification  suggests  that 
src  plays  a  specific  role  in  these  specialized  cells. 
Using  antibodies  that  exclusively  recognize  this 
unique  form  of  the  src  protein,  we  have  localized 
src*  to  specific  subclasses  of  neurons  in  different 
regions  of  the  brain.  Further  studies  are  designed 
to  determine  the  precise  intracellular  localiza- 
tion of  this  form  of  src  in  developing  and  mature 
neurons  to  provide  additional  clues  to  the  func- 
tion of  this  protein. 

Platelets  are  small  anucleate  cells  in  peripheral 
blood  that  contain  many  intracellular  vesicles 
whose  components  are  released  upon  activation 
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by  cellular  hormones.  Platelets  and  the  released 
products  carry  out  specific  functions  in  clot  for- 
mation and  wound  healing.  Platelets  are  an  ideal 
system  to  study  cellular  events  involved  in  the 
transduction  of  extracellular  signals.  The  src  pro- 
tein represents  a  large  percentage  of  total  cell 
protein  (0.2-0.4  percent);  thus  this  enzyme  may 
regulate  important  events  at  the  plasma  mem- 
brane. We  have  recently  found  that  four  protein 
tyrosine  kinases  closely  related  to  c-src  are  also 
expressed  in  platelets  and  that  thrombin  and 
other  activators  of  platelet  functions  cause  a 
rapid  increase  in  the  phosphorylation  of  multiple 
proteins  on  tyrosine. 

We  have  found  that  the  phosphorylation  of  sev- 
eral of  these  proteins  requires  platelet  aggrega- 
tion and  the  interaction  between  the  blood  cell 
adhesion  protein  fibrinogen  and  its  receptor  on 
the  platelet  surface.  Tyrosine  phosphorylation  of 
these  proteins  is  not  necessary  for  secretion  of  the 
intracellular  platelet  granules  (the  other  major 
function  of  peripheral  blood  platelets).  These 
studies  suggest  that  tyrosine  phosphorylation 
may  be  involved  in  triggering  intracellular  events 
that  are  induced  by  fibrinogen-mediated  platelet 
aggregation.  We  are  attempting  to  purify  these 
proteins,  to  identify  their  function  in  platelets, 
and  to  determine  how  the  phosphorylation  of 
these  proteins  on  tyrosine  residues  affects  their 
functional  activity. 

The  expression  of  high  levels  of  the  src  protein 
in  terminally  differentiated  cells,  such  as  neurons 
and  platelets,  indicates  that  c-src  is  not  exclu- 
sively involved  in  events  that  directly  regulate 
cellular  proliferation.  It  is  difficult  to  reconcile 
the  tumorigenic  activity  of  the  RSV  src  gene  prod- 
uct with  such  a  role  for  the  c-src  protein  in  termi- 
nally differentiated  cells.  Mutational  changes  in 
the  v-src  gene  probably  deregulate  and  activate 
the  tyrosine  kinase  activity  of  the  v-src  gene  prod- 
uct, thus  allowing  inappropriate  phosphoryla- 


tion of  cellular  substrates  that  are  involved  in 
growth  regulation.  The  identification  of  these 
substrates  is  one  of  the  most  difficult  and  impor- 
tant problems  in  this  field. 

Regulation  of  the  src  Protein  Activation  of 
Tumorigenic  Potential 

Evidence  suggests  that  the  c-src  protein  is 
tightly  regulated  in  normal  cells  and  that  muta- 
tional alterations  in  this  protein  cause  a  deregula- 
tion and  constitutive  activation  of  tyrosine  pro- 
tein kinase  activity.  We  are  using  site-directed 
mutagenesis  to  define  regions  of  the  src  molecule 
that  affect  the  kinase  activity  of  this  enzyme,  to 
identify  regions  that  are  important  for  interaction 
with  other  cellular  proteins  that  mediate  src- 
induced  transformation. 

We  have  focused  on  the  amino-terminal  half  of 
the  src  protein  that  lies  outside  of  the  catalytic 
domain  of  the  molecule,  in  two  highly  conserved 
regions  (SH2  and  SH3)  that  are  shared  with  several 
other  cellular  proteins.  Deletion  of  either  of 
these  regions  does  not  prevent  transformation  by 
oncogenic  forms  of  src  but  does  interfere  with 
the  interaction  of  src  with  several  protein  sub- 
strates. In  addition,  deletion  of  the  SH2  or  SH3 
domain  in  wild-type  c-src  activates  the  kinase  ac- 
tivity and  transformation  potential  of  this  proto- 
oncogene.  We  are  currently  examining  the  im- 
portance of  these  regions  in  events  other  than 
oncogenic  transformation. 

We  have  also  identified  a  novel  protein  sub- 
strate that  is  phosphorylated  in  v-src-transformed 
cells.  This  protein,  connexin43,  is  the  major  com- 
ponent of  gap  junctions,  which  mediate  cell-to- 
cell  communication.  The  phosphorylation  of 
connexin43  correlated  with  the  ability  of  v-src  to 
cause  a  reduction  in  junctional  communication 
between  cells,  suggesting  that  phosphorylation 
of  connexin43  by  the  v-src  protein  may  be  re- 
sponsible for  the  inhibition  of  junctional  commu- 
nication in  v-src-transformed  cells. 
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OUR  research  lies  at  the  interface  between 
theory  and  experiment  in  the  area  of  struc- 
tural biophysics.  The  research  tools  are  sim- 
ulation methods  of  computational  chemistry 
adapted  to  the  requirements  of  macromolecular 
systems.  Macromolecular  simulations  are  an  im- 
portant addition  to  the  arsenal  of  methods  avail- 
able to  structural  biologists  working  with  x-ray 
crystallographic  or  nuclear  magnetic  resonance 
(NMR)  spectroscopic  data.  In  one  set  of  projects 
we  are  trying  to  understand  the  detailed  micro- 
scopic interactions  that  govern  stability  and  rec- 
ognition in  biological  systems  and  to  test  the  reli- 
ability of  the  theoretical  methods  as  tools  for  this 
purpose.  We  are  also  directly  combining  macro- 
molecular simulation  with  experimental  data  to 
make  data  analysis  possible  or  more  efficient. 

Generalizing  Molecular  Replacement 
in  X-ray  Crystallography 

In  macromolecular  crystallography,  the  deter- 
mination of  initial  phases  is  the  major  obstacle  to 
determining  the  structure  of  the  crystallized  mol- 
ecule. This  is  because  the  observable  x-ray  dif- 
fraction information  from  a  single  crystal  com- 
prises only  the  amplitudes  but  not  the  phases  of 
the  reflections.  Although  this  "phase  problem" 
has  been  solved  in  the  case  of  small  molecules 
(up  to  a  few  hundred  atoms  in  the  unit  cell) 
through  the  development  of  direct  methods  by 
Hauptman  and  Karle,  the  application  of  these  meth- 
ods to  macromolecules  has  so  far  been  unsuccessful, 
and  one  has  to  resort  to  time-consuming  and  some- 
times difficult  experimental  methods. 

The  initial  determination  of  phases  by  molecu- 
lar replacement  is  often  attempted  if  the  struc- 
ture of  a  similar  or  homologous  protein  is  known. 
Molecular  replacement  involves  the  placement 
(i.e.,  rotation  and  translation)  of  the  known 
structure  in  the  unit  cell  of  the  target  crystal  in 
order  to  obtain  the  best  agreement  between  cal- 
culated model  diffraction  data  and  the  observed 
diffraction  data.  Recent  progress  in  obtaining  ap- 
proximate three-dimensional  models  of  macro- 


molecules  from  information  other  than  x-ray 
crystallography  suggests  an  increased  use  of  mo- 
lecular replacement.  For  instance,  the  database  of 
known  protein  sequences  is  growing  rapidly. 
Techniques  for  aligning  sequences,  such  as  con- 
sensus templates,  have  been  developed  to  recog- 
nize distantly  related  proteins  or  protein  domains 
and  to  carry  out  model  building  on  the  basis  of 
the  several  hundred  known  protein  structures. 

Molecular  replacement  often  fails  if  the  search 
model  is  too  inaccurate — that  is,  if  the  differ- 
ences in  atomic  positions  between  the  search 
model  and  the  crystal  structure  are  more  than  1  A. 
In  this  case  we  proposed  to  vary  the  orientations 
and  positions  of  domains,  or  structural  subunits, 
of  the  search  model  in  a  neighborhood  around 
the  initial  positions .  We  developed  a  new  molecu- 
lar replacement  strategy.  First  a  conventional  ro- 
tation search  is  carried  out.  Then  the  rotation 
search  is  "filtered"  by  employing  refinement  of 
the  domain  orientations  and  positions  against  the 
Patterson  correlation  coefficient.  Finally,  the  re- 
fined search  models  with  the  highest  correlation 
coefficients  are  used  for  conventional  translation 
searches.  Computer  model  studies  have  already 
suggested  the  usefulness  of  the  new  method.  Re- 
cently we  obtained  phases  for  six  crystal  struc- 
tures that  previously  could  not  be  solved,  includ- 
ing several  monoclonal  antibody  Fab  fragments. 

Presently,  the  molecular  replacement  strategy 
is  extended  by  applying  modern  techniques 
of  nonlinear  optimization,  such  as  simulated 
annealing. 

Predictions  of  Helix-Helix  Association  and 
Stability  in  Proteins:  Leucine  Zipper  and 
Membrane  Proteins 

Prediction  of  the  three-dimensional  structure 
of  proteins  based  on  their  sequence  has  up  to 
now  been  impossible.  This  fundamental  problem 
of  structural  biology  (the  "folding  problem")  re- 
mains unsolved,  despite  improvements  in  com- 
putational techniques  for  macromolecular  simu- 
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lation  and  computer  hardware.  Nevertheless, 
macromolecular  simulation  has  been  successful 
in  predicting  localized  conformations  if  suffi- 
cient experimental  constraints  or  restraints  (e.g., 
in  the  form  of  an  x-ray  structure)  are  available.  It 
is  therefore  conceivable  that  other  more  global 
predictions  are  possible  if  appropriate  experi- 
mental information  is  available.  We  have  em- 
barked on  trying  to  predict  the  association  and  stabil- 
ity of  helices  that  are  believed  to  form  coiled  coil 
conformations.  Conformational  search  strategies  are 
being  employed  with  empirical  energy  functions, 
using  molecular  dynamics  and  energy  minimization 
starting  with  generic  coiled  coil-forming  a-helices. 

Presently  we  are  applying  this  approach  to  the 
family  of  leucine  zipper  proteins,  which  are  se- 
quence-specific DNA-binding  proteins  that  regu- 
late gene  expression  in  certain  mammalian  cells. 
We  have  predicted  the  structure  of  the  dimeriza- 
tion  domain  of  GCN4,  for  which  a  high-resolu- 
tion x-ray  structure  will  be  available  shortly  (per- 
sonal communication,  Thomas  Alber,  University 
of  Utah).  Furthermore,  we  are  trying  to  explain 
why  Jun  can  form  a  homodimeric  protein  that 
binds  to  DNA,  whereas  Fos  is  unable  to  do  so  and 
only  occurs  in  a  Jun-Fos  heterodimer.  We  are  also 
applying  this  approach  to  glycophorin  A  and 
other  membrane  proteins  that  are  believed  to 
form  predominantly  helical  structural  elements 
crossing  the  membrane.  Interesting  results  were 
obtained  during  the  past  year  for  vacuum  simula- 
tions. Currently  we  are  attempting  to  incorporate 
the  environment  (water  and  lipid  molecules) 
into  the  simulation.  A  rapidly  increasing  amount 
of  experimental  data  on  mutants  of  the  systems 
studied  will  allow  a  thorough  comparison  of  the 


theoretical  predictions  and  the  experimental 
results. 

Macromolecular  Simulation  of 
Free-Energy  Differences 

We  are  involved  in  a  number  of  projects  that 
are  aimed  at  simulating  free-energy  differences 
between  two  states  of  a  biological  system,  using 
the  free-energy  perturbation  technique.  Our 
goals  are  to  investigate  microscopically  the  struc- 
ture and  stability  of  protein  secondary  structural 
elements  and  protein-peptide  complexes  and  to 
evaluate  the  reliability  of  free-energy  calcula- 
tions and  molecular  dynamics  simulations  as 
tools  for  this  purpose.  In  particular,  we  are  study- 
ing 1)  the  complexes  of  bovine  pancreatic  ribo- 
nuclease  S  and  a  number  of  mutants  of  the  S  pep- 
tide for  which  x-ray  crystal  structures,  binding 
free  energies,  and  enthalpies  have  been  obtained 
by  Frederic  Richards,  Julian  Sturtevant,  and  their 
colleagues  (Yale  University)  and  2)  the  com- 
plexes of  L-tryptophan  and  several  of  its  chemical 
analogues  with  Escherichia  coli  trp  aporepres- 
sor,  for  which  x-ray  crystal  structures  and  binding 
free  energies  have  been  measured  by  Paul  Sigler 
(HHMI,  Yale  University)  and  his  colleagues. 

We  also  plan  to  carry  out  free-energy  calcula- 
tions on  mutants  of  /3-turns  in  staphylococcal  nu- 
clease, in  a  joint  project  with  Robert  Fox  (HHMI, 
Yale  University) .  High-resolution  x-ray  structures 
of  various  mutants  and  thermodynamic  data 
through  temperature-dependent  NMR  studies 
have  been  obtained.  Specifically,  we  would  like 
to  simulate  the  free-energy  differences  between 
the  cis  and  trans  conformations  of  Pro- 1 1 7  for  a 
number  of  point  mutations  in  the  vicinity  of  the 
proline  residue. 
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i  versity,  where  he  solved  the  three-dimensional  structure  of  leucine  aminopeptidase. 


WE  are  interested  in  developing  a  detailed 
understanding  of  the  physical  principles 
that  govern  the  general  problem  of  molecular  rec- 
ognition in  biological  systems.  The  systems  we 
have  chosen  to  study  are  models  for  gene  regula- 
tion in  eukaryotes  and  slow-binding  inhibition  of 
enzymes. 

Our  approach  is  to  use  the  methods  of  x-ray 
crystallography  to  determine  the  three-dimen- 
sional structures  of  biological  macromolecules 
and  their  complexes  with  DNA  or  other  ligands. 
These  structures  contain  a  wealth  of  atomic  detail 
that  can  be  analyzed  with  biochemical,  molecu- 
lar genetic,  and  theoretical  methods  to  provide  a 
functional  description  of  the  intra-  and  intermo- 
lecular  interactions  responsible  for  stabilizing 
macromolecular  complexes. 

In  the  long  term,  we  hope  that  our  structural 
studies  and  analyses  will  allow  us  to  exploit  the 
powerful  formalism  of  physics  to  classify  system- 
atically the  interactions  between  individual 
atoms  that  effect  molecular  recognition  in  biolog- 
ical systems.  We  believe  that  such  a  quantitative 
understanding  will  ultimately  permit  us  to  har- 
ness the  machinery  of  molecular  recognition  and, 
thereby,  make  defined  interventions  into  impor- 
tant biochemical  processes  such  as  disease  states. 

Eukaryotic  Gene  Regulation 

We  have  begun  to  examine  two  model  systems  of 
eukaryotic  gene  regulation,  with  the  goal  of  im- 
proving our  understanding  of  the  structural  and 
physical  bases  of  transcriptional  control  of  genes. 
First,  we  are  collaborating  with  Robert  Roeder  (the 
Rockefeller  University)  on  x-ray  crystallographic 
studies  of  transcription  factor  IID  (TFIID)  and  up- 
stream stimulatory  factor  (USF).  These  two  pro- 
teins are  involved  in  transcription  of  class  II  genes 
in  eukaryotes.  TFIID  binds  to  the  TATA  consensus 
sequence  and  functions  as  a  general  transcription 
initiation  factor.  USF  is  a  member  of  the  c-myc-rc- 
lated  family  of  DNA-binding  proteins  and  contains 
both  a  helix-loop-helix  motif  and  a  leucine  repeat. 
It  binds  as  a  dimer  to  an  upstream  activating  se- 
quence GGCCACGTGACC.  During  transcription. 


TFIID  and  USF  bind  to  DNA  in  close  proximity  and 
interact  with  each  other  to  enhance  both  DNA  bind- 
ing and  transcription.  In  addition  to  determining 
the  three-dimensional  structures  of  these  proteins 
complexed  to  their  respective  promoter  elements, 
we  hope  to  determine  the  structure  of  a  complex 
consisting  of  TFIID,  USF,  and  DNA. 

Second,  we  are  collaborating  with  Eseng  Lai 
(Memorial  Sloan-Kettering  Cancer  Center)  on 
structural  studies  of  human  hepatocyte  nuclear 
factor  3-  This  transcriptionally  active  protein  be- 
longs to  a  gene  family  in  mammals  that  is  homolo- 
gous to  the  Drosophila  homeotic  gene  fork 
head.  These  diverse  proteins  share  a  highly  con- 
served DNA-binding  region,  which  bears  no  simi- 
larity to  previously  defined  DNA-binding  motifs 
and  is  thought  to  represent  an  entirely  new  type 
of  DNA-binding  protein. 

Successful  three-dimensional  structure  deter- 
minations of  these  model  systems  will  provide 
insights  into  three  different  modes  of  DNA-pro- 
tein  interaction  and  may  also  give  some  informa- 
tion about  intermolecular  interactions  between 
proteins  bound  to  the  same  piece  of  DNA. 

Slow-binding  Enzyme  Inhibition 

My  previous  work  in  William  Lipscomb's 
laboratory  (Harvard  University)  employed  x-ray 
crystallography  to  determine  the  first  three-di- 
mensional structure  of  an  amino-terminal  exo- 
protease,  a  two-zinc  metalloenzyme  known  as 
leucine  aminopeptidase  (LAP)  isolated  from  bo- 
vine lens.  We  discovered  that  the  enzyme  consists 
of  two  unequal  a/fi  domains.  In  its  active  form  it 
exists  as  a  hexamer,  which  resembles  the  alloste- 
ric  enzyme  aspartate  transcarbamoylase  in  appear- 
ance. The  enzyme's  active  site  consists  of  a  novel 
bimetallic  cluster,  with  the  two  zinc  ions  sepa- 
rated by  2.9  A  and  coordinated  only  by  carboxyl- 
ate  and  carbonyl  oxygen  atoms.  The  precise 
structure  of  the  active  site  allowed  us  to  suggest  a 
mechanism  by  which  the  enzyme  may  effect  pep- 
tide bond  hydrolysis. 

In  addition,  we  determined  the  isomorphous 
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structure  of  the  enzyme  complexed  with  its  slow- 
binding  inhibitor  bestatin,  which  is  a  dipeptide 
analogue  that  lacks  a  scissile  peptide  bond.  A  de- 
tailed analysis  of  the  interactions  between  besta- 
tin and  the  active-site  residues  allowed  us  to  sug- 
gest a  mechanism  of  slow-binding  inhibition 
of  LAP  and  other  bimetallic,  amino-terminal 
exoproteases. 

Finally,  the  three-dimensional  structure  of  LAP 


also  provides  some  insights  into  the  role  that 
post-translational  modification  plays  in  the  problem 
of  recombination.  LAP  is  homologous  to  pep  A,  a 
manganese-dependent,  hexameric  aminopeptidase 
found  in  Escherichia  coli  that  is  required  for  plas- 
mid  ColEl  site-specific  recombination. 

We  will  continue  similar  work  with  other  en- 
zyme systems  of  biological  and  biochemical 
importance. 


A  computer  graphics  representation  of  the  three-dimensional  structure  of  the  hexameric,  zinc- 
dependent  leucine  aminopeptidase  from  bovine  lens  that  has  been  determined  to  2.25  A  resolu- 
tion. This  enzyme  is  responsible  for  post-translational  modification  of  proteins  and  oligopeptides 
and  for  degradation  of  aging  proteins  in  a  diverse  group  of  organisms  ranging  from  Escherichia 
coli  to  humans. 


Research  of  Stephen  K.  Burley. 
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IN  a  wide  variety  of  cellular  responses — in- 
cluding contraction,  secretion,  and  prolifera- 
tion— Ca^^  functions  as  a  ubiquitous  intracellu- 
lar messenger.  Two  common  pathways  by  which 
intracellular  Ca^^  transients  can  be  triggered 
have  been  identified  in  cells.  The  first  involves 
the  influx  of  Ca^"^  into  the  cytoplasm  from  the 
extracellular  medium  through  plasma  membrane 
Ca^^  channels,  and  the  second  involves  the  re- 
lease of  Ca^^  into  the  cytoplasm  from  intracellu- 
lar stores  through  Ca^^  release  channels. 

Over  the  last  nine  years,  we  have  investigated 
the  structure  and  function  of  the  membrane 
components  involved  in  Ca^^  fluxes  across  mem- 
branes. In  particular,  we  have  focused  on  identi- 
fying, purifying,  and  characterizing  the  mem- 
brane proteins  that  function  as  surface  Ca^^ 
channels  and  intracellular  Ca^"*^  release  channels 
in  excitable  cells. 

Excitation-Contraction  Coupling 
in  Skeletal  Muscle 

Muscle  contraction  is  initiated  by  a  depolariza- 
tion of  the  transverse  tubular  membrane,  which 
in  turn  signals  the  release  of  Ca^^  from  the  junc- 
tional sarcoplasmic  reticulum.  One  specific  aim 
of  our  research  is  to  understand  the  structure  and 
function  of  protein  components  of  the  sarcoplas- 
mic reticulum  membrane.  We  have  purified  the 
ryanodine  receptor  of  rabbit  muscle  sarcoplas- 
mic reticulum  and  have  shown  that  it  can  mediate 
single-channel  activity  identical  to  that  of  the  sar- 
coplasmic reticulum's  Ca^^  release  channels.  The 
morphology  of  the  purified  receptor  has  revealed 
that  it  is  identical  to  the  "SR  feet"  and  thus  would 
seem  to  play  a  dual  role  in  excitation-contraction 
coupling  as  the  Ca^^  release  channel  and  the 
bridging  structure  in  the  junctional  gap. 

A  second  specific  aim  of  this  research  concerns 
the  dihydropyridine-sensitive  Ca^^  channel  of 
skeletal  muscle  and  its  dual  role  as  a  voltage  sen- 
sor for  excitation-contraction  coupling  and  a 
Ca^"^  channel.  The  dihydropyridine  receptor  has 
been  purified  from  rabbit  skeletal  muscle  and 
shown  to  consist  of  four  subunits — a,,  012,  and 
7.  In  the  last  year  we  determined  the  structure  of 


the  7-subunit  by  isolating  cDNAs  from  an  expres- 
sion library.  We  deduced  that  the  primary  struc- 
ture is  a  25,058-dalton  protein  containing  four 
transmembrane  domains  and  two  N-linked  glyco- 
sylation  sites.  This  description  is  consistent  with 
biochemical  analyses  showing  that  the  7-subunit 
is  a  glycosylated  hydrophobic  protein. 

We  have  also  analyzed  the  dihydropyridine  re- 
ceptor's a2-subunit  and  associated  b  peptides. 
The  results  have  shown  that  the  b  peptides  are  not 
true  subunits,  but  instead  represent  the  carboxyl- 
terminal  peptide  of  a  proteolytically  processed 
a2-subunit.  The  goal  of  this  project  now  is  to  de- 
termine how  the  receptors  for  dihydropyridine 
and  ryanodine  are  coupled  in  the  excitation- 
contraction  process. 

Neuronal  Ca^"^  Release  Channels 

A  second  area  of  research  in  our  laboratory  con- 
cerns Ca^^  release  channels  in  neuronal  cells.  Ino- 
sitol 1,4,5-trisphosphate  (IP3),  produced  after 
receptor  activation,  is  an  important  second  mes- 
senger in  the  release  of  stored  Ca^^  from  intracel- 
lular compartments.  Although  IP3  appears  to  have 
a  major  role  in  Ca^"*^  regulation  within  neurons, 
physiological  and  pharmacological  evidence  has 
indicated  the  presence  of  non-IPj-gated  Ca^^ 
pools.  We  have  identified  a  high-afifinity  ryano- 
dine receptor  in  rabbit  brain  membranes  as  a 
400-kDa  protein  that  cross-reacts  with  antibodies 
against  the  sarcoplasmic  reticulum  ryanodine  re- 
ceptor and  migrates  as  a  large  oligomeric  com- 
plex of  about  30s  on  sucrose  density  gradients. 
Although  we  have  not  identified  the  exact  physio- 
logical role  of  the  brain  ryanodine  receptor,  we 
are  now  attempting  to  purify  it  so  that  it  may  be 
shown  responsible  for  the  non-IPj-gated  Ca^^  re- 
lease in  neurons. 

Molecular  Pathogenesis 
of  Muscular  Dystrophy 

A  third  area  of  research  concerns  the  structure 
and  function  of  dystrophin,  the  high-molecular- 
weight  protein  product  of  the  human  Duchenne 
muscular  dystrophy  (DMD)  gene.  Dystrophin  oc- 
curs in  the  sarcolemma  (fiber  membrane)  of  nor- 
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mal  skeletal  muscle,  but  is  absent  from  the  mus- 
cle of  DMD  patients  and  mdx  mice.  The  pre- 
dicted amino  acid  sequence  of  dystrophin 
suggests  that  it  is  a  membrane  cytoskeletal  pro- 
tein and,  furthermore,  that  DMD  is  a  disease  of 
the  membrane  cytoskeleton. 

Studies  of  other  genetic  diseases  involving  the 
cytoskeleton  have  shown  that  a  deficiency  in  a 
major  cytoskeletal  component  can  severely  affect 
the  integrity  of  the  cell  membrane.  For  example, 
spectrin  deficiency  results  in  fragile  erythrocyte 
membranes  and  thus  severe  hemolytic  anemia. 

In  comparison,  dystrophin  has  not  been  consid- 
ered to  play  a  major  structural  role  in  the  mem- 
brane cytoskeleton  of  skeletal  muscle  because  of 
its  low  abundance  of  approximately  0.002  per- 
cent of  total  muscle  protein.  In  the  past  year  we 
have  determined  the  cytoskeletal  properties  and 
relative  abundance  of  dystrophin  in  a  highly  puri- 
fied preparation  of  skeletal  muscle  sarcolemma. 
We  have  found  that  it  constituted  fully  2  percent 
of  the  sarcolemma  protein.  This  suggests  that  it 
does  play  a  major  structural  role  and  that  the  in- 
tegrity of  DMD  muscle  sarcolemma,  through  lack 
of  dystrophin,  is  likely  to  be  compromised. 

In  order  to  understand  the  molecular  pathogen- 
esis of  DMD,  it  is  imperative  to  identify  the  pro- 


teins that  are  associated  and/or  linked  to  dystro- 
phin, since  the  absence  of  one  component  of  the 
membrane  cytoskeleton  is  sometimes  accompa- 
nied by  the  loss  of  another.  In  the  past  year  our 
laboratory  has  succeeded  in  purifying  a  dystro- 
phin-glycoprotein complex  from  rabbit  skeletal 
muscle  and  identifying  five  dystrophin-asso- 
ciated  proteins,  including  four  glycoproteins. 

In  addition  to  dystrophin,  the  complex  con- 
tains a  59-kDa  protein  triplet  and  four  glycopro- 
teins of  1 56,  50,  43,  and  35  kDa.  We  have  begun 
to  characterize  the  status  of  these  proteins  in  dys- 
trophin-free  muscle.  One  interesting  finding  was 
a  dramatic  (about  90  percent)  deficiency  of  the 
156-kDa  dystrophin-associated  glycoprotein  in 
muscle  from  mdx  mice  and  DMD  patients.  Thus 
the  marked  reduction  of  the  156-kDa  glycopro- 
tein in  dystrophic  muscle,  and  possibly  of  other 
dystrophin-associated  proteins,  maybe  the  initial 
step(s)  involved  in  the  molecular  pathogenesis 
of  muscular  dystrophy. 

Our  goal  for  the  next  year  is  to  clone  each  of 
the  dystrophin-associated  glycoproteins  and  de- 
termine their  structure  and  possible  function. 
The  results  should  help  to  define  the  function  of 
dystrophin  and  explain  how  its  absence  results 
in  DMD. 
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HOMOLOGOUS  recombination  between  DNA 
sequences  residing  in  the  chromosome  and 
newly  introduced  DNA  sequences,  termed  gene 
targeting,  provides  the  means  for  specifically 
modifying  any  gene  in  any  desired  manner  in  cul- 
tured mammalian  cells.  The  desired  alteration  is 
first  introduced  into  a  chosen  cloned  DNA  se- 
quence, using  standard  recombinant  DNA  tech- 
nology. Gene  targeting  is  then  used  to  transfer  the 
gene  modification  to  the  cognate  gene  in  the  ge- 
nome of  cultured  cells.  If  the  recipient  cell  for 
the  gene  modification  is  a  pluripotent,  mouse 
embryo-derived  stem  (ES)  cell,  then  the  means 
are  available  for  creating  chimeric  mice  from 
these  cells  that  will  transmit  the  altered  gene  to 
their  progeny.  With  this  technology  the  biologi- 
cal function  of  any  cloned  gene  can  be 
determined. 

Our  laboratory  is  using  this  technology  to  de- 
termine the  function  of  two  sets  of  genes  in  the 
early  development  of  the  mouse.  The  first  set  of 
genes  includes  members  of  the  int  proto-onco- 
gene  family  (int-1,  int-2  .  .  .  ),  which  are  be- 
lieved to  be  involved  in  localized  developmental 
decisions  through  cell-cell  signaling.  This  family 
of  genes  contains  at  least  20  members.  Targeted 
disruption  of  the  murine  int-1  proto-oncogene 
resulted  in  mice  with  severe  abnormalities  in 
midbrain  and  cerebellar  development,  indicating 
a  prominent  role  for  the  int- 1  protein  in  the  in- 
duction, from  the  neural  tube,  of  the  mesenceph- 
alon and  metencephalon.  Some  of  these  mice 
survive  birth  and  exhibit  severe  ataxia  (i.e., 
loss  of  balance  and  coordinated  movement) .  Sur- 
prisingly, despite  missing  a  large  portion  of 
their  midbrain,  these  mice  respond  fairly  nor- 
mally to  stimuli  of  light,  sound,  smell,  and  touch. 


We  have  also  created  mice  with  null  mutations  in 
int-2.  Mice  heterozygous  for  this  mutation  ap- 
pear normal  and  are  fertile.  We  are  in  the  process 
of  analyzing  the  phenotype  of  int-2  homo- 
zygotes. 

The  second  set  of  genes  that  we  are  analyzing 
are  transcription  factors  participating  in  a  devel- 
opmental program  for  specifying  positional  in- 
formation along  the  anteroposterior  axis  of  the 
early  mouse  embryo.  These  mammalian  genes  are 
homologues  of  the  Drosophila  Ultrabithorax 
and  Antennapedia  genes.  In  invertebrates  these 
genes  specify  the  identity  of  cells  within  each 
parasegment.  The  function  of  the  corresponding 
genes  in  the  human  and  mouse  is  not  known. 
However,  their  expression  pattern  argues  for  an 
equally  important  role  in  mammalian  develop- 
ment. In  the  human  and  mouse  this  set  of  genes, 
known  collectively  as  the  Hox  genes,  contains  at 
least  30  members.  We  have  used  gene  targeting  to 
disrupt  specifically  a  dozen  of  these  genes  in 
mouse  ES  cells.  These  cells  are  in  turn  being  used 
to  generate  chimeric  mice  that  transmit  the  mu- 
tant gene  to  their  progeny.  Disruption  of  the  box- 
1.5  gene  resulted  in  mice  with  multiple,  but 
regionally  restricted,  developmental  defects: 
box- 1 .5~/hox- 1 .5~  mice  are  athymic,  aparathy- 
roid,  and  have  reduced  thyroid  and  submaxillary 
tissue.  They  also  exhibit  a  wide  spectrum  of 
throat  abnormalities,  including  shortened  necks, 
abnormal  larynx,  truncated  soft  palate,  and  poor 
organization  of  muscle  tissue.  In  addition,  these 
mice  often  feature  defects  of  the  heart  and  arter- 
ies, as  well  as  craniofacial  abnormalities.  This 
collection  of  deficiencies  is  remarkably  similar  to 
the  pathology  afflicting  humans  with  DiGeorge 
syndrome,  a  congenital  disorder. 


71 


Prepattern  of  sensory  organs  in  imaginal  wing  disc  of 
Drosophila.  The  bright  spots  are  cells  expressing  the 
achaete  protein,  which  will  give  rise  to  adult  sensory 
structures. 

From  Skeath,  J.B.,  and  Carroll,  S.B.  1991.  Genes  Dev 
5:984-995. 
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VIRTUALLY  every  animal  species  can  be  distin- 
guished by  its  external  and/or  internal  ap- 
pearance, its  morphology.  Gross  similarities  in 
external  or  internal  organization  usually  reflect 
an  evolutionary  relationship  between  animals. 
Widespread  similarities  in  body  organization  may 
reflect  a  common  evolutionary  origin  of  a  large 
number  of  w^ell-diversified  species.  For  example, 
a  segmentally  organized  body  plan,  present  in  an- 
nelids (e.g.,  earthworms),  arthropods  (lobsters, 
spiders,  insects) ,  and  to  a  lesser  degree  in  verte- 
brates (snakes,  mice,  humans),  is  one  of  the  most 
general  forms  of  organization  in  the  animal  king- 
dom. Evolutionary  biologists  believe  that  modifi- 
cations of  the  basic  repeating  segmental  pattern 
found  in  earthworms  and  millipedes,  for  exam- 
ple, have  led  to  the  diverse  array  (perhaps  more 
than  1  million  species)  of  arthropods  found  to- 
day. For  instance,  it  is  thought  that  a  gradual  re- 
duction in  leg  number  on  the  more  posterior  seg- 
ments of  a  many-legged  animal  has  led  to  the 
present-day  typical  insect  body  plan,  with  only 
three  pairs  of  legs  protruding  from  the  thorax  fol- 
lowed by  a  legless  abdomen. 

To  understand  how  body  patterns  evolve,  we 
must  first  learn  about  how  body  plans  develop. 
To  this  end,  we  and  several  other  HHMI  laborato- 
ries are  engaged  in  a  detailed  study  of  the  genetic 
program  that  guides  the  development  of  the  fruit 
fly  Drosophila  melanogaster.  We  focus  on  those 
genes  that  are  critical  to  the  organization  of  the 
segmental  outline  of  embryo  and  adult,  control- 
ling the  overall  pattern  and  function  of  body  seg- 
ments. The  wealth  of  genetic  information  gath- 
ered over  the  past  70  years  and  the  explosion  of 
Drosophila  embryology  and  molecular  biology 
over  the  past  1 0  years  make  it  the  best  complex 
animal  model  for  understanding  the  genetic  regu- 
lation of  cell  behavior  and  the  three-dimensional 
organization  of  tissues,  organs,  and  entire 
organisms. 

Gene  Activity  During  Drosophila 
Development 

The  genetic  control  of  pattern  formation  can  be 


broken  down  conceptually  into  at  least  three 
phases.  The  first  consists  of  a  molecular  prepat- 
tern,  revealed  as  chemical  changes  taking  place 
in  different  regions  of  the  animal  that  foreshadow 
the  cellular  events  to  follow.  For  example,  in  the 
Drosophila  embryo,  certain  key  proteins  come  to 
be  expressed  in  stripes,  which  represent  the  seg- 
mental divisions  to  form  later.  The  second  phase 
of  segmental  pattern  formation  involves  the  es- 
tablishment of  a  segmentally  repeating  array  of 
stem  cells  within  the  diff'erent  germ  layers  that 
give  rise  to  the  different  tissues  of  the  animal.  For 
example,  genes  such  as  those  of  the  achaete- 
scute  complex  (AS-C)  are  activated  only  in  the 
stem  cells  of  the  central  and  peripheral  nervous 
system.  Finally,  in  the  third  phase,  these  stem 
cells  divide,  giving  rise  to  the  full  complement  of 
differentiated  cells  that  make  up  different  tissues 
and  organs  and  express  distinct  structural  genes 
to  carry  out  their  specialized  tasks. 

From  molecular  prepattern  to  stem  cell  forma- 
tion to  the  differentiation  of  their  progeny,  there 
is  a  flow  of  genetic  information.  The  prepattern 
specifies  the  spatial  domains  of  genes  that  are  ac- 
tivated in  stem  cells,  and  these  genes  in  turn  regu- 
late cell-type-specific  gene  expression.  Our  labo- 
ratory is  interested  in  the  genetic  basis  of  this 
information  flow,  how  it  operates  and  is  regu- 
lated. From  the  level  of  overall  body  organization 
to  the  molecular  details  of  gene  regulation,  we 
want  to  know  which  genes  are  key  to  pattern  regu- 
lation, how  they  coordinate  gene  expression  and 
cell  behavior,  and  how  they  and  the  regulatory 
programs  they  belong  to  have  changed  in  the 
course  of  the  morphological  diversification  of 
animals. 

To  this  end,  our  laboratory  focuses  on  three 
aspects  of  the  genetic  control  of  pattern  forma- 
tion during  Drosophila  development.  First,  we 
are  working  on  the  molecular  regulation  of  the 
pair-rule  genes,  the  first  genes  expressed  in  a  seg- 
mentally repeating  prepattern.  Second,  we  are 
studying  how  the  early  embryonic  prepattern 
genes  governing  segment  number,  segment  polar- 
ity, segment  identity,  and  germ-layer  specificity 
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regulate  the  expression  of  the  stem  cell-specific 
proneural  genes  of  the  AS-C.  Finally,  we  are  work- 
ing to  understand  better  the  prepattern  that  gov- 
erns pattern  formation  in  imaginal  discs — the 
distinct  pouches  of  cells  that  are  set  aside  during 
embryogenesis,  proliferate  extensively  during 
larval  development,  and  give  rise  to  adult  struc- 
tures such  as  the  wing,  leg,  and  eye. 

Regulation  of  Segmentation  Gene 
Expression 

During  early  Drosophila  development,  small 
batteries  of  genes  are  expressed  in  rapid  succes- 
sion to  establish  the  segmental  prepattern.  The 
first  genes  expressed  in  a  periodic  pattern  are  the 
pair- rule  genes,  deployed  in  one  stripe  per  every 
two  segments.  Previous  studies  have  shown  that 
the  generation  of  some  pair-rule  patterns  is  de- 
rived directly  from  other  such  patterns,  i.e., 
stripes  carving  more  stripes,  but  a  select  group  of 
pair-rule  patterns  is  generated  from  the  aperiodic 
pattern  of  the  gap  proteins,  a  small  number  of 
transcription  factors  expressed  in  broad  partially 
overlapping  zones  of  the  early  embryo.  The  hairy 
gene  is  one  pair-rule  gene  regulated  in  this  way. 
Our  laboratory  is  trying  to  determine  how  the 
many  gap  genes  carve  the  hairy  pattern  into 
stripes.  One  unexpected  observation  is  that  sev- 
eral of  the  gap  proteins  appear  to  act  by  repress- 
ing hairy  transcription  in  discrete  regions  of  the 
embryo,  as  opposed  to  activating  specific  stripes. 
The  elucidation  of  the  direct  activators  and  re- 
pressors of  hairy  transcription  and  their  sites  of 
action  within  the  upstream  regulatory  DNA  of  the 
hairy  locus  is  a  major  goal  of  our  present  work. 

Segmental  Pattern  Regulation  and 
Proneural  Gene  Expression 

The  stripes  of  pair-rule  gene  products  are  not 
sufficient  to  specify  the  precise  location  of  indi- 


vidual cells  of  any  tissue  or  organ.  Rather,  multi- 
ple regulatory  systems  that  determine  segment 
number,  polarity,  identity,  and  germ-layer  posi- 
tion must  all  converge  upon  subsets  of  other 
genes  to  specify  the  three-dimensional  coordi- 
nates of  individual  cells.  To  achieve  a  better  un- 
derstanding of  this  process,  we  have  begun  a  de- 
tailed study  of  the  spatial  regulation  of  two  genes 
of  the  AS-C,  which  promote  neural  development. 
Once  the  early  prepatterns  are  in  place,  the  posi- 
tional information  they  provide  determines 
where  the  genes  of  the  AS-C  will  be  active.  We 
have  found  that  the  products  of  the  achaete  and 
scute  genes,  which  themselves  are  regulatory 
proteins,  are  expressed  at  distinct  dorsoventral 
positions  of  the  neuroectoderm  in  both  a  seg- 
mentally  repeating  and  segmentally  modulated 
pattern.  The  identification  of  the  direct  regula- 
tors of  AS-C  gene  expression  and  the  sites  through 
which  they  act  is  a  second  major  thrust  of  the 
laboratory. 

Prepatterns  in  Imaginal  Discs 

One  of  the  best  known  but  least  understood 
aspects  of  Drosophila  development  concerns  the 
morphogenesis  of  adult  structures  from  the  larval 
imaginal  tissues.  The  eyes,  wings,  legs,  antennae, 
and  other  appendages  are  derived  from  these  sacs 
of  cells,  which  undergo  extensive  changes  during 
larval  growth  and  pupation.  Many  of  the  genes 
that  control  the  pattern  of  the  relatively  small 
numbers  of  cells  of  each  embryonic  segment  also 
control  the  final  morphology  of  the  adult.  The 
molecular  prepatterns  that  govern  eye,  wing,  leg, 
and  antennal  development  are  not  well  known. 
Our  laboratory  has  embarked  on  a  detailed  study 
of  certain  genes  expressed  in  imaginal  discs,  such 
as  hairy,  the  AS-C,  and  others,  in  order  to  define 
better  the  imaginal  prepatterns  and  the  morpho- 
genetic  processes  they  control. 
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MOLECULAR  genetics  offers  unprecedented 
opportunities  for  correction  of  single-gene 
defects,  the  development  of  simple  DNA-based 
diagnostics,  and  the  discovery  of  disease  genes. 
This  laboratory  has  made  significant  progress  in 
the  development  of  gene  replacement  therapies 
for  three  diseases.  The  isolation  of  the  fragile  X 
locus  has  been  achieved  as  a  collaborative  effort 
with  two  laboratories.  DNA-based  methods  for 
diagnosis  of  affected  males  and  carrier  females 
are  now  fully  developed  for  four  diseases  in- 
herited through  the  X  chromosome. 

Genetic  Correction  of  Inherited  Disease 

Human  genes  can  now  be  cloned,  placed  into 
defective  viruses  used  as  vectors,  and  transferred 
into  other  cultured  cells,  embryonic  cells,  and 
mice.  These  encouraging  developments  increase 
the  likelihood  of  successful  gene  replacement 
therapy.  Our  laboratory  is  developing  technology 
toward  that  objective  for  three  heritable  diseases. 
Each  disease  offers  different  technical  and  strate- 
gic challenges. 

Adenosine  deaminase  (ADA)  deficiency  is 
an  inherited  autosomal  recessive  disease.  Bone 
marrow  transplantation  provides  a  once-per-life- 
time  cure,  but  carries  the  potential  complica- 
tions of  graft-versus-host  reaction.  PEG-ADA  ad- 
ministration (ADA  attached  to  polyethylene 
glycol)  on  a  continuing  basis  has  provided  im- 
provement in  immunologic  function  of  patients, 
but  not  a  cure.  We  reported  previously  our  high- 
efficiency  retroviral  delivery  of  a  human  ADA 
minigene  to  mouse  stem  cells  that,  when  re- 
turned to  a  lethally  irradiated  mouse,  resulted  in 
99  percent  rescue  and  high-level,  long-term  ex- 
pression of  the  gene.  Thus  stem  cell  infection  has 
been  achieved  in  mice,  and  recent  human  experi- 
ments are  equally  encouraging. 

Both  "safe"  virus  and  helper  cell  lines  were 
used  to  deliver  the  human  ADA  gene  to  normal 
and  ADA-deficient  long-term  bone  marrow  cul- 
tures. The  infection  rate  exceeds  90  percent,  and 
the  level  of  ADA  expression  is  equal  to  normal 
myeloblastic  progenitors.  This  success  is  a  conse- 
quence of  precisely  defined  conditions  of  stem 


cell  stimulation  and  growth.  The  parameters  in- 
clude cocultivation  of  human  bone  marrow  cul- 
tures with  viral  producer  lines  and  selected  re- 
combinant interleukin  cell-stimulatory  reagents. 
Presently  the  research  is  focused  on  the  use  of 
CD34  monoclonal  selected  cells.  These  have 
stem  cell  properties  and  represent  about  lO""*- 
10~'  of  human  bone  marrow  cells.  If  transfer  can 
be  mediated  by  these  selected  cells,  the  current 
large  cell-culture  requirements  for  the  human 
experiment  are  reduced  to  a  trivial  number  of 
cells.  Clinical  protocols  are  proposed  for  1991. 

Ornithine  transcarbamylase  (OTC)  defi- 
ciency, the  most  common  urea  cycle  defect  in 
humans,  is  inherited  in  an  X-linked  recessive 
manner.  The  coma,  seizures,  and  retardation  are 
the  result  of  hyperammonemia  secondary  to  the 
enzyme  deficiency.  Liver  transplantation  pro- 
vides a  once-per-lifetime  cure,  although  again 
with  the  potential  complications  of  graft-versus- 
host  reaction.  This  condition  is  poorly  managed 
through  dietary  protein  restriction  and  medical 
therapy.  Two  mouse  models,  sparse  fur  and  ash, 
are  available  for  developing  gene  correction  tech- 
nology. Using  a  human  OTC  minigene  under  reg- 
ulation of  a  small  bowel  liver-specific  promoter, 
the  sparse  fur  mouse  has  been  totally  corrected 
of  the  deficiency,  including  both  coat  features 
and  metabolic  defects. 

Surprisingly,  the  correcting  enzyme  was  ex- 
pressed in  small  bowel  (an  ectopic  tissue)  and 
not  in  liver.  This  observation  has  directed  our  ef- 
forts for  human  correction  to  two  target  organs: 
liver  and  small  bowel.  Toward  that  objective,  two 
viral  vectors  are  now  available  and  are  found  to 
transfer  human  OTC  successfully  to  cells  in  cul- 
ture. For  liver  delivery,  we  use  a  retroviral  vector 
with  tissue-specific  promoter.  For  small  bowel 
delivery,  we  use  a  defective  adenoassociated 
virus.  Each  has  biosafety  features  and  delivers 
genes  efficiently.  Our  studies  now  focus  on  the 
correction  of  the  ash  mouse  mutant,  which  has 
profound  OTC  deficiency.  All  human  studies 
currently  involve  liver  cells  in  culture. 

Duchenne  muscular  dystrophy  is  a  severe 
disorder  also  inherited  in  an  X-linked  recessive 
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manner.  Deficiency  of  the  protein  dystrophin 
leads  to  mukiple  abnormalities  of  muscle  dys- 
function and  death.  The  gene  is  large  and  its 
mRNA  complex.  Three  mouse  models  of  DMD  are 
available  for  study.  We  have  recently  developed 
methods  for  the  isolation  of  a  full-length  mouse 
dystrophin  minigene  that  functions  normally  in 
cells  and  mice.  Using  special  muscle-specific 
promoters  controlling  the  dystrophin  minigene, 
correction  of  the  deficiency  in  the  mdx  mouse 
model  has  been  achieved. 

Current  research  efforts  on  DMD  feature  mouse 
and  human  studies.  Our  direction  in  the  former 
studies  is  to  develop  efficient  methods  for  gene 
delivery  to  muscle  cells  of  the  adolescent  mdx 
mouse.  In  the  human  studies  we  have  embarked 
on  a  collaborative  effort  (with  Helen  Blau,  Stan- 
ford University)  to  isolate  DMD  muscle  satellite 
cells  from  very  young  patients.  These  cells  will 
be  the  target  for  gene  delivery  and  will  be  re- 
turned to  the  patient. 

Gene  Isolation  and  Molecular  Diagnostics: 
Fragile  X  Locus 

The  end  of  the  long  arm  of  the  X  chromosome 
contains  some  40  disease  genes,  including  those 
for  two  mental  retardation  disorders  (fragile  X 
syndrome  and  the  oculocerebrorenal  syndrome 
of  Lowe)  and  a  severe  muscular  dystrophy 
(Emery- Dreifuss) .  Yeast  artificial  chromosomes 
(YACs)  containing  80  percent  of  this  40-mega- 
base  region  have  been  isolated  and  analyzed  for 
possible  involvement  in  these  disorders.  Clones 
containing  the  region  involved  in  Lowe's  syn- 
drome and  fragile  X  have  been  obtained,  and  the 
identification  of  the  genes  involved  is  under  way. 

In  the  case  of  fragile  X,  the  YAC  identified  has 
allowed  the  development  of  a  new  marker,  based 
on  the  polymerase  chain  reaction  (PCR) ,  that  im- 
proves fragile  X  diagnosis;  narrows  the  region  re- 


sponsible for  the  syndrome  to  about  200  kb;  and 
provides  the  material  for  deciphering  the  nature 
of  this  most  common  form  of  inherited  mental 
retardation.  Molecular  diagnosis  of  fragile  X  now 
exceeds  the  accuracy  of  cytogenetic  studies  and 
has  been  used  already  at  Baylor  in  three  prenatal 
cases. 

Gene  Isolation  and  Molecular  Diagnostics: 
Duchenne  Muscular  Dystrophy 

Simultaneous  (multiplex)  PCR  amplification 
of  multiple  regions  of  the  DMD  gene  is  now  possi- 
ble with  81  percent  accuracy  for  deletions  and 
duplications,  as  determined  by  a  cooperative 
study  conducted  at  14  international  laboratories. 
Genetic  linkage  analysis  is  developed  and  tested 
for  a  new  multiplex  amplification  method,  which 
employs  short  tandem  repeat  polymorphisms 
(STRs)  at  six  positions  in  the  gene.  These  devel- 
opments make  scanning  for  new  mutations  and 
genetic  linkage  efficient,  cost-effective,  and  accu- 
rate. The  techniques  apply  to  both  affected  males 
and  female  carriers. 

Gene  Isolation  and  Molecular  Diagnostics: 
Lesch-Nyhan  Syndrome 

All  Lesch-Nyhan  syndrome  diagnoses  for  both 
affected  males  and  female  carriers  are  performed 
at  the  molecular  level  by  direct  automated  se- 
quencing of  genetic  defects.  The  mechanisms  of 
mental  retardation  and  choreoathetosis  (involun- 
tary movement),  associated  with  Lesch-Nyhan 
syndrome,  are  being  studied  with  transgenic  tech- 
nology. These  studies  will  produce  mice  defi- 
cient in  both  HPRT  (hypoxanthine  guanine  phos- 
phoribosyltransferase)  and  uricase,  as  is  the  case 
in  the  human  syndrome.  Humans  and  other  homi- 
nids  appear  to  have  lost  uricase  through  peptide 
chain  termination  and  splice  junction  muta- 
tions that  occurred  approximately  20  million 
years  ago. 
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A cell  must  orchestrate  thousands  of  chemical 
reactions  in  order  to  live,  to  grow,  and  to 
respond  to  its  environment.  These  chemical  reac- 
tions rarely  happen  spontaneously,  but  are 
usually  catalyzed  by  macromolecules  called  en- 
zymes. It  was  long  thought  that  all  enzymes  were 
proteins.  More  recently  we  and  others  have  found 
that  RNA,  a  form  of  genetic  material,  can  in  some 
cases  act  as  an  enzyme. 

The  finding  of  RNA  catalysis  has  several  impli- 
cations. First,  it  means  that  RNA  is  not  restricted 
to  being  a  passive  carrier  of  genetic  information 
but  can  participate  actively  in  directing  cellular 
biochemistry.  In  particular,  many  RNA-process- 
ing  reactions  are  at  least  in  part  catalyzed  by  RNA. 
Second,  the  study  of  how  RNA  enzymes  work  may 
reveal  hitherto  unknown  mechanisms  of  biologic 
catalysis.  Third,  RNA  enzymes  (ribozymes)  can 
be  used  as  sequence-specific  RNA  cleavage  agents 
in  vitro,  providing  useful  tools  for  biochemical 
studies  of  RNA.  Finally,  on  a  more  speculative 
note,  RNA  catalysis  has  the  potential  of  providing 
new  therapeutic  agents.  For  example,  it  has  been 
suggested  that  ribozymes  directed  against  viral 
RNA  sequences  might  be  able  to  cleave  and 
thereby  inactivate  viruses  in  a  living  organism. 

Many  of  our  studies  of  RNA  catalysis  concern 
the  Tetrahymena  ribozyme,  named  for  the  sin- 
gle-celled animal  from  which  it  was  originally 
isolated.  This  RNA  enzyme  is  capable  of  cleaving 
other  RNA  molecules  (substrates)  in  a  sequence- 
specific  manner.  One  of  our  objectives  is  to  un- 
derstand the  mechanisms  by  which  this  RNA  mol- 
ecule acts  as  a  catalyst.  A  second  goal,  in  the  area 
of  structural  biology,  is  to  obtain  a  detailed  pic- 
ture of  the  active  site  of  this  ribozyme. 

In  the  past  year  we  have  demonstrated  that  this 
ribozyme  uses  a  novel  mode  of  RNA  recognition 
to  bind  its  RNA  substrate.  In  addition  to  the  well- 
established  mode  of  binding  by  formation  of  base 
pairs  (as  in  the  "ladder"  of  the  famous  DNA  dou- 
ble helix),  the  ribozyme  also  binds  two  of  the 
sugar  groups  that  form  the  "backbone"  of  the 
RNA  substrate  chain.  We  expect  that  this  type  of 
recognition  will  be  widespread  in  biology.  In  a 
separate  study,  we  used  genetic  engineering  to 


introduce  small  changes  near  the  active  site  of  the 
ribozyme.  We  were  able  to  improve  greatly  both 
the  speed  with  which  the  ribozyme  cleaves  RNA 
and  its  specificity  (its  ability  to  cleave  the  correct 
RNA  sequence  while  leaving  others  untouched). 

In  the  structural  area,  we  developed  a  method 
for  monitoring  the  folding  of  the  RNA  chain  of  the 
ribozyme.  This  allowed  us  to  understand  one  of 
the  roles  of  magnesium  ions  in  RNA  catalysis.  We 
are  now  using  the  same  technique  to  explore  the 
contributions  of  individual  structural  domains 
and  even  individual  units  (nucleotides)  to  the 
folding  of  the  ribozyme. 

Telomere  Structure 

Unlike  the  circular  chromosomes  of  bacteria, 
the  chromosomes  found  in  the  nuclei  of  higher 
organisms  are  linear  DNA  molecules.  The  ends  of 
linear  chromosomes,  called  telomeres,  must  be 
protected  from  degradation,  and  special  features 
are  required  to  ensure  their  replication.  We  are 
studying  telomere  structure  and  function,  with 
special  emphasis  on  the  protein  that  caps  off  the 
ends  of  each  chromosome. 

Most  cells  have  only  a  few  dozen  chromosomes 
and  therefore  not  many  telomeres.  We  chose  to 
work  with  the  ciliated  protozoan  Oxytricha 
nova,  because  it  has  26  million  miniature  chro- 
mosomes per  cell.  This  gives  us  a  large  amount  of 
telomeric  protein  to  study.  Similarities  in  the 
DNA  sequences  of  the  telomeres  of  Oxytricha 
and  those  of  higher  cells,  including  human  cells, 
give  us  reason  to  believe  that  our  findings  in  Oxy- 
tricha will  be  of  some  generality. 

Last  year  we  isolated  and  sequenced  the  genes 
encoding  the  two  subunits  of  the  Oxytricha  telo- 
mere-binding  protein.  We  have  now  used  genetic 
engineering  methods  to  produce  large  amounts 
of  these  protein  subunits  in  bacteria.  The  protein 
synthesized  and  purified  from  bacteria  forms  the 
same  telomeric  DNA-protein  interaction  seen  in 
living  Oxytricha.  Thus  we  now  have  a  laboratory 
system  for  studying  telomere  structure  and  func- 
tion that  is  convenient  and  appears  to  be  faithful 
to  the  biological  system. 

We  have  also  begun  to  search  for  proteins  with 
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a  similar  function  in  human  cells  and  have  ob- 
tained some  promising  initial  results.  Telomeres 
appear  to  help  organize  chromosomes  within  the 
nucleus,  and  this  organization  has  been  reported 


to  be  perturbed  in  cancer  cells.  Information 
about  the  human  telomere-binding  protein  and 
its  interaction  with  the  nuclear  envelope  may 
therefore  be  of  medical  importance. 
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THE  immune  system  acts  in  a  critically  impor- 
tant fashion  to  protect  the  organism  from  a 
vast  array  of  infectious  and  parasitic  agents.  To 
execute  this  function  the  immune  system  has 
evolved  potent  mechanisms  to  eliminate  or  de- 
stroy invading  organisms.  Because  these  effector 
mechanisms  are  so  potent,  stringent  regulation  of 
the  activation  of  the  immune  response  is  very  im- 
portant. Inappropriate  activation  of  immune  ef- 
fector systems  or  failure  of  the  immune  system  to 
distinguish  self  tissues  from  invading  pathogens 
can  lead  to  serious  damage  to  the  host.  Examples 
of  such  failures  of  normal  immune  regulation  are 
insulin-dependent  diabetes  mellitus  (in  which 
the  immune  system  participates  prominently  in 
the  destruction  of  the  insulin-producing  cells  of 
the  pancreas) ,  rheumatoid  arthritis  (in  which  ab- 
normal self-reactive  antibodies  cause  destructive 
inflammation  in  joints),  and  systemic  lupus  er- 
ythematosus (in  which  a  broad  spectrum  of  self- 
reactive  antibodies  induce  inflammatory  damage 
in  many  tissues) . 

Proper  regulation  of  immune  defenses  depends 
on  the  coordinate  action  of  an  array  of  different 
host  mechanisms.  Many  of  these  mechanisms  are 
influenced  in  fundamental  ways  by  genes  of  the 
major  histocompatibility  complex  (MHC).  Our 
laboratory  studies  the  structure  and  molecular  bi- 
ology of  this  important  gene  complex.  These  stud- 
ies are  motivated  by  a  desire  both  to  understand 
the  nature  and  regulation  of  the  genes  in  the  MHC 
and  to  use  the  MHC  as  a  model  system  to  test 
methods  for  analysis  of  large  gene  complexes  in 
general. 

The  MHC  encodes  genes  that  determine 
whether  tissue  grafts  between  unrelated  individ- 
uals are  accepted  or  rejected.  The  products  of 
these  genes  determine  each  individual's  tissue 
type  (also  called  HLA  type).  These  HLA  mole- 
cules also  serve  other  important,  perhaps  more 
biologically  relevant,  functions,  as  they  partici- 
pate in  the  regulation  of  most  immune  reactions. 

The  MHC  comprises  a  cluster  of  at  least  60 
closely  linked  genes,  which  can  be  divided  into 
three  classes  on  the  basis  of  their  structures  and 
their  functions.  Class  I  genes  encode  proteins 


that  are  present  on  the  surfaces  of  all  nucleated 
cells,  where  they  participate  in  a  number  of  im- 
mune responses.  Importantly,  when  a  host  cell  is 
infected  with  a  virus,  fragments  of  viral  proteins 
associate  with  the  class  I  molecules.  This  com- 
plex of  viral  protein  and  class  I  molecule  is  recog- 
nized by  host  cytotoxic  T  lymphocytes,  which 
then  act  to  destroy  the  virally  infected  cell. 

Class  II  MHC  genes  encode  cell  surface  pro- 
teins that  share  some  structural  features  with  the 
class  I  molecules,  indicating  evolution  from  a 
common  ancestral  gene.  Class  II  molecules  are 
present  on  the  surfaces  of  only  a  few  cell  types, 
where  they  also  bind  fragments  of  foreign  anti- 
gens. In  this  case  the  complex  of  foreign  antigen 
and  class  II  molecule  is  recognized  by  host  helper 
T  lymphocytes,  resulting  in  the  initiation  of  an 
active  immune  response  to  those  antigens. 

Class  III  genes  encode  molecules  with  diverse, 
and  in  some  cases  unknown,  functions.  Four  of 
the  class  III  genes  specify  soluble  proteins  that 
circulate  in  the  blood,  where  they  form  part  of 
the  most  primitive  portion  of  the  immune  system, 
the  complement  system.  They  provide  an  initial, 
nonspecific  barrier  to  invading  microorganisms, 
before  a  specific  immune  response  is  developed. 
In  addition  to  the  complement  genes,  this  region 
contains  the  genes  encoding  two  potent,  white 
blood  cell-derived  inflammatory  molecules  (tu- 
mor necrosis  factor  and  lymphotoxin) .  The  class 
III  region  also  contains  several  additional  genes, 
some  with  as  yet  unknown  functions. 

An  important  feature  of  most  of  the  MHC  genes 
is  their  marked  variability  in  structure  between 
different  individuals  in  the  population.  This  indi- 
vidual-to-individual (or  allelic)  variation  confers 
an  immunologic  fingerprint  that  makes  up  each 
individual's  tissue  type  and  is  the  basis  for  recog- 
nition and  rejection  of  foreign  tissue  grafts.  More 
generally,  this  fingerprint  permits  very  sensitive 
discrimination  of  self  from  nonself. 

It  has  been  apparent  for  over  a  decade  that  sus- 
ceptibility to  a  large  number  of  immunologically 
mediated  diseases  is  determined  by  an  individ- 
ual's tissue  type.  In  some  cases,  disease  suscepti- 
bility appears  to  be  conferred  by  an  allelic  variant 
of  a  single  MHC  gene.  In  other  cases,  susceptibil- 
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ity  seems  to  be  conferred  by  a  complete  tissue 
type  (made  up  of  the  full  complement  of  class  I, 
II,  and  III  genes) .  In  most  cases  the  mechanism  by 
which  these  inherited  MHC  genes  affect  disease 
susceptibility  is  not  known.  To  understand  the 
association  of  certain  MHC  alleles  with  disease 
susceptibility,  it  will  be  necessary  to  analyze  the 
molecular  structures  of  complete  MHC  com- 
plexes containing  those  alleles.  Such  a  detailed 
analysis  will  also  be  required  for  an  understand- 
ing of  the  mechanisms  and  forces  that  govern  the 
evolution  of  the  complex.  Because  this  evolution 
determines  our  repertoire  of  immune  responsive- 
ness, understanding  these  evolutionary  forces 
will  be  important  for  understanding  how  we 
adapt  to  a  world  of  constantly  changing  patho- 
gens, from  simple  viruses  to  complex  parasites. 

Our  laboratory  is  defining  the  structure  of  the 
entire  MHC  by  molecular  cloning.  Large  frag- 
ments of  human  chromosomes  are  propagated  in 
yeast  following  fusion  to  simple  yeast  chromo- 
somal elements.  These  recombinant  molecules 
exist  in  yeast  as  independent  chromosomes  (yeast 
artificial  chromosomes  [YACs]) .  The  use  of  these 
YACs  permits  us  to  analyze  DNA  fragments  5-10 


times  larger  than  previously  possible  with  con- 
ventional gene-cloning  methods.  This  is  particu- 
larly important  for  analysis  of  the  structure  of  the 
MHC,  because  of  the  large  size  of  the  complex. 
Our  analysis  has  shown  YAC  cloning  to  be  a  pow- 
erful methodology.  All  but  3  percent  of  the  HLA 
complex  has  been  isolated  in  52  YACs.  These  mo- 
lecular clones  are  stable  and  reproduce  faithfully 
the  structure  of  the  original  human  chromosome 
from  which  they  derive.  Their  structure  can  be 
analyzed  at  high  resolution.  New  DNA  probes  can 
be  rapidly  isolated  from  the  YACs  to  be  used  for 
comparison  of  the  structures  of  normal  and  dis- 
ease-associated MHCs.  We  anticipate  that  they 
will  facilitate  the  identification  of  additional 
genes  within  the  gene  complex.  Analysis  of  such 
genes  may  be  critical  for  an  understanding  of  the 
mechanisms  of  disease  association. 

These  studies  provide  access  to  the  entire 
structure  of  the  MHC.  They  also  provide  an  exam- 
ple of  the  power  of  the  new  YAC-cloning  technol- 
ogy that  may  facilitate  its  general  application  to 
the  mapping  of  the  human  genome  and  to  more 
extensive  analysis  of  other  specific  human  dis- 
ease loci. 
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HORMONES,  key  players  in  the  endocrine  and 
nervous  systems,  are  produced  by  specific 
tissues  in  the  body,  effect  extracellular  communi- 
cation, and  regulate  cellular  function.  Our  stud- 
ies have  focused  on  the  hormonal  regulation  of 
gene  expression  and  on  several  genes  that  encode 
the  hormones  themselves. 

Thyroid  Hormone  Regulation  of  TSH 
Gene  Expression 

Thyrotropin  (TSH)  is  a  polypeptide  hormone 
that  is  produced  and  secreted  by  a  single  cell  type 
in  the  anterior  pituitary  gland.  It  comprises  two 
different  sugar-containing  polypeptide  subunits, 
a  and  |S,  which  are  encoded  by  genes  located  on 
different  chromosomes.  Importantly,  TSH  stimu- 
lates the  thyroid  gland  to  produce  the  thyroid 
hormones,  T3  and  T4.  These  are  modified  amino 
acids  that  regulate  metabolism  and  gene  expres- 
sion in  almost  every  cell  of  the  body.  To  maintain 
a  constant  level  of  T3  and  T4  in  the  bloodstream, 
these  hormones  act  back  on  the  pituitary  to  de- 
crease TSH  production  and  secretion.  Hence  TSH 
and  thyroid  hormones  are  involved  in  a  classic 
negative-feedback  relationship. 

To  understand  further  the  molecular  mecha- 
nisms involved  in  the  negative  regulation  of  TSH 
synthesis  by  thyroid  hormones  at  the  transcrip- 
tion level,  we  have  isolated  and  analyzed  the 
genes  encoding  the  a-  and  jS-subunits  of  TSH  in 
the  rat. 

Thyroid  hormones,  in  general,  act  at  the  cellu- 
lar level  by  entering  the  cell  either  as  T3  or  T4. 
T4  may  be  converted  to  T3  in  certain  tissues,  such 
as  the  pituitary.  T3  then  enters  the  nucleus, 
where  it  interacts  with  the  thyroid  hormone  re- 
ceptor (TR) ,  a  protein  encoded  by  the  proto-on- 
cogene  c-erbA.  The  T3-TR  complex  then  binds 
directly  to  cis-DNA  elements  within  thyroid  hor- 
mone-responsive genes  to  activate  the  appro- 
priate responses. 

Using  deletion-mutation  analyses  of  the  regula- 
tory regions  of  the  subunit  genes,  we  have  local- 
ized putative  thyroid  hormone  response  ele- 
ments (TREs)  to  the  3'  end  of  the  first  exon  and  to 


a  region  just  5'  to  this  exon  in  the  rat  TSH;8  gene, 
and  to  a  proximal  upstream  promoter  region  of 
the  rat  a  gene.  There  is  limited  sequence  similar- 
ity between  these  cis-DNA  elements  and  a  TRE 
defined  in  the  rat  growth  hormone  gene.  Consis- 
tent with  their  roles  as  TREs,  these  DNAs  bind 
TRs.  Thus  our  studies  have  supported  the  hypoth- 
esis that  thyroid  hormones  inhibit  a  and  TSH|8 
gene  expression  by  the  direct  binding  of  a  T3-TR 
complex  to  specific  areas  within  regulatory  re- 
gions of  the  subunit  genes  and  their  resultant  in- 
terference with  transcription. 

Multiple  Thyroid  Hormone  Receptors 

Another  interest  involves  the  elucidation  of  the 
trans-acting  factors  involved  in  thyroid  hormone 
action.  Critically  important  are  several  forms  of 
the  TR.  These  are  encoded  by  two  genes,  a  and  /3, 
each  expressing  at  least  two  related  molecules 
obtained  by  alternative  splicing.  The  rat  TRa 
gene  encodes  a  bona  fide  TR,  TRal ,  and  a  related 
form,  c-erbAa2,  that  is  identical  to  TRal  for  the 
first  370  amino  acids,  which  include  the  DNA- 
binding  domain  and  much  of  the  thyroid  hor- 
mone-binding domain  at  the  carboxyl -terminal 
end.  However,  carboxyl  terminal  to  this  common 
region  is  a  divergent  sequence.  The  c-erbAa2 
form  does  not  bind  thyroid  hormone  or  transacti- 
vate  thyroid  hormone-responsive  genes,  but 
does  bind  to  putative  TREs  in  vitro,  as  shown  by  a 
DNA-binding  assay.  These  and  other  results  sug- 
gest that  the  biologically  active  c-erbAa2  serves 
as  an  inhibitor  of  active  TR  action.  The  rat  TR|8 
gene  encodes  two  functional  TRs,  TRjSl  and 
TR|82.  TR/31  and  TR/32  are  identical,  from  several 
amino  acids  amino  terminal  to  the  DNA-binding 
region  to  the  remainder  of  the  molecule.  How- 
ever, the  amino-terminal  regions  of  the  DNA- 
binding  domain  are  different.  We  have  shown 
that  TR/32  binds  hormone  with  the  same  relative 
affinities  as  the  other  receptors,  binds  DNA, 
and  transactivates  thyroid  hormone-responsive 
genes. 

Surprisingly,  TR/32  is  expressed  only  in  the  pi- 
tuitary gland.  This  is  remarkable  because  of  the 
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major  role  of  thyroid  hormones  in  regulating  im- 
portant hormone  genes  in  the  pituitary,  includ- 
ing TSH.  The  finding  also  stands  in  contrast  to  the 
general  tissue  distribution  of  the  other  receptor 
forms. 

In  addition,  we  wished  to  determine  how  these 
trans-acting  factors,  of  which  TRs  represent  pro- 
totypes, are  themselves  regulated.  We  have 
shown  that  thyroid  hormones  regulate  TR  mRNAs 
in  a  complex  and  a  tissue-specific  manner  in  vivo 
and  in  vitro.  The  TR/32  mRNA  is  the  most  regu- 
lated form:  in  a  24-hour  period,  thyroid  hor- 
mones decrease  TR|82  mRNA  to  less  than  1 0  per- 
cent of  basal  levels  in  the  pituitary  gland.  In 
contrast,  TRal  and  c-erbAa2  mRNAs  are  mod- 
estly suppressed,  while  TRjSl  mRNA  is  not  nega- 
tively regulated  by  thyroid  hormones. 

In  addition,  we  have  observed  an  interesting 
feature  of  the  a  gene — that  the  strand  opposite 
the  one  that  encodes  TRal  and  c-erbAal  (pro- 
duced by  alternative  splicing  of  a  single  a  tran- 
script) encodes  a  novel  member  of  the  c-erbA 
(thyroid  hormone/steroid  hormone)  supergene 
family,  called  Rev  erbAa.  The  predicted  protein 
is  more  related  to  thyroid  hormone  and  retinoic 
acid  than  the  steroid  hormone  receptors.  The 
mRNA  of  Rev  erbAa  shares  a  common  269-base 
pair  exonic  region  with  that  of  c-erbAa2.  This 
unusual  genomic  arrangement,  which  is  con- 
served among  several  mammalian  species,  sug- 


gests possible  interregulation  of  the  TRa  and  Rev- 
erbAa at  the  post-transcriptional  levels. 

Finally,  we  have  recently  demonstrated  the  ex- 
istence of  TRAP  (TR  auxiliary  protein) ,  a  nuclear 
protein  that  can  augment  the  binding  of  TRs  to 
various  TREs.  This  ubiquitous  60-  to  65-kDa  pro- 
tein dimerizes  with  TRs  to  form  a  heterodimer 
and  also  binds  specific  sequences  within  the  TRE. 
The  full  nature  of  TRAP  and  its  role  in  thyroid 
hormone  action  remains  to  be  determined. 

Thus  the  thyroid  hormone  receptor  family  is 
complex.  There  are  at  least  three  biologically  ac- 
tive forms  expressed  in  a  tissue-specific  fashion 
and  another  form  that  may  have  an  important  role 
in  modulating  the  effects  of  the  others.  A  knowl- 
edge of  the  functions  of  these  different  forms  and 
their  interaction  with  other  nuclear  factors  will 
be  critical  for  our  full  understanding  of  thyroid 
hormone  action. 


Summary 

Our  work  has  focused  on  the  molecular  mecha- 
nisms involved  in  the  regulation  of  TSH  gene  ex- 
pression by  thyroid  hormones,  including  the  cis 
elements  and  trans  factors  that  are  involved  in 
this  process.  We  hope  that  our  efforts  will  pro- 
vide insight  into  the  hormonal  regulation  of  gene 
expression  and  cellular  function  in  normal  and 
pathologic  endocrine  states  and  in  cancer. 
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THE  study  of  the  linear  sequence  of  bases  in 
genomic  DNA  and  messenger  RNA  is  steadily 
gaining  momentum  as  a  result  of  the  increasing 
ease  and  advantage  of  using  shared  computer  da- 
tabases to  find  connections  among  distant  con- 
cepts and  distant  biological  systems.  For  exam- 
ple, connections  have  been  found  betw^een 
human  oncogenes  and  yeast  transcription  factors, 
between  differentiation  antigens  and  bacterial 
chaperone  proteins,  and  between  developmental 
regulatory  genes  and  bacterial  DNA-binding 
proteins. 

Unfortunately,  these  database  searches  are  fre- 
quently unsuccessful,  because  not  all  classes  of 
genetic  elements  are  represented.  The  complete 
sequence  of  a  few  small  genomes  should  rectify 
this.  Sequencing  projects  have  begun  for  ge- 
nomes of  various  bacteria,  yeasts,  a  plant,  and  a 
worm  chosen  for  their  well-studied  genetics  and 
their  small  genome  sizes.  Their  genus  names  are 
Mycoplasma,  Mycobacteria,  Escherichia,  Ther- 
mococcus,  Arabidopsis,  and  Caenorhabditis, 
and  they  represent  all  major  branches  of  the  evo- 
lutionary tree.  The  genome  closest  to  completion 
is  that  of  Escherichia,  with  30  percent  of  its  4.7 
million  base  pairs  already  in  the  database, 
through  the  efforts  of  around  2,000  researchers. 

To  improve  the  accuracy  and  efficiency  of 
these  projects,  we  have  developed  new  sequenc- 
ing technologies.  One,  called  multiplex  se- 
quencing, is  a  way  of  keeping  a  large  set  of  DNA 
fragments  as  a  precise  mixture  throughout  most 
of  the  sequencing  steps.  Because  each  mixture 
can  be  sequenced  with  the  same  effort  as  a  single 
sample  in  previous  methods,  more  fragments  can 
be  handled. 

The  mix  is  deciphered  by  strategically  tagging 
the  fragments  at  the  beginning  with  unique  bits 
of  DNA  and  then,  at  the  end,  hybridizing  to  the 
sequencing  reactions  complementary  bits  of  DNA 
that  have  been  spread  out  by  size  and  immobi- 
lized on  large  membranes.  This  method  also  im- 
proves the  accuracy,  since  the  mixtures  contain 
internal  standards  of  known  sequence  that  help 
in  the  computer  analysis  of  the  film  data. 


The  number  of  probings  obtainable  per  mem- 
brane represents  the  increased  efficiency  factor- 
of  this  method.  This  number  exceeds  50  now 
(the  higher  the  better)  and  is  likely  to  increase. 
We  have  designed  and  tested  devices  to  facilitate 
most  of  the  steps  in  multiplex  sequencing,  in- 
cluding DNA  preparation,  sequencing  reactions, 
gel  loading,  hybridization,  film  exposure,  and 
film  reading.  All  of  these  procedures  have  been 
applied  to  collect  over  1  million  bases  of  raw  data 
and  are  undergoing  further  development.  Chemi- 
luminescent  detection  of  the  multiplex  sequence 
images  is  showing  promise  as  an  effective  replace- 
ment for  the  radioactivity  normally  used.  To  fill 
in  the  inevitable  last  gaps  in  the  sequences,  we 
are  exploring  several  approaches,  including  di- 
rect genomic  sequencing  (without  cloning)  from 
the  edges  of  the  gaps  and  isolation  of  gap-span- 
ning clones  by  clone  hybridization. 

To  make  the  extensive  DNA  sequences  even 
more  useful  to  biological  searches,  encoded  pro- 
teins must  be  found  and  collected  into  families 
based  on  their  interactions  or  distant  relatives. 
For  example,  we  have  found  matches  for  about 
half  of  the  genes  required  by  Escherichia  coli  for 
vitamin  biosynthesis  and  have  gathered  these 
into  families  of  proteins  involved  in  membrane 
transport,  heme  and  corrin  ring  methylation, 
amine  group  transfer,  and  so  on. 

As  another  example,  we  have  searched  for  the 
class  of  the  most  abundant  cellular  proteins, 
which  have  nonetheless  eluded  the  extensive  bio- 
chemical and  genetics  studies  of  E.  coli.  This  has 
been  done  by  systematically  correlating  amino- 
terminal  protein  sequence  data  obtained  from 
two-dimensional  gel  spots  with  the  DNA  se- 
quence and  two-dimensional  gel  databases.  Of 
1 30  unique  sequences  determined  so  far,  over  40 
are  candidates  for  such  major  novel  proteins. 

We  are  also  extending  our  methods  for  detect- 
ing in  vivo  molecular  interactions  by  analyzing 
protection  of  individual  bases  from  chemical  and 
enzymatic  methylation.  This  in  vivo  footprinting 
method  has  been  extended  to  a  cloning-based  as- 
say for  DNA-protein  interactions. 

In  the  future,  with  new  sequencing  technolo- 
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gies  such  as  automated  multiplex  sequencing, 
with  examples  of  most  of  the  basic  genetic  mod- 
ules, and  with  an  eye  to  sequence  elements  con- 


served among  species,  the  analysis  by  investiga- 
tors worldwide  of  human  sequences  and  genetics 
should  become  more  manageable. 
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Computer- generated  false- color  image  of  a  Drosophila  leg  imaginal  disc.  The  disc  was  la- 
beled to  visualize  the  mRNA  transcript  of  the  Distal-less  gene.  A  digitized  image  of  a  labeled 
disc  was  prepared  and  processed  to  generate  an  intensity  "map"  so  that  different  colors 
represent  regions  of  different  staining  intensity  in  the  original  sample.  The  black  region 
(rimmed  in  purple )  represents  the  highest  RNA  concentration.  RNA  decreases  across  the  disc 
through  several  color  steps  (blue,  green,  yellow,  red,  pink ),  which  indicate  progressively 
lower  concentrations.  The  point  of  this  manipulation  is  to  demonstrate  that  a  gradient  of  the 
Distal-less  mRNA  exists  in  the  disc.  The  use  of  false-color  intensity  mapping  highlights  the 
idea  that  different  spatial  domains  could  in  principle  be  organized  or  specified  by  gradients 
of  gene  product  intensity. 
Research  of  Stephen  M.  Cohen. 
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ONE  of  the  most  striking  features  of  early  em- 
bryogenesis  is  the  rapidity  and  reproducibil- 
ity with  which  the  egg  organizes  the  embryonic 
body  plan.  The  process  by  which  cells  in  the  em- 
bryo acquire  distinct  identities  is  known  as  pat- 
tern formation.  The  concepts  are  as  old  as  de- 
scriptive embryology,  and  the  basic  problems  are 
the  same  in  all  animal  and  plant  embryos.  In 
order  to  generate  an  embryo  with  an  integrated, 
coherent  body  pattern,  cells  need  to  adopt  devel- 
opmental programs  compatible  with  those  of 
their  neighbors.  We  are  interested  in  understand- 
ing how  the  embryo  can  orchestrate  its  genetic 
information  to  specify  the  identities  of  groups  of 
cells  within  the  context  of  a  larger  body  pattern. 
To  address  these  questions,  we  are  analyzing  the 
genetic  control  of  limb  development  in  the  fruit 
fly  Drosophila. 

We  would  like  to  understand  the  genetic  and 
molecular  mechanisms  that  the  fly  embryo  uses 
to  determine  where  the  limbs  will  develop  with 
respect  to  the  rest  of  the  body  pattern.  How  is  the 
position  identified?  How  do  cells  that  occupy  the 
correct  position  become  committed  to  develop 
as  limb  precursors?  Once  the  limb  primordium  is 
established,  how  do  cells  get  the  information  that 
tells  them  what  part  of  the  limb  to  make? 


Specification  of  Limb  Cell  Identity 

Using  classical  genetics,  we  have  identified  a 
gene  named  Distal-less  that  plays  an  important 
role  in  limb  development.  Limbs  do  not  develop 
in  embryos  that  lack  Distal-less  gene  function. 
This  observation  suggests  that  Distal-less  activity 
may  play  an  essential  early  role  in  limb  develop- 
ment, but  it  does  not  tell  us  what  the  nature  of 
that  role  might  be.  We  get  an  additional  hint  from 
examining  genetically  mosaic  animals  in  which 
Distal-less  activity  is  removed  from  a  group  (or 
clone)  of  cells  that  were  already  committed  to 
develop  into  limb  structures.  These  cells  shift 
their  program  and  develop  into  body  wall.  Dis- 
tal-less, then,  seems  to  function  as  a  developmen- 
tal switch  that  is  important  for  telling  a  cell  that  it 
is  a  limb  precursor.  Distal-less  may  also  define 


and  maintain  the  identity  of  the  presumptive 
limb  cells. 

How  might  Distal-less  afi'ect  cell  identity?  To 
approach  an  answer  in  molecular  terms,  we  have 
cloned  and  characterized  the  gene.  Distal-less  is 
a  member  of  a  growing  family  of  important  regu- 
latory genes  that  encode  a  sequence-specific 
DNA-binding  motif  called  the  homeodomain.  A 
number  of  Drosophila  homeodomain-containing 
genes  have  been  shown  to  control  important  de- 
velopmental decisions  by  regulating  the  expres- 
sion of  other  genes.  We  hope  to  identify  the  tran- 
scriptional targets  through  which  Distal-less 
exerts  its  regulatory  functions. 

How  Does  the  Embryo  Position  Its  Limbs? 

Using  the  Distal-less  gene  as  a  molecular 
marker,  we  have  been  able  to  visualize  popula- 
tions of  limb  precursor  cells  in  the  embryo  well 
before  any  overt  sign  of  limb  development  be- 
gins. These  small  clusters  of  cells,  placed  at  well- 
defined  sites,  are  called  the  limb  primordia. 
Clearly,  to  define  these  points,  the  embryo  must 
use  some  preexisting  source  of  information, 
which  leads  to  the  spatially  localized  activation 
of  Distal-less.  A  number  of  observations  impli- 
cate the  wingless  gene,  without  which  (in  mu- 
tant embryos)  the  limb  primordia  do  not  form. 
The  gene  is  expressed  in  a  reiterated  series  of 
well-defined  stripes  perpendicular  to  the  ante- 
rior-posterior axis  of  the  embryo.  These  and  other 
observations  suggest  that  the  ivingless  gene  pro- 
vides a  positional  cue  by  which  limb  formation  is 
oriented. 

If  wingless  alone  sufficed  to  define  the  limb 
primordia,  we  would  expect  the  limbs  to  begin 
development  as  stripes  of  cells  expressing  Distal- 
less.  The  limb  primordia,  however,  arise  as  spots, 
not  as  stripes.  One  way  to  limit  the  limb  primor- 
dia to  spots  would  be  to  require  interaction  be- 
tween the  stripes  of  cells  expressing  wingless 
and  an  intersecting  longitudinal  stripe  of  cells 
providing  a  different  type  of  information.  The 
wingless  gene  encodes  a  secreted  growth-factor- 
like molecule  homologous  to  the  INT-1  onco- 
gene of  mammals.  A  longitudinal  stripe  of  cells 
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expressing  a  receptor  for  the  wingless  signal 
would  allow  for  specification  of  clusters  of  cells 
at  unique  points  with  respect  to  the  body  pattern. 

Distal-less-Dcpcndctit  Genes  Involved  in 
Limb  Development 

One  approach  to  identifying  genes  that  act 
downstream  of  Distal-less  in  organizing  the  limb 
involves  looking  for  other  genes  expressed  in  the 
embryonic  limb  primordia.  We  would  expect  a 
downstream  target  gene  to  depend  on  Distal-less 
gene  function  in  order  to  be  expressed  in  the 
limb  primordia.  We  have  begun  to  screen 
through  the  genome,  using  a  method  for  visualiz- 
ing the  pattern  of  gene  expression  rather  than  for 
destroying  the  genes  by  mutation.  Although  this 
work  is  at  an  early  stage,  we  have  identified  a 
gene,  named  disconnected,  that  is  expressed  in 
the  embryonic  limb  primordia  and  that  depends 
on  Distal-less. 

Proximal-Distal  Pattern  Formation 

As  mentioned  above,  flies  that  lack  Distal-less 
gene  activity  do  not  develop  limbs.  Flies  in  which 
Distal-less  gene  activity  is  merely  impaired  de- 
velop limbs  that  are  foreshortened  along  their 
proximal-distal  axis.  A  range  of  Distal-less  muta- 
tions of  varying  severity  have  been  isolated  that 
cause  a  graded  series  of  defects  in  limb  develop- 
ment. The  characteristics  of  different  mutations 
tell  us  that  the  amount  of  activity  of  the  gene  is 
somehow  important  in  controlling  the  range  of 
structures  that  the  limb  can  develop.  Distal  parts 
of  the  limb  require  more  Distal-less  gene  activity 
than  proximal  parts  in  order  to  develop  normally. 
These  observations  suggest  that  the  gene  may 
play  an  important  role  in  organizing  the  proxi- 
mal-distal axis  of  the  limbs. 

In  order  to  understand  how  Distal- less  exerts 
this  function,  we  have  begun  to  examine  the  spa- 
tial distribution  of  the  gene  product  in  develop- 
ing limbs  of  the  adult  fly.  Distal-less  RNA  is  ex- 
pressed in  a  concentration  gradient  across  the 
developing  limb.  The  distal-most  region  of  the 
limb  expresses  a  high  level,  intermediate  regions 


express  lower  levels,  and  proximal  regions  ex- 
press little  or  none.  These  observations  are  partic- 
ularly intriguing  in  that  we  have  observed  a  dif- 
ferential requirement  for  the  activity  of  the  gene 
in  different  regions  of  the  leg  primordium.  Distal 
regions  require  more  activity  and  express  more 
transcript.  We  are  trying  to  define  better  the  char- 
acteristics of  this  gradient  and  hope  to  determine 
whether  the  observed  graded  distribution  of  the 
gene  product  is  contributing  meaningful  infor- 
mation to  the  organization  of  the  proximal-distal 
pattern  of  the  developing  limb. 

To  assess  the  role  of  Distal-less  in  organizing 
pattern  in  the  developing  adult  limb,  we  have 
sought  genes  that  might  act  downstream  of  Dis- 
tal-less in  this  process.  The  Abnormal  leg  pat- 
tern (Alp)  gene  was  identified  as  a  prospective 
partner  on  the  basis  of  a  specific  phenotypic  in- 
teraction with  Distal-less.  The  severity  of  the  leg 
defects  caused  by  dominant  mutations  of  Alp  is 
sensitive  to  changes  in  the  amount  of  Distal-less 
gene  activity.  During  the  past  year,  we  have  gen- 
erated mutants  that  eliminate  Alp  function  and 
have  cloned  and  characterized  the  gene.  It  is  ex- 
pressed in  a  narrow  ring  of  cells  that  encircles  the 
developing  limb.  This  ring  can  be  taken  as  a 
marker  for  a  defined  cell  identity  along  the  proxi- 
mal-distal axis  of  the  leg.  Expression  of  Alp  in  this 
ring  depends  on  the  Distal-less  gene. 

We  suggest  that  understanding  pattern  forma- 
tion in  the  developing  limb  of  the  fly  will  teach 
us  about  fundamental  mechanisms  that  may  be 
important  during  development  of  vertebrate  em- 
bryos. The  system  of  pattern  formation  that  we 
are  studying  differs  from  early  pattern  formation 
in  the  precellular  fly  embryo  in  that  pattern  is 
being  organized  in  a  field  of  cells,  rather  than  in 
an  array  of  nuclei  sharing  a  common  cytoplasm. 
Cells  in  the  developing  limb  must  define  their 
own  identities  and  communicate  this  information 
to  their  neighbors.  In  this  context  we  are  particu- 
larly intrigued  by  the  role  that  Distal- less  plays  as 
a  pattern  organizer  at  the  genetic  level  and  by  its 
prospective  role  as  a  regulator  of  gene  expression 
at  the  molecular  level. 
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THE  theme  of  our  laboratory  is  the  study  of 
human  genetic  disease  at  the  molecular 
level.  Our  goal  is  to  identify  genes  involved  in 
specific  genetic  disorders,  to  define  their  struc- 
ture and  function,  to  understand  the  control  of 
their  expression,  and  to  use  this  information  to 
develop  potential  nev^  therapies. 

The  most  recent  edition  of  Mendelian  Inheri- 
tance in  Man  lists  over  4,000  genetic  disorders. 
In  the  majority  of  these,  the  normal  function  of 
the  gene  involved  is  not  known.  The  identifica- 
tion of  disease-causing  genes  without  knowledge 
of  their  protein  product  or  its  normal  role  is  a 
major  endeavor  in  our  research. 

Only  recently  has  it  become  possible  to  iden- 
tify such  genes,  and  the  process  is  still  laborious. 
First  the  gene  must  be  mapped  to  a  specific  hu- 
man chromosome,  using  a  process  known  as  link- 
age analysis.  This  involves  identification  of  fami- 
lies with  the  disorder  and  analysis  of  DNA  from 
these  families  with  a  panel  of  probes  from  all 
parts  of  the  human  genome.  The  probes  are  used 
to  establish  the  chromosomal  location  of  a  DNA 
sequence  that  may  have  been  inherited  in  associa- 
tion with  a  disease  gene  that  will  probably  lie 
close  to  it.  Such  sequences,  or  "markers,"  can 
pinpoint  the  chromosome  on  which  the  disease 
gene  resides.  Additional  probes  from  that  chro- 
mosome can  then  be  tested  to  identify  markers 
that  are  even  closer.  It  is  often  possible  to  narrow 
the  responsible  region  to  about  1  percent  of  a 
particular  chromosome.  Although  this  is  a  major 
achievement,  such  a  region  may  yet  contain  30  to 
50  genes,  only  one  of  which  is  responsible  for  the 
disease.  Thus  additional  refinements  must  be 
made  before  candidates  for  the  responsible  gene 
can  be  identified. 

A  major  development  in  the  past  year  has  been 
the  successful  cloning  of  the  gene  for  von  Reck- 
linghausen neurofibromatosis  (NFl),  a  common 
genetic  disorder  (sometimes  incorrectly  referred 
to  as  the  Elephant  Man  disease) .  Using  techniques 
of  chromosome  jumping  and  yeast  artificial  chro- 
mosomes, we  were  able  to  identify  on  chromo- 
some 17  a  large  gene  that  harbors  unmistakable 
mutations  in  patients  with  NFl.  Analysis  of  the 


gene  reveals  that  it  is  similar  in  interesting  ways 
to  other  genes  in  humans  (and  even  in  yeast) .  It 
probably  participates  in  the  suppression  of  unreg- 
ulated cell  growth  by  interaction  with  another 
class  of  cancer-related  genes  (the  ras  genes). 
These  developments  will  make  it  possible  to  de- 
fine in  precise  terms  the  basic  defect  in  NFl.  In 
the  relatively  near  future,  this  gene  discovery  will 
lead  to  improvements  in  diagnosis;  in  the  longer 
term  it  offers  the  promise  of  improved  therapy  for 
this  devastating  disease. 

In  the  previous  year,  as  part  of  a  collaborative 
effort  with  investigators  at  the  Hospital  for  Sick 
Children  in  Toronto,  we  were  successful  in  iden- 
tifying the  gene  for  cystic  fibrosis  (CF),  a  com- 
mon severe  genetic  disease  characterized  by  lung 
infections,  pancreatic  insufficiency,  and  an  ele- 
vation in  sweat  chloride.  Additional  investiga- 
tions of  this  gene,  denoted  CFTR  (cystic  fibrosis 
transmembrane  conductance  regulator) ,  has  pro- 
ceeded on  many  fronts  this  year. 

WTiile  one  mutation  (called  AF508)  is  respon- 
sible for  about  75  percent  of  CF  chromosomes  in 
the  United  States,  we  and  others  have  identified 
numerous  mutations  in  the  CFTR  gene  in  other 
individuals  with  the  disease.  We  are  also  studying 
the  regulation  of  the  gene — trying  to  determine 
why  it  is  expressed  in  some  tissues  (such  as  lungs 
and  pancreas)  but  not  at  all  in  others  (such  as 
brain) .  It  also  appears  that  the  gene  is  capable  of 
producing  more  than  one  protein  by  a  phenome- 
non referred  to  as  alternative  splicing. 

Importantly,  we  have  succeeded  in  transferring 
a  normal  version  of  the  CFTR  gene  into  CF  cells 
growing  in  laboratory  culture,  and  have  shown 
that  the  defect  is  corrected.  These  results, 
achieved  in  collaboration  with  James  Wilson 
(HHMI,  University  of  Michigan)  and  Ray  Frizzell 
(University  of  Alabama) ,  can  be  thought  of  as  the 
first  step  toward  a  gene  therapy  for  CF. 

The  Huntington  disease  (HD)  gene  has  been 
localized  by  a  collaborative  group,  which  in- 
cludes our  laboratory.  It  is  now  known  to  lie  in  a 
region  near  the  tip  of  the  short  arm  of  chromo- 
some 4,  within  an  interval  of  approximately  2 
million  base  pairs.  Using  chromosome  jumping 
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and  yeast  artificial  chromosome  cloning  from 
probes  known  to  map  to  this  interval,  we  have 
nearly  completed  an  overlapping  set  of  clones  of 
the  region  and  are  beginning  to  search  for  candi- 
date genes. 

A  final  project  is  an  investigation  of  the  switch 
in  production  from  fetal  to  adult  hemoglobin  in 
the  human.  In  this  precisely  controlled  phenome- 
non, fetal  hemoglobin  genes  turn  off  and  adult 
hemoglobin  genes  turn  on  at  about  the  time  of 
birth.  This  phenomenon  is  important  for  two  rea- 
sons. First,  it  is  a  model  for  developmental  con- 
trol of  gene  expression.  Second,  turning  the  fetal 
genes  back  on  in  an  adult  with  sickle  cell  anemia 


or  thalassemia  would  be  likely  to  cure  these 
blood  diseases,  which  afflict  hundreds  of  thou- 
sands of  persons  worldwide. 

In  attempting  to  understand  this  complex  pro- 
cess, we  have  identified  a  series  of  mutations  in 
the  fetal  hemoglobin  genes  responsible  for  fail- 
ure to  turn  these  genes  off  completely  during 
adult  life,  which  leads  to  a  benign  condition 
called  hereditary  persistence  of  fetal  hemoglobin 
(HPFH).  These  mutations  mark  DNA  sequences 
important  in  the  developmental  switch.  We  are 
characterizing  proteins  from  the  nucleus  that 
bind  to  these  sequences  and  control  the  expres- 
sion of  the  hemoglobin  genes. 
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INFORMATION  obtained  from  studies  of  im- 
mune system  development  in  a  variety  of  verte- 
brate species  is  used  to  explore  diseases  of  the 
immune  system  in  humans.  We  are  particularly 
interested  in  the  pathogenesis  of  immunodefi- 
ciency diseases  and  lymphoid  malignancies. 

Comparative  studies  in  birds  and  mammals  ini- 
tially revealed  the  separate  developmental  path- 
ways of  T  and  B  lymphocytes,  the  rwo  major  types 
of  immunocompetent  cells.  T  cells  provide  help 
for  antibody-producing  B  cells  and  are  primarily 
responsible  for  immunity  against  viruses  and 
fungi. 

The  thymus  is  the  source  of  T  cells  in  all  verte- 
brates, but  the  central  tissue  for  B  cell  production 
may  vary.  In  birds  B  cells  are  derived  from  the 
hindgut  bursa  of  Fabricius,  whereas  in  mammals 
they  are  generated  in  blood-forming  organs,  first 
the  fetal  liver  and  then  the  bone  marrow.  Multi- 
potent  stem  cells  in  these  hematopoietic  tissues 
serve  as  the  precursors  of  T  cells,  B  cells,  and 
other  types  of  blood  cells. 

Inherited  or  acquired  gene  defects  may  specifi- 
cally alter  growth  or  maturation  of  these  cell 
lines  to  cause  immune  system  dysfunction  or 
malignancy. 

Genes  encoding  the  T  cell  antigen  receptors 
(TCR)  and  antibodies  undergo  controlled  rear- 
rangement and  expression  in  lymphoid  cells 
beginning  their  development  along  T  or  B  cell 
lineages.  Other  genes  encode  growth-  and  differ- 
entiation-promoting proteins  and  their  receptors 
on  T  and  B  cells.  An  elaborate  developmental  pro- 
gram is  thus  responsible  for  the  generation  of 
millions  of  lymphocyte  clones,  each  expressing  a 
TCR  or  antibody  molecule  of  different  antigen 
specificity.  These  newly  formed  T  and  B  cells  are 
seeded  via  the  bloodstream  from  the  thymus  or 
bone  marrow  to  the  spleen  and  other  peripheral 
tissues,  where  they  execute  immune  surveillance 
of  foreign  and  self  antigens. 

T  Cell  Development 

We  have  embarked  on  a  comparative  analysis  of 
T  cell  development  in  representative  avian,  am- 
phibian, and  mammalian  species,  prompted  by 


an  interest  in  the  evolutionary  strategy  for  gener- 
ating T  cells  that  can  discriminate  between  self 
and  nonself  and  in  seeking  fresh  clues  to  some  of 
the  unresolved  mysteries  of  the  human  immune 
system. 

Our  studies  in  birds  reveal  remarkable  conser- 
vation of  the  pattern  of  T  cell  development  found 
in  mammals,  including  the  sequential  develop- 
ment of  T  cells  bearing  TCR  of  either  76  or  a/3 
isotypes.  While  the  functional  role  of  those  ex- 
pressing 76  TCR  is  still  enigmatic,  they  are  always 
generated  first  during  ontogeny,  may  not  undergo 
clonal  selection  during  their  intrathymic  develop- 
ment, migrate  preferentially  to  red  pulp  areas  of 
the  spleen  and  the  epithelial  lining  of  intestines, 
and  constitute  approximately  one-third  of  the  T 
cell  pool  in  adult  birds. 

The  avian  76  T  cells  (also  called  TCRl  cells) 
are  capable  of  killing  other  cells,  but  unlike  the 
a/3  T  cells,  are  rarely  triggered  by  recognition  of 
conventional  class  II  molecules  of  the  major  his- 
tocompatibility complex  (MHC).  Study  of  their 
physiological  role  is  facilitated  in  birds  by  their 
relative  abundance,  and  we  hope  to  exploit  the 
experimental  ability  to  inhibit  their  development 
in  a  selective  fashion. 

A  fascinating  aspect  of  the  avian  a/3  T  cells  is 
their  development  along  two  discrete  sublin- 
eages  identifiable  by  the  TCR2  and  TCR3  mono- 
clonal antibodies.  In  collaborative  studies  with 
Craig  Thompson  and  his  colleagues  (HHMI,  Uni- 
versity of  Michigan),  TCR2+  and  TCR3'  cells 
have  been  found  to  utilize  different  families  of 
TCRjS  variable-region  genes  (V/8):  TCR2  cells  use 
the  V|8l  genes,  and  the  TCR3  cells  use  Vi82  genes. 
The  avian  \fi  gene  locus  is  thus  simpler  than  the 
mammalian  Vj8  locus,  which  contains  many  Vj8 
gene  families. 

The  avian  V/31  and  V/32  genes  exhibit  highly 
conserved  sequences  that  characterize  two  major 
subgroups  of  the  mammalian  V|8  genes.  Birds  may 
thus  provide  a  useful  model  system  to  study  the 
functional  significance  of  the  generic  V|S1  and 
V|82  gene  families.  Avian  TCR(S  diversity  is  cre- 
ated largely  by  nucleotide  sequence  variations  in 
the  joints  berween  the  rearranged  Vj8,  D  (diver- 
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sity),  and  J  (joining)  genes.  During  development, 
the  avian  V|Sl  genes  are  rearranged  before  the 
V/32  genes,  and  T  cells  using  the  V/31  gene  family 
preferentially  migrate  to  the  intestines.  Notably 
also,  the  relative  frequencies  of  alloreactive 
TCR2  and  TCR3  cells  var\^  according  to  the  MHC 
class  II  gene  combination  in  a  graft-versus-host 
assay. 

Studies  of  T  cell  development  in  the  South  Mri- 
can  frog  Xenopus  laevis  are  also  beginning  to  re- 
veal interesting  information.  Monoclonal  antibod- 
ies have  been  made  that  identify  76  and  a/3  TCR 
candidates  expressed  as  dimeric  units  by  separate 
lymphocyte  subpopulations  in  the  frog  thymus, 
spleen,  and  intestine.  The  frog  T  cells  that  bear 
the  putative  76  and  a/3  TCR  homologues  exhibit 
specialized  tissue  localization  patterns  virtually 
identical  to  those  defined  for  their  avian  and 
mammalian  counterparts.  Present  eflfons  are  di- 
rected toward  identification  of  the  frog  TCR 
genes  and  exploitation  of  the  advantages  of  this 
amphibian  developmental  model. 

B  Cell  Development 

B  cell  precursors  are  identifiable  in  the  bone 
marrov^  as  lymphocytes  that  contain  hea\y 
chains  in  their  cytoplasm.  Lacking  conventional 


light  chains  that  are  needed  for  surface  expres- 
sion of  immunoglobulins,  pre-B  cells  have  been 
considered  "blind"  to  environmental  antigens. 
This  \iew  has  been  challenged  by  the  recent  dis- 
covery of  genes  called  X5  and  V  pre-B  that  encode 
a  surrogate  light-chain  complex  apparently  ex- 
pressed together  with  /Lt-chains  on  leukemic  pre- 
B  cells.  We  have  identified  a  subpopulation  of 
normal  pre-B  cells  that  express  the  surrogate 
light-chain/ju-chain  receptor  complex  on  their 
surface.  The  data  imply  that  pre-B  cells  are  vul- 
nerable to  clonal  selection.  Current  experiments 
are  directed  toward  identification  of  the  natural 
ligands  for  these  unusual  receptors  and  character- 
ization of  the  cellular  response. 

More  than  half  of  the  antibodies  that  we  pro- 
duce daily  are  of  IgA  isotype,  and  many  of  these 
are  transported  across  mucosal  surfaces  of  the 
body  for  protective  immunity.  We  have  identified 
a  receptor  for  IgA  antibodies  on  phagocytic  white 
blood  cells  and  have  characterized  its  biochemi- 
cal nature.  Monoclonal  antibodies  made  against 
these  IgA  receptors  are  being  used  to  define  their 
precise  tissue  distribution  and  for  functional  stud- 
ies. These  studies  focus  attention  on  IgA  anti- 
body-mediated phagocytosis  as  a  potentially  im- 
portant defense  mechanism. 
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THE  first  step  in  the  transfer  of  genetic  infor- 
mation from  DNA  to  cell  components  is  the 
synthesis  of  an  RNA  copy  of  the  gene.  This  pro- 
cess is  termed  transcription,  and  the  RNA  synthe- 
sized is  called  a  messenger  RNA  (mRNA) .  The  pro- 
cess can  be  thought  of  as  making  a  copy  of  part  of 
the  architectural  plan  for  the  construction  of  a 
building.  The  synthesis  of  each  protein  compo- 
nent of  the  cell  is  directed  by  a  distinct  mRNA 
molecule,  and  these  mRNA  molecules  are  all  syn- 
thesized by  an  enzyme  called  RNA  polymerase  II. 
This  enzyme  recognizes  each  gene  and  synthe- 
sizes the  appropriate  amount  of  mRNA  at  the  re- 
quired time. 

RNA  polymerase  II  contains  more  than  1 0  dif- 
ferent proteins  whose  precise  functions  are 
largely  unknown.  Our  goal  is  to  understand  the 
structure  and  function  of  the  subunits  of  RNA 
polymerase  II.  Six  years  ago  my  laboratory  began 
to  analyze  the  largest  subunit  of  the  mouse  RNA 
polymerase  II  complex.  This  subunit  comprises 
one-half  of  the  mass  of  the  enzyme  and,  through 
the  work  of  many  laboratories,  is  known  to  be 
involved  in  the  enzymatic  synthesis  of  RNA.  We 
have  cloned  and  sequenced  the  mouse  gene  en- 
coding this  largest  subunit  and  have  also  isolated 
and  characterized  several  mutations  in  this  gene. 
These  mutant  genes  are  being  used  to  study  the 
function  of  different  domains  of  the  largest  sub- 
unit.  The  aim  of  these  experiments  is  to  under- 
stand how  RNA  polymerase  II  orchestrates  the  or- 
dered expression  of  100,000  genes  during  the 
vertebrate  life  cycle. 

The  gene  encoding  the  largest  subunit  of  RNA 
polymerase  II  comprises  28  segments  (exons) 
that  cover  30,000  bases  of  mouse  chromosomal 
DNA  near  the  center  of  chromosome  1 1 .  The 
amino  acid  sequence  deduced  from  the  DNA  se- 
quence has  revealed  two  interesting  properties  of 
the  subunit.  The  major  portion  of  the  protein  is 
similar  in  sequence  to  a  bacterial  RNA  polymer- 
ase subunit  (from  Escherichia  coli)  that  carries 
out  an  equivalent  function.  This  evolutionary 
conservation  is  much  stronger  than  had  been  ex- 
pected and  has  allowed  us  to  predict  that  the 
mouse  subunit  is  involved  in  the  transcription 
elongation  process. 


Although  the  major  part  of  the  largest  subunit  is 
related  to  the  bacterial  enzyme,  our  DNA  se- 
quence analysis  has  also  revealed  a  domain  that  is 
unique  to  RNA  polymerase  II.  This  domain  is  lo- 
cated at  one  end  of  the  molecule  and  constitutes  a 
5 2 -fold  repeat  of  a  seven-amino  acid  sequence. 
This  unusual  sequence,  while  absent  in  bacteria, 
is  found  in  the  large  subunits  of  virtually  every 
RNA  polymerase  II,  including  those  of  animals, 
plants,  insects,  and  protists.  We  are  currently  fo- 
cusing our  efforts  on  understanding  the  role  of 
this  domain  in  the  process  of  transcription. 

Our  genetic  approach  to  the  function  of  this 
carboxyl-terminal  domain  (CTD)  has  grown  from 
analysis  of  mutations  in  the  largest  subunit  gene. 
We  first  isolated  mutant  mouse  tissue-culture  cell 
lines  that  are  resistant  to  the  mushroom  toxin  a- 
amanitin.  The  largest  subunit  genes  from  several 
of  these  cell  lines  have  been  cloned,  and,  by  rein- 
troduction  into  amanitin-sensitive  cells,  have 
been  shown  to  confer  resistance  to  a-amanitin. 
We  have  used  this  gene  transfer  technique  to 
map  the  mutations  responsible  for  amanitin 
resistance. 

The  availability  of  a  well-defined,  selectable  ge- 
netic marker  in  the  largest  subunit  gene  has 
proved  useful  in  the  analysis  of  the  role  of  the 
CTD.  Deletion,  insertion,  and  substitution  muta- 
tions have  been  created  in  the  CTD  of  an  amani- 
tin-resistance  gene.  The  effect  of  these  mutations 
has  been  tested  by  introducing  the  altered  resis- 
tance genes  into  cells  and  scoring  for  amanitin 
resistance.  Removing  the  entire  CTD  eliminates 
the  ability  to  transfer  amanitin  resistance,  demon- 
strating that  the  CTD  plays  an  essential  role  in 
transcription.  We  have  also  shown  that  up  to  20 
of  the  52  repeats  are  dispensable  for  growth  in 
tissue  culture,  indicating  either  that  the  CTD  is 
functionally  redundant  or  that  dispensable  re- 
peats are  only  necessary  in  some  cell  types. 

We  have  also  been  examining  postsynthetic 
modifications  of  the  CTD.  This  domain  is  rich 
in  amino  acids  (such  as  serine,  threonine,  and 
tyrosine)  that  can  be  modified  by  addition  of 
phosphate  groups.  RNA  polymerase  II  is  a  phos- 
phorylated  enzyme,  but  no  function  for  phos- 
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phorylation  is  known.  We  have  used  synthetic 
CTD  peptide  repeats  as  substrates  to  identify  en- 
zymes (CTD  kinases)  that  carry  out  this  phos- 
phorylation. Two  CTD  kinases  have  been  iso- 
lated, each  containing  protein  kinase  catalytic 
subunits  previously  identified  in  yeast  as  cell- 
division  control  proteins.  The  identification  of  a 
cell  cycle  control  protein  as  a  component  of  an 
enzyme  that  modifies  RNA  polymerase  suggests 
that  cell  growth  may  be  controlled,  at  least  in 
part,  through  phosphorylation  of  the  CTD. 

Using  the  purified  CTD  kinases,  we  have 
mapped  the  sites  of  phosphorylation  on  the  CTD. 
These  sites,  serines  that  precede  proline  residues, 
correspond  to  sites  in  other  proteins  that  are  mod- 
ified in  a  cell  cycle-dependent  fashion.  Current 
studies  are  directed  toward  examining  the  level 
of  CTD  phosphorylation  at  different  times  in  the 
cell  cycle.  One  consequence  of  CTD  phosphory- 


lation is  that  the  CTD  becomes  greatly  extended. 
The  basis  of  this  conformational  change  is  also  an 
area  of  investigation  in  our  laboratory. 

To  define  more  precisely  the  amino  acid  se- 
quences required  for  CTD  function,  we  have  de- 
vised a  strategy  to  clone  synthetic  CTD  repeats. 
This  work  is  being  done  in  the  yeast  Saccharo- 
myces  cerevesiae,  where  as  few  as  1 0  of  the  re- 
peats are  sufficient  for  viability.  We  have  con- 
structed mutant  CTDs  in  which  the  residues 
identified  as  phosphorylation  sites  have  been 
changed.  In  most  cases  these  mutations  are  lethal, 
indicating  that  phosphorylation  is  essential  for 
RNA  polymerase  II  function.  We  are  currently  us- 
ing yeast  genetics  to  isolate  suppressors  of  these 
mutations,  in  an  effort  to  identify  proteins  that 
interact  with  the  CTD.  Through  characterization 
of  such  proteins  we  hope  to  determine  the  func- 
tion of  the  CTD  in  RNA  polymerase  II  function. 
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OUR  laboratory  is  interested  in  how  protein 
channels  in  cell  membranes  mediate  the 
electrical  activity  of  the  brain.  Such  channels, 
which  open  and  close  to  regulate  the  flow  of  mi- 
nute amounts  of  electrical  current  into  a  cell,  are 
intimately  involved  in  the  brain's  information 
processing.  They  are  important  in  detecting  sen- 
sory signals  such  as  light  and  sound,  in  the  trans- 
mission of  this  information  from  the  sense  organ 
to  the  brain,  and  in  communication  from  one 
brain  cell  to  another.  We  are  focusing  primarily 
on  ion  channels  in  the  sensory  receptor  cells  of 
the  inner  ear,  especially  on  the  mechanism  of  ac- 
tivation of  the  channels  that  detect  sound. 

Channels  that  have  been  studied  in  other  tis- 
sues are  of  two  broad  classes:  those  that  are  acti- 
vated by  the  voltage  across  the  cell  membrane, 
and  those  that  are  activated  by  some  chemical 
either  inside  or  outside  the  cell.  The  channels 
that  we  study  are  part  of  a  novel  third  class.  They 
are  directly  activated  by  a  mechanical  stress  on 
the  channel  protein,  in  this  case  the  stress  of  a 
sound  vibration. 

We  are  especially  concerned  with  those 
aspects  of  mechanically  sensitive  ion  channels 
that  give  them  their  unique  sensitivity.  What  are 
the  cellular  structures  that  convey  a  stimulus  to 
the  channel  protein,  and  what  stimulus  do  they 
convey?  What  is  the  biochemical  nature  of  the 
external  link  to  the  channel?  What  force  on  the 
channel  protein  is  required  to  open  it?  Are  there 
various  conformations  of  the  protein  that  are 
closed  or  open,  and  what  are  the  energy  differ- 
ences among  them? 

From  our  work  and  that  of  others,  an  attractive 
model  has  emerged  for  the  way  vibration  causes  a 
tension  on  the  ion  channels  in  cells  of  the  inner 
ear.  On  the  top  surface  of  these  cells  are  cilia  that 
rock  back  and  forth  with  each  cycle  of  a  sound 
wave.  Tiny  filaments,  recently  found  to  run  be- 
tween the  tips  of  the  cilia,  may  connect  directly 
to  the  channels,  such  that  oscillations  of  the  cilia 
alternately  stretch  or  relax  the  filaments,  opening 
the  channels  or  letting  them  close.  High-speed 
electrical  measurements  indicate  that  these  chan- 
nels can  open  within  a  few  millionths  of  a  second 
following  a  mechanical  stimulus. 


Although  the  essence  of  this  model  remains  to 
be  tested,  we  have  worked  on  some  important 
aspects.  We  recently  confirmed  the  location  of 
the  mechanically  sensitive  channels  at  the  tips  of 
the  cilia,  with  dyes  that  change  fluorescence 
when  they  bind  ions  coming  in  through  the  chan- 
nels. Measurement  of  the  mechanics  of  the  cilia 
by  high-resolution  videomicroscopy  indicates 
that  all  the  channels  of  a  cell  receive  essentially 
the  same  stimulus. 

To  understand  what  proteins  form  the  struc- 
tural links,  we  must  first  learn  what  proteins  con- 
stitute the  cilia.  To  this  end,  we  developed  a 
method  for  studying  the  cilia  that  utilizes  their 
adhesion  to  sticky  paper  and  separation  of  their 
proteins  according  to  size  and  electrical  charge. 
We  have  found  about  12-18  diff^erent  proteins  as 
major  constituents  and  have  identified  several  by 
size  and  their  reaction  with  certain  antibodies. 
Much  work  remains  to  identify  the  rest  and  learn 
their  roles. 

In  addition  to  this  transduction  mechanism, 
there  is  an  adaptation  mechanism  in  these  cells 
that  renders  them  sensitive  to  extremely  small  dis- 
placements while  they  respond  over  a  large  range 
of  stimuli.  This  seems  to  work  by  a  continuous 
adjustment  system,  acting  to  set  the  tension  on 
the  channels.  Within  a  tenth  of  a  second,  the  sys- 
tem can  restore  the  resting  tension,  which  is 
enough  to  keep  some  of  the  channels  open  at  any 
time. 

Our  experiments  and  those  of  others  suggest 
that  the  adaptation  comes  about  by  a  movement 
of  the  points  where  the  filaments  are  attached  to 
the  cilia.  When  the  filaments  are  stretched  to 
open  channels,  the  attachment  points  slip  to  al- 
low the  fibers  to  shorten.  Conversely,  when  the 
filaments  are  relaxed,  the  attachment  points 
climb  to  stretch  them,  to  restore  the  resting  ten- 
sion on  ion  channels.  The  slipping  process  is  ac- 
celerated by  the  movement  of  calcium  into  the 
tips  of  the  cilia,  suggesting  the  mechanism  is 
there.  Alteration  of  the  calcium  changes  the  rest- 
ing tension  set  by  this  motor.  It  also  causes  a  tiny 
movement  of  the  bundle  of  cilia — by  about  1 
millionth  of  an  inch — which  we  can  observe 
with  videomicroscopy.  Thus  the  mechanism 
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seems  more  like  an  active,  force-producing  mo- 
tor than  a  relaxation. 

Further  evidence  from  our  laboratory  suggests 
that  the  "motor  molecule"  that  moves  the  attach- 
ment point  is  like  myosin,  the  protein  that  causes 
muscle  to  contract,  and  we  are  now  seeking  to 
coniirm  or  refute  this  analogy. 

Our  ultimate  aim  is  to  describe  each  link  in  the 
mechanical  chain  from  cilia  to  channels,  in  terms 
of  the  protein  identity  of  the  links,  their  biophysi- 
cal properties,  and  their  relationship  to  each 
other.  Such  a  description  would  contribute  to  the 
long-range  goal  of  a  comprehensive  theory  of  me- 
chanically activated  channels,  not  only  in  the  ear 
but  in  the  many  other  cell  types  that  display  a 
mechanical  sensitivity. 

The  location  of  our  laboratory  in  a  hospital  has 
also  encouraged  collaborative  projects  related  to 
neurological  disease.  One  of  the  more  interesting 
recent  projects  involves  an  inherited  muscle  dis- 
order, hyperkalemic  periodic  paralysis.  This  ge- 
netic disease,  which  causes  sporadic  weakness  or 
paralysis,  is  dominantly  inherited,  so  that  a  child 
has  a  50-50  chance  of  being  affected. 


Exercise  or  certain  foods,  which  raise  the  level 
of  blood  potassium,  can  bring  on  a  paralytic  at- 
tack. Earlier  experiments  had  indicated  that  the 
increased  potassium  can  change  the  resting  volt- 
age of  the  muscle  and  (perhaps  indirectly)  acti- 
vate special  channels  that  pass  sodium  ions.  Col- 
laborators at  the  Massachusetts  General  Hospital 
showed  that  the  disease  is  genetically  linked  to 
the  muscle  sodium  channel  gene,  implicating 
these  channels.  We  then  measured  the  activity  of 
sodium  channels  in  muscle  from  an  affected  pa- 
tient and  found  them  defective.  The  defect  ex- 
plains the  pathology.  Elevated  potassium  pre- 
vents some  of  the  defective  sodium  channels 
from  closing,  and  the  steady  influx  of  sodium 
changes  the  voltage  so  as  to  open — and  then  per- 
manently close — all  the  other  channels.  The 
muscle  then  cannot  contract,  causing  paralysis. 
Consistent  with  dominant  inheritance,  a  small 
fraction  of  defective  channels  indirectly  inacti- 
vates all  the  rest,  including  the  normal  ones  in- 
herited from  the  other  parent.  Understanding  the 
pathology  at  the  molecular  level  gives  hints  for 
effective  drug  treatment. 
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CELLS  acquire  their  final  differentiated  func- 
tion by  a  complex  interplay  between  primary 
genetic  regulatory  events  in  the  nucleus  and 
interactions  at  the  cell  membrane.  Building  on 
concepts  largely  provided  from  studies  on  lower 
animals,  our  laboratory  has  been  exploring  regu- 
latory mechanisms  that  help  determine  how 
mammalian  cells  differentiate  to  assume  their 
normal  functions. 

T  Lymphocyte  Activation  and 
Differentiation 

T  lymphocytes  undergo  two  biologically  and 
medically  important  types  of  differentiation.  The 
first  occurs  in  the  thymus  and  generates  cells  ca- 
pable of  directing  an  immune  response  to  nearly 
any  antigen.  However,  the  cells  produced  by  the 
thymus  that  circulate  in  our  blood  are  immuno- 
logically nearly  inert.  They  acquire  immunologic 
function  as  a  result  of  a  second  process  of  cellular 
differentiation  that  takes  about  10-14  days  and 
produces  T  cells  able  to  coordinate  the  actions  of 
other  cells  involved  in  the  immune  response  and 
carry  out  a  variety  of  immune  functions.  This 
pathway  of  differentiation  is  initiated  by  a  com- 
plex interaction  between  the  T  cell  and  an  anti- 
gen-presenting cell.  The  essential  requirement 
for  a  commitment  to  specialized  function  is  a 
highly  specific  interaction  between  histocompati- 
bility molecules,  antigen,  and  the  antigen  recep- 
tor. This  critical  interaction  is  only  effective 
when  stabilized  by  transient  nonspecific  interac- 
tions based  on  intracellular  adhesive  molecules. 
Finally,  lymphokines  such  as  interleukin- 1  and  -6 
that  are  the  secreted  products  of  the  antigen-pre- 
senting cell  are  necessary  accessory  signals  to  ini- 
tiate differentiation.  These  three  requirements 
for  the  initiation  of  differentiation — a  highly  spe- 
cific cell-cell  interaction,  a  nonspecific  adhesive 
interaction,  and  cytokines — are  similar  to  the  re- 
quirements for  the  cellular  commitment  to  dif- 
ferentiate in  other  systems. 

Because  the  interaction  between  the  antigen- 
presenting  cell  and  a  T  lymphocyte  is  transient 
(lasting  only  about  30  minutes),  all  of  the  molec- 


ular events  required  for  the  decision  to  proceed 
down  this  1 0-  to  1 4 -day  process  of  cellular  differ- 
entiation must  occur  during  this  short  period. 
Our  laboratory  is  seeking  an  understanding  of  the 
molecular  basis  of  this  cellular  decision. 

By  analyzing  the  regulatory  regions  of  genes 
that  are  essential  for  T  cell  activation,  such  as  in- 
terleukin-2,  we  have  identified  several  molecules 
that  appear  critical  to  initiating  T  cell  activation. 
These  molecules  interact  on  the  promoters  of 
genes  necessary  for  T  cell  activation  in  such  a  way 
that  all  must  be  present  for  the  gene  to  be  acti- 
vated. Thus  the  all-or-none  decision  of  a  T  lym- 
phocyte to  differentiate  appears  related  to  coop- 
erative interactions  between  molecules  binding 
to  DNA  and  activating  the  genes,  again  in  an  on- 
or-off  manner,  essential  for  progression  toward 
differentiated  function. 

Recently,  our  studies  of  the  immunosuppres- 
sive drug  cyclosporin  A  have  given  us  insight  into 
the  relative  importance  of  the  events  involved  in 
the  initiation  of  T  cell  activation.  Although  the 
mechanism  of  action  of  this  drug  is  largely  un- 
known, it  appears  to  work  early  during  the  com- 
mitment period  for  T  cells.  By  doing  this,  it 
blocks  the  late  functions  of  T  cells  and  also  many 
of  the  functions  of  B  lymphocytes  and  other  hema- 
topoietic cells  that  are  directed  by  T  cells.  One  of 
the  proteins  we  identified  in  our  early  studies, 
nuclear  factor  of  activated  T  cells  (NFAT) ,  is  ex- 
quisitely sensitive  to  the  effects  of  cyclosporin  A, 
while  nearly  all  other  proteins  are  unaffected. 
Other  groups  had  found  that  cyclosporin  A  binds 
and  inhibits  the  function  of  a  cis-trans  prolyl  iso- 
merase.  These  enzymes  accelerate  the  folding  of 
newly  synthesized  proteins.  In  studies  with  Stuart 
Schreiber  (Harvard  University),  we  have  shown 
that  the  isomerase  activity  is  not  involved  in  the 
action  of  cyclosporin  A  and  FK-506  (another  im- 
munosuppressant) but  rather  that  an  inhibitory 
complex  formed  between  the  isomerase  and  the 
drug  blocks  signal  transduction.  In  other  studies 
we  found  that  the  specific  transcriptional  activity 
of  NFAT,  but  not  its  DNA-binding  activity,  is  af- 
fected by  cyclosporin  A  and  FK-506,  suggesting 
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that  NFAT  might  be  the  direct  target  of  these 
drugs. 

We  have  found  that  NFAT  is  a  complex  hetero- 
dimeric  protein:  one  subunit  is  constitutive,  T 
cell-specific,  and  located  in  the  cytoplasm  of 
resting  cells;  the  other  subunit  is  located  in  the 
nucleus  and  is  induced.  The  transcriptionally  ac- 
tive protein  forms  when  the  cytoplasmic  compo- 
nent translocates  to  the  nucleus  in  response  to 
stimulation  through  the  antigen  receptor  of  T 
cells.  Cyclosporin  A  and  FK-506  appear  to  func- 
tion by  inhibiting  the  translocation,  but  they  do 
not  interfere  with  the  induction  of  the  nuclear 
component.  Thus  the  prolyl  isomerase  may  func- 
tion both  as  an  isomerase  and  as  a  translocation 
molecule. 

Endodermal  and  Hepatocyte  Differentiation 

Several  years  ago  we  identified  a  tissue-specific 
transcription  factor,  HNF-1,  that  interacts  with 
essential  regions  of  the  promoters  of  a  large  fam- 
ily of  genes  expressed  in  endodermally  derived 


tissues.  During  the  past  year  we  were  able  to  pu- 
rify the  protein,  obtain  the  amino  acid  sequence 
from  protease-derived  fragments,  and  clone  its 
gene.  Not  surprisingly,  the  protein  contains  a  ho- 
meodomain  similar  to  that  found  in  genes  deter- 
mining body  form  in  insects.  Curiously,  the  pro- 
tein dimerizes  through  a  region  in  the  amino 
terminus  of  the  protein,  unlike  that  found  in 
other  homeodomain-containing  proteins.  This 
led  us  to  look  for  a  protein  that  might  heterodi- 
merize  with  it  and  hence  diversify  its  regulatory 
capabilities.  We  found  such  a  protein  by  screen- 
ing a  hepatocyte  cDNA  library  at  low  stringency. 
This  protein,  HNF-l/?,  which  is  expressed  in  an 
overlapping  group  of  tissues  with  HNF-1  a,  con- 
tains a  dimerization  and  homeodomain  similar  to 
those  of  HNF-1  a  but  a  different  transcriptional 
activation  domain.  The  HNF-1  ;S  pattern  of  ex- 
pression indicates  that  it  is  a  repressor  rather  than 
activator  of  genes  selectively  expressed  in  the 
liver.  During  the  coming  year,  we  will  investigate 
the  role  of  these  proteins  in  mammalian  endoder- 
mal development. 
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RETROVIRUSES  derive  their  name  from  their 
ability  to  reverse  the  normal  flow  of  genetic 
information  from  DNA  to  RNA.  They  have  the 
unique  ability  to  synthesize  a  double-stranded 
DNA  copy  of  their  single-stranded  RNA  genome 
and  then  to  integrate  this  DNA  copy  into  the  ge- 
nome of  the  infected  host  cell.  Once  the  genome 
of  a  retrovirus  is  integrated  into  a  host  chromo- 
some, it  is  indistinguishable  from  a  host  gene  and 
may  be  actively  transcribed  by  the  host  transcrip- 
tional machinery. 

The  infection  of  an  animal  by  retroviruses  can 
result  in  a  number  of  disease  states,  of  which  the 
most  common  is  leukemia.  The  avian  leukemia 
virus  (ALV),  discovered  by  Ellerman  and  Bang  in 
1908,  was  the  first  oncogenic  virus  to  be  defined 
experimentally.  ALV  and  the  somewhat  similar 
murine  leukemia  viruses  continue  to  be  studied 
extensively  as  models  for  this  virus  group.  This 
research  has  not  only  helped  to  delineate  the  ret- 
roviral replication  cycle  but  has  also  greatly  ad- 
vanced our  understanding  of  retroviral  oncogen- 
esis. Most  importantly,  this  research  has  allowed 
the  definition  of  a  number  of  cellular  genes,  the 
oncogenes,  whose  inappropriate  expression  can 
contribute  to  cellular  transformation. 

Although  animal  retroviruses  have  been  the 
subject  of  scientific  research  for  some  time,  the 
discovery  of  human  retroviruses  occurred  only 
within  the  last  decade.  Two  major  groups  of 
pathogenic  human  retroviruses  have  been  identi- 
fied thus  far.  Human  T  cell  leukemia  viruses 
(HTLV-I  and  HTLV-II)  are  known  to  be  causative 
agents  of  human  leukemias,  including  adult  T 
cell  leukemia,  and  are  significant  disease  agents 
in  several  parts  of  the  world,  including  Japan  and 
the  Caribbean  basin.  Of  even  more  concern  are 
the  human  immunodeficiency  viruses  (HIV-1  and 
HIV-2),  which  are  a  leading  cause  of  disease  and 
death  in  parts  of  Africa  and  in  the  United  States. 

In  the  past  I  worked  extensively  on  the  regula- 
tion of  gene  expression  by  the  avian  leukemia 
viruses.  This  background  has  greatly  facilitated 
the  current  research  of  this  laboratory,  which 
focuses  on  the  regulation  of  gene  expression 


within  the  human  retroviruses  and  particularly 
on  HIV-1.  The  genome  of  HIV-1,  like  the  ge- 
nomes of  the  avian  and  murine  leukemia  viruses, 
contains  sequences  encoding  the  viral  structural 
genes  env  (envelope  glycoprotein),  gag  (capsid 
protein) ,  and  pol  (RNA-dependent  DNA  polymer- 
ase) .  However,  the  complexity  of  the  HIV- 1  ge- 
nome, despite  its  similar  size,  is  far  greater  than 
observed  for  these  animal  retroviruses.  In  particu- 
lar, HIV- 1  is  now  known  to  encode  six  additional 
gene  products  that  have  no  equivalents  in  the 
avian  and  murine  leukemia  viruses.  Two  of  these 
viral  proteins.  Tat  and  Rev,  are  nuclear  regulatory 
proteins  that  are  essential  for  HIV-1  replication. 
Both  of  these  viral  trans-activators  appear  to  pos- 
sess unprecedented  mechanisms  of  action  that  in- 
volve highly  specific  RNA-protein  interactions. 

In  the  past  we  demonstrated  that  the  Tat  pro- 
tein of  HIV-1  acts  on  sequences  located  within 
the  HIV-1  promoter  element  (the  long  terminal 
repeat  or  LTR)  to  increase  the  level  of  expression 
of  linked  genes.  This  increased  viral  gene  expres- 
sion occurs  via  a  bimodal  mechanism  that  in- 
volves an  increase  in  the  rate  of  transcription  of 
HIV-1  mRNAs  and  in  the  efficiency  of  transla- 
tional  utilization  of  those  RNAs.  The  target  se- 
quence for  Tat  is  a  59-nucleotide  RNA  stem-loop 
structure  located  at  the  very  5'  end  of  all  viral 
mRNA  molecules.  The  direct  interaction  of  Tat 
with  this  RNA  structure  leads  to  an  enhancement 
of  viral  transcription  initiation  and,  particularly, 
elongation.  This  mechanism,  which  appears 
unique  to  Tat,  remains  poorly  understood  but  is 
likely  to  involve  the  interaction  of  Tat  with  cellu- 
lar proteins  that  are  currently  poorly  defined.  The 
identification  and  characterization  of  these  cellu- 
lar proteins  is  a  major  research  aim  of  this 
laboratory. 

A  second  HIV- 1  protein.  Rev,  is  required  for  the 
expression  of  viral  structural  proteins  but  is  dis- 
pensable for  the  expression  of  viral  regulatory 
proteins.  Our  research  has  demonstrated  that  Rev 
acts  post-transcriptionally  to  induce  the  cytoplas- 
mic expression  of  the  unspliced  or  incompletely 
spliced  RNAs  that  encode  the  viral  structural  pro- 
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teins  Gag  and  Env,  while  simultaneously  repress- 
ing the  expression  of  the  fully  spliced  RNAs  that 
encode  the  viral  regulatory  proteins,  including 
Rev  itself.  The  Rev  protein  therefore  regulates  its 
own  expression  via  a  negative  feedback  mecha- 
nism. Recent  data  suggest  that  Rev  achieves  this 
effect  by  specifically  regulating  the  export  of 
viral  RNAs  from  the  cell  nucleus  to  the  cyto- 
plasm. This  specificity  is  conferred  by  a  cis-acting 
viral  RNA  target  sequence,  the  Rev  response  ele- 
ment (RRE),  which  has  been  shown  to  form  a 
complex  RNA  secondary  structure.  Rev  function 
appears  to  require  the  sequential  binding  of  mul- 
tiple Rev  protein  monomers  to  the  RRE  sequence. 

Mutational  analysis  of  the  Rev  protein  has  dem- 
onstrated the  existence  of  two  functional  do- 
mains. The  first  is  a  sequence-specific  RNA-bind- 
ing  domain  required  for  the  interaction  with  the 
RRE,  while  the  second  appears  to  interact  with  a 
currently  unidentified  cellular  protein  that  may 
be  part  of  the  cellular  RNA  transport  machinery. 


Mutations  of  this  latter  domain,  the  Rev  activation 
domain,  give  rise  to  Rev  proteins  that  act  as 
competitive  inhibitors  of  the  wild-type  Rev  trans- 
activator.  Mutant  HIV-1  proteins  of  this  type 
(dominant  negative  mutants)  may  have  future  ap- 
plication in  the  gene  therapy  of  HIV- 1 -infected 
individuals.  A  major  focus  of  this  laboratory  is  the 
development  of  these  trans-dominant  Rev  mu- 
tants and,  in  particular,  the  further  investigation 
of  the  role  of  cellular  proteins  in  the  Rev 
response. 

Finally,  my  laboratory  has  begun  to  expand  our 
research  to  other  human  retroviruses,  including 
HIV- 2  and  the  apparently  nonpathogenic  human 
foamy  retrovirus,  as  well  as  to  related  animal  ret- 
roviruses, such  as  visna  virus.  The  elucidation  of 
similarities  and  differences  in  the  regulation  of 
gene  expression  among  these  retroviruses  should 
facilitate  the  identification  and  understanding  of 
the  cis-  and  trans-acting  elements  required  for 
their  replication  and  pathology. 
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VIRUSES  are  parasites.  They  cannot  produce 
progeny  on  their  own,  but  must  rely  on  the 
machinery  provided  by  the  host  cell  to  replicate 
the  viral  genome  and  assemble  new  virus  parti- 
cles. Infection  is  initiated  by  attachment  of  the 
virus  to  the  host  cell — the  first  step  in  a  complex 
reaction  that  results  in  transfer  of  the  viral  ge- 
nome through  the  cell  membrane  and  into  the 
cytoplasm.  This  attachment,  or  binding,  is  a  con- 
sequence of  the  interaction  between  proteins  ex- 
posed on  the  surface  of  the  virus  and  the  host  cell 
plasma  membrane. 

Cells  that  do  not  express  a  suitable  virus-bind- 
ing protein,  called  a  receptor,  are  not  susceptible 
to  infection  by  a  particular  virus.  Indeed,  the  abil- 
ity of  many  pathogenic  viruses,  such  as  human 
immunodeficiency  virus,  poliovirus,  and  certain 
herpesviruses,  to  infect  specific  host  tissues  has 
been  closely  correlated  with  the  expression  of 
specific  receptors. 

Our  laboratory  has  been  interested  in  the  mech- 
anism of  infection  utilized  by  Moloney  murine 
leukemia  virus  (Mo-MuLV) ,  a  member  of  a  group 
of  related  leukemogenic  retroviruses  found  in  vir- 
tually all  vertebrates.  We  have  isolated  a  molecu- 
lar clone,  Rec- 1,  which  confers  Mo-MuLV  infectiv- 
ity  upon  introduction  into  mammalian  cells  that 
are  not  normally  susceptible  to  infection.  Subse- 
quent experiments  have  demonstrated  that  Rec- 1 
encodes  for  a  membrane  protein  that  serves  as  the 
Mo-MuLV  receptor.  Our  current  research  is  ad- 
dressed toward  dissecting  the  molecular  details 
of  the  virus-receptor  interaction  that  mediates 
infection. 

The  Mo-MuLV  receptor  is  not  present  in  mice 
for  the  convenience  of  the  virus,  but  rather  must 
have  a  function  essential  to  normal  cell  metabo- 
lism. We  have  identified  the  receptor  protein  on 
the  plasma  membrane  of  all  mouse  cells,  suggest- 
ing it  participates  in  basic  cell  metabolism  and 
does  not  perform  a  specialized  function  limited 
to  a  subset  of  tissues.  Examination  of  the  amino 
acid  sequence  encoded  by  Ree  l  reveals  that  it 
comprises  an  abundance  of  extremely  hydropho- 
bic amino  acid  residues.  This  implies  the  recep- 
tor protein  must  exist  primarily  within  the  lipid 


environment  of  the  membrane.  A  molecular 
model  of  the  Mo-MuLV  receptor  predicts  that  it 
traverses  the  plasma  membrane  as  many  as  14 
times,  a  structure  similar  to  a  protein  found  in 
yeast  that  transports  the  amino  acids  arginine  and 
lysine.  Arginine  and  lysine  are  2  of  the  20  amino 
acids  that  are  the  building  blocks  of  proteins. 
Frog  eggs  injected  with  Rec- 1  RNA  demonstrate  a 
large  increase  in  the  transport  of  arginine  and  ly- 
sine across  the  cell  membrane.  More  detailed  ex- 
periments have  confirmed  that  the  Mo-MuLV  re- 
ceptor is  the  transporter  for  all  cationic  amino 
acids,  i.e.,  amino  acids  that  carry  a  net  positive 
charge.  Amino  acid  transporters  have  been 
known  to  exist  in  mammalian  cells,  but  none 
have  previously  been  isolated  or  their  structure 
known.  The  similarity  between  the  yeast  and 
mouse  cationic  amino  acid  transporter  is  consis- 
tent with  conservation  of  a  single  mechanism  for 
transport  of  cationic  amino  acids  over  evolution- 
ary time  and  predicts  that  proteins  similar  to  the 
Mo-MuLV  receptor  are  used  by  all  animals.  In- 
herited disorders  of  cationic  amino  acid  transport 
have  been  described  in  patients;  mutations  in 
Ree  l  genes  may  explain  these  disorders,  a  hy- 
pothesis we  are  now  examining. 

In  addition  to  protein  synthesis,  cationic  amino 
acids  have  other  roles.  For  example,  arginine  and 
ornithine  are  important  intermediates  in  the  urea 
cycle,  a  metabolic  pathway  found  in  liver  cells 
that  rids  animals  of  nitrogen  waste.  A  protein  that 
is  closely  related  to  the  Mo-MuLV  receptor  has 
been  identified  in  liver  tissue,  and  we  are  examin- 
ing its  role  in  transport  of  ornithine  across  the 
mitochondrial  membrane  of  hepatocytes,  an  im- 
portant step  in  this  pathway.  Also  within  the  past 
few  years,  arginine  has  been  identified  as  the  sub- 
strate for  nitric  oxide,  the  proximal  molecule  in  a 
cell-signaling  pathway  that  is  important  in  the 
control  of  blood  pressure,  nerve  transmission, 
and  host  immune  defense.  Currently  we  are  in- 
vestigating how  the  Mo-MuLV  receptor/trans- 
porter can  influence  nitric  oxide  production  by 
regulating  arginine  availability. 

The  laboratory  remains  interested  in  how  retro- 
viruses interact  with  the  Mo-MuLV  receptor  to 
permit  fusion  with  the  target  cell  membrane  and 
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gain  entry  into  the  cell.  Recently  we  completed 
experiments  that  identify  the  portion  of  the  Mo- 
MuLV  receptor  that  binds  to  the  virus  envelope. 
Now  we  are  examining  other  mutant  receptor 
proteins  that  have  normal  retrovirus  binding  but 
do  not  permit  infection.  These  studies  may  iden- 
tify portions  of  the  receptor  protein  that  are  im- 
portant for  the  fusion  of  the  virus  to  the  target 


cell,  the  step  in  infection  that  follows  binding. 
We  are  also  investigating  the  consequences  of 
Mo-MuLV  binding  and  infection  on  cationic 
amino  acid  transport.  Our  long-term  goal  is  to 
understand  the  chemical  basis  of  the  Mo-MuLV- 
Rec- 1  interaction  in  sufficient  molecular  detail  to 
design  small  molecules  that  can  block  virus  bind- 
ing and  prevent  infection. 
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THE  two  key  features  of  the  immune  system 
are  diversity  and  specificity.  While  it  must  be 
capable  of  recognizing  a  large  array  of  pathogens, 
it  must  subsequently  mount  a  specific  and  appro- 
priate response  to  each.  Diversity  is  accom- 
plished at  the  level  of  the  cell  surface  receptors 
on  T  and  B  lymphocytes.  Specificity  is  accom- 
plished at  several  levels,  the  first  of  which  is  in 
the  intracellular  proteolytic  processing  of  the 
pathogen.  This  processing  results  in  the  genera- 
tion of  peptide  fragments,  which  then  bind  to  ei- 
ther class  I  or  class  II  products  of  the  major  histo- 
compatibility complex  (MHC). 

The  choice  of  class  I  or  class  II  MHC  molecule 
is  determined  by  the  nature  of  the  pathogen.  In 
general,  peptide  fragments  derived  from  patho- 
gens that  have  entered  the  cell  from  outside,  like 
most  bacteria,  bind  to  class  II  molecules,  while 
fragments  derived  from  pathogens  that  use  the 
cell's  replicative  machinery  to  proliferate,  like 
viruses,  bind  to  class  I  molecules.  Once  bound, 
the  fragment  is  transported  to  the  cell  surface, 
where  it  can  be  recognized  by  an  effector  T 
lymphocyte. 

The  type  of  T  lymphocyte  responding  is  closely 
linked  to  the  type  of  MHC  molecule  presenting 
the  peptide.  Thus  a  peptide  bound  to  a  class  II 
molecule  will  stimulate  a  helper  T  cell,  which  in 
turn  will  trigger  the  production  of  antibodies.  In 
contrast,  a  peptide  bound  to  a  class  I  molecule 
targets  the  presenting  cell  for  destruction  by  stim- 
ulating a  cytotoxic  T  lymphocyte.  Our  research 
focuses  on  various  aspects  of  the  intracellular 
processes  that  allow  class  I  MHC  molecules  to 
distinguish  endogenously  synthesized  pathogens 
from  those  that  have  been  taken  up  from  outside. 

The  mature  class  I  molecule  is  a  heterodimer 
consisting  of  a  transmembrane  heavy  chain  non- 
covalently  associated  with  a  soluble  light  chain. 
Studies  with  mutant  cell  lines  have  revealed  that 
the  assembly  of  class  I  heterodimers  takes  place 
in  the  endoplasmic  reticulum  and  is  tightly  regu- 
lated. These  studies  have  shown  that  the  peptide 
itself  plays  a  role  in  determining  the  fate  of  the 
complex.  Thus  class  I  molecules  that  have  suc- 
cessfully bound  a  peptide  are  transported  to  the 


cell  surface,  while  those  that  have  failed  to  bind  a 
peptide  dissociate  and  are  degraded  within  the 
cell.  This  implies  that  there  is  an  intracellular 
"gate"  that  distinguishes  heterodimers  that  have 
bound  peptide — and  are  therefore  functional — 
from  those  that  have  not.  To  gain  insight  into  the 
mechanism  behind  this  gate,  Rebecca  Elstrom  has 
studied  the  assembly  process  in  an  in  vitro  sys- 
tem. She  has  succeeded  in  inducing  class  I  heavy 
and  light  chains,  which  have  been  synthesized  in 
vitro,  to  assemble  within  membrane  vesicles  de- 
rived from  the  endoplasmic  reticulum.  The  re- 
sults of  her  studies  indicate  that  in  an  oxidizing 
environment,  which  is  the  physiological  state  of 
the  endoplasmic  reticulum,  stable  heterodimers 
are  formed  even  in  the  absence  of  peptide.  In 
contrast,  in  a  reducing  environment,  stable  het- 
erodimers are  formed  only  when  peptide  is 
added.  These  findings  suggest  that  class  I  hetero- 
dimers form  in  the  endoplasmic  reticulum  inde- 
pendent of  peptide.  Subsequently,  while  en 
route  to  the  surface,  they  may  encounter  an  intra- 
cellular compartment  with  a  reducing  environ- 
ment. In  this  compartment,  only  heterodimers 
that  have  bound  peptide  remain  intact  and  are 
allowed  to  proceed  to  the  cell  surface,  while 
those  that  have  not  bound  peptide  dissociate. 
This  in  vitro  system  holds  great  promise  for  pro- 
ducing further  insights  into  assembly  of  class  I 
complexes. 

Studies  initiated  by  Michael  Zegans,  a  Howard 
Hughes  Medical  Scholar,  and  being  pursued  by 
Rabin  Chakrabarti,  a  Howard  Hughes  Associate, 
focus  on  a  special  problem  in  antigen  presenta- 
tion, that  of  intracellular  parasites.  Such  parasites 
replicate  within  the  host  cell  but  use  their  own 
machinery  to  do  so.  Although  they  might  be  ex- 
pected to  shed  proteins  into  the  cytoplasm  where 
they  would  be  exposed  to  proteolysis  and  thus  to 
presentation  by  class  I  molecules,  the  occurrence 
of  class  I-restricted  immune  responses  has,  for 
many  parasites,  been  difficult  to  demonstrate. 
Chlamydia  trachomatis,  a  leading  cause  of 
blindness  in  the  world,  is  an  organism  particu- 
larly suitable  for  such  studies,  since  more  than 
half  of  its  entire  mass  is  contributed  by  a  single 
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protein — the  major  outer  membrane  protein 
(MOMP) .  Dr.  Chakrabarti  has  succeeded  in  estab- 
lishing stable  cell  lines  expressing  the  MOMP 
gene.  These  cell  lines  are  being  used  for  the  de- 
tection and  characterization  of  cytotoxic  re- 
sponses in  infected  mice. 

Juerg  Baenziger,  a  Howard  Hughes  Associate, 
has  been  studying  another  aspect  of  class  I- 
mediated  processes.  This  study  is  related  to  the 
behavior  of  the  peptide-class  I  complexes  in  T 
lymphocytes  after  they  reach  the  surface.  Several 
years  ago  it  was  observed  that  class  I  complexes 
are  endocytosed,  or  internalized,  by  T  lympho- 
cytes, but  only  when  the  lymphocytes  are  acti- 
vated. To  explore  further  the  molecular  basis  as 
well  as  the  function  of  T  cell-specific  endocyto- 
sis  of  class  I  complexes,  he  has  constructed  a  se- 
ries of  site-directed  mutants  of  the  human  class  I 
MHC  molecule  A2.1.  These  constructs  include 
various  truncations,  deletions,  and  substitutions 
within  the  cytoplasmic  domain  of  this  mem- 
brane-spanning protein.  They  also  include  chi- 
meras in  which  the  extracellular  domain  is  de- 
rived from  A2.1  and  the  transmembrane  and 
cytoplasmic  domains  are  derived  from  either  the 
T  cell  surface  marker,  CD4,  or  the  receptor  for 
low-density  lipoprotein  (LDL) . 


Several  of  the  constructs  have  been  introduced 
into  Jurkat  cells,  a  human  T  cell  lymphoma  line. 
The  properties  of  the  chimeras  in  these  cells  with 
regard  to  endocytosis  exactly  parallel  those  of  the 
donors  of  the  cytoplasmic  tails.  Specifically,  CD4 
chimeras  endocytose  only  when  phosphorylated, 
whereas  LDL  receptor  chimeras  are  endocytosed 
and  recycled  constitutively. 

In  addition  to  providing  insights  into  structure- 
function  relationships  in  the  trafficking  of  the  na- 
tive molecules,  studies  in  these  cell  lines  have 
led  Dr.  Baenziger  to  conclude  that  the  class  I  mol- 
ecules that  are  endocytosed  in  activated  T  lym- 
phocytes are  structurally  distinct  from  normal 
class  1  molecules.  By  comparing  messenger  RNA 
from  resting  and  activated  human  T  cells,  he 
found  that  activated  cells  produce  a  single  new 
species  of  class  I  heavy  chains.  These  heavy 
chains,  which  lack  the  domain  that  normally  an- 
chors them  in  the  membrane,  are  secreted  from 
the  cell.  Current  studies  are  directed  toward 
determining  whether  these  soluble  class  I 
molecules  account  in  whole  or  in  part  for  the 
endocytosis  observed  in  activated  T  lymphocytes. 
Such  a  mechanism  would  have  novel  implica- 
tions for  regulation  of  T  cell-mediated  immune 
responses. 
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WE  have  focused  on  several  major  areas  in 
immunology  that  generally  resolve  into 
three  problems:  How  do  T  cells  recognize  foreign 
entities?  How  are  they  selected  in  the  thymus? 
How  is  lymphocyte  differentiation  controlled  ge- 
netically? An  additional  goal  is  to  refine  and  bet- 
ter integrate  recombinant  DNA  technology  with 
some  of  the  other  powerful  techniques  in  immu- 
nology, as  an  approach  to  defining  the  function  of 
unknown  genes  or  poorly  understood  genes  and 
their  products. 

T  Cell  Recognition  of  Foreign  Antigens 

The  work  of  many  investigators  over  the  years 
has  shown  that  T  cells,  through  their  antigen  re- 
ceptor molecules,  recognize  fragments  of  foreign 
proteins  (peptides)  embedded  in  major  histo- 
compatibility complex  (MHC)  molecules.  This  is 
in  contrast  to  antibodies,  which,  while  closely 
related  to  T  cell  antigen  receptors,  bind  intact 
foreign  proteins  directly.  Because  of  the  consis- 
tently high  concentration  of  sequence  diversity 
in  the  V-J  junctional  region  of  T  cell  receptors 
(and  other  considerations),  we  have  proposed 
that  this  is  the  important  region  for  peptide  recog- 
nition and  that  other  V  region-encoded  residues 
might  contact  the  surface  of  the  MHC  molecule. 

To  test  this  possibility  and  learn  about  the  mo- 
lecular dynamics  of  T  cell  receptor-peptide- 
MHC  interactions  in  general,  we  have  developed 
expression  systems  that  allow  us  to  produce  ei- 
ther T  cell  receptor  or  MHC  class  II  heterodimers 
in  a  solubilizable  form.  This  involves  replacing 
the  normal  membrane-spanning  sequences  of 
these  polypeptides  with  a  signal  sequence  for 
lipid  linkage,  such  as  employed  normally  by  a 
number  of  cell  surface  proteins.  Molecules  ex- 
pressed in  this  fashion  can  then  be  conveniently 
cleaved  from  the  surface  of  expressing  cells  with 
the  enzyme  phosphatidylinositol-specific  phos- 
pholipase  C.  By  utilizing  high-density  mamma- 
lian cell  culture  machines,  we  are  able  to  make 
milligram  quantities  of  a  soluble  T  cell  receptor 
and  its  cognate  MHC  molecule.  We  hope  to  dem- 
onstrate a  direct  interaction  between  these  macro- 


molecules  (with  the  appropriate  peptide  in  the 
MHC  component)  and  to  measure  the  affinity 
with  which  this  happens.  This  work  is  also  in- 
tended to  provide  the  raw  material  needed  for 
structural  studies,  such  as  x-ray  crystallography. 

We  have  already  used  the  soluble  MHC  protein 
to  show  greatly  enhanced  uptake  of  antigenic 
peptides  at  low  pH.  This  is  important  both  in 
making  significant  quantities  of  a  pure  antigen- 
MHC  complex  and  in  understanding  the  biology 
of  this  type  of  MHC  molecule  (class  II),  which 
recycles  through  low  pH  endosomal  compart- 
ments. Our  current  data  suggest  that  the  low  pH 
triggers  a  specific  comformational  change  in  the 
MHC  molecule,  which  allows  it  to  bind  new  pep- 
tides more  easily. 

Recent  experiments  relevant  to  the  issue  of 
whether  the  CDR3-equivalent  regions  of  T  cell 
receptors  are  responsible  for  peptide  recognition 
take  advantage  of  an  immunological  version  of 
classical  genetic  second-site  suppression  analy- 
sis. In  our  experiments  we  change  residues  that 
are  important  for  T  cell  recognition  (and  not 
MHC  interaction)  on  a  peptide,  immunize  mice, 
and  then  analyze  the  responding  T  cells  that 
emerge  with  respect  to  their  T  cell  receptor  se- 
quences. To  hold  part  of  the  original  receptor 
constant,  we  immunize  mice  that  are  transgenic 
for  either  chain  of  the  original  T  cell  receptor. 
With  this  new  approach,  we  have  shown  that  of 
the  two  residues  on  the  peptide  that  are  most  im- 
portant in  the  T  cell  response,  one  is  governed  by 
the  CDR3  of  the  V„  polypeptide  and  the  other, 
three  amino  acids  downstream  on  the  peptide  se- 
quence, is  specified  by  the  CDR3  of  the  poly- 
peptide. Thus  we  have  generated  significant  sup- 
port for  the  original  hypothesis. 

T  Cell  Receptor  Selection 

Another  area  of  interest  in  T  cell  recognition 
involves  the  ways  in  which  an  organism  might 
select  for  particular  T  cell  receptor-bearing 
cells.  The  specific  mechanisms  and  extent  of 
such  selections  are  just  now  becoming  clear.  To 
address  these  questions,  we  have  made  mice 
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transgenic  for  T  cell  receptors  of  defined  specific- 
ity and  have  followed  the  expression  of  these  mol- 
ecules during  T  cell  differentiation.  We  have  ob- 
tained evidence  of  selection  both  for  and  against 
particular  T  cell  receptors,  depending  on  the 
MHC  genotype  of  the  host. 

Our  evidence  concerning  positive  selection  in- 
dicates that  this  is  an  inefficient  process  that  can 
be  significantly  influenced  by  the  density  or  loca- 
tion of  the  MHC  molecules  required.  Experi- 
ments also  indicate  that  MHC  alleles  that  differ  in 
the  binding  site  of  MHC  profoundly  alter  or  abol- 
ish this  type  of  selection,  suggesting  that  specific 
peptides  (or  other  molecules  occupying  the  site) 
may  play  a  role.  Thus  it  seems  probable  that  in 
addition  to  the  complex  of  foreign  antigen  plus 
MHC  that  a  given  T  cell  receptor  may  encounter 
in  the  periphery,  there  is  first  another  MHC  + 
"X"  complex  that  must  be  successfully  encoun- 
tered in  the  thymus. 

We  have  also  developed  a  transgenic  model  for 
self-tolerance  of  T  cells  by  introducing  a  fusion 
protein  that  contains  the  target  peptide  seen  by 
our  T  cell  receptor  transgenic  mice.  This,  to- 
gether with  direct  peptide  injection/slow  deliv- 
ery systems,  has  enabled  us  to  control  the  timing 
and  dose  of  an  antigen  seen  as  "self."  We  find 
evidence  for  both  paralysis  and  deletion  of  T 
cells,  with  deletion,  which  occurs  in  both  the 


thymus  and  the  periphery,  the  favored  outcome 
as  the  dose  of  self  antigen  is  increased. 

Genes  Controlling  Lymphocyte 
Differentiation 

The  last  area  of  interest  involves  the  isolation 
and  characterization  of  genes  that  might  control 
differentiation  in  lymphocytes.  We  have  worked 
extensively  on  the  XLR  gene,  which  encodes  a 
small  (25-kDa)  previously  unidentified  nuclear 
protein  that  is  specifically  turned  on  in  late-stage 
B  cells  and  medium-  to  later-stage  T  cells.  One 
novel  feature  of  the  protein  is  that  it  is  stabilized 
in  its  nuclear  location  by  a  relatively  low  concen- 
tration of  zinc  ions.  A  number  of  other  proteins 
have  the  same  characteristics,  and  there  may  be 
some  novel  zinc-dependent  structures  in  the  nu- 
cleus. We  are  currently  trying  to  deduce  what  the 
product  of  this  gene  does. 

A  major  effort  is  also  under  way  to  isolate  other 
potential  regulatory  genes,  using  new  subtractive 
hybridization  and  cloning  schemes  and  nuclear 
localization  as  a  screen.  We  are  particularly  inter- 
ested in  genes  that  are  turned  on  late  in  B  cell 
differentiation  and  in  those  that  are  specific  for 
early  thymocytes.  Nuclear  localizing  proteins  in 
either  category  may  provide  important  clues 
about  the  regulation  of  differentiation  in  these 
cells,  clues  that  may  be  applicable  to  cellular  dif- 
ferentiation in  general. 
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CELLULAR  proliferation  is  a  highly  regulated 
process.  During  embryonic  development, 
rapid  cell  growth  is  required  to  form  the  tissues 
of  the  body.  In  contrast,  cellular  proliferation  in 
adults  is  slow,  primarily  serving  to  replace  senes- 
cent cells.  Adults,  however,  retain  a  limited  ca- 
pacity for  rapid  grovvT;h — for  example,  during 
wound  healing.  Regulation  of  this  proliferative 
capacity  is  critically  important.  Errors  in  growth 
control  result  in  a  variety  of  diseases,  including 
cancer. 

The  local  production  of  protein  growth  factors 
is  an  important  mechanism  that  can  account  for 
the  control  of  cellular  proliferation.  Our  research 
group  is  investigating  the  action  of  a  family  of 
peptides  that  includes  epidermal  grovvT:h  factor 
(EGF)  and  transforming  growth  factor-a  (TGF- 
a).  These  agents  are  synthesized  as  cell  surface 
glycoproteins  that  are  split  to  release  small  solu- 
ble peptides.  Both  the  membrane-bound  precur- 
sor and  the  diffusible  peptides  are  biologically 
active  and  bind  to  specific  receptor  molecules 
located  at  the  surface  of  responsive  cells.  Secre- 
tion of  these  peptide  growth  factors  contributes 
to  the  rapid  grovv^h  of  some  tumors. 

The  long-term  goal  of  this  laboratory  is  to  un- 
derstand the  molecular  basis  for  the  control  of 
cellular  proliferation  by  the  EGF  receptor.  It  is 
known  that  the  binding  of  growth  factors  to  this 
receptor  at  the  cell  surface  triggers  a  complex 
series  of  chemical  reactions  that  lead  to  DNA  syn- 
thesis within  the  nucleus  and  to  cell  division. 
However,  the  molecular  details  of  the  signaling 
pathways  utilized  by  the  receptor  are  poorly 
understood. 

Regulation  of  EGF  Receptor  Function 

The  EGF  receptor  is  a  glycoprotein  consisting 
of  an  extracellular  domain  that  binds  growTih  fac- 
tors, a  membrane-spanning  domain,  and  a  cyto- 
plasmic domain.  The  cytoplasmic  domain  is  an 
enzyme,  tyrosine  kinase,  that  causes  the  covalent 
attachment  of  phosphate  to  tyrosine  components 
of  substrate  proteins  (phosphorylation).  The 
binding  of  EGF  to  the  receptor's  extracellular  do- 
main causes  an  increase  in  the  tyrosine  kinase  ac- 
tivity of  the  cytoplasmic  domain.  EGF  also  causes 


the  receptor  to  aggregate  and  to  associate  tran- 
siently with  intracellular  regulatory  molecules  to 
form  a  signal  transduction  complex.  We  are  study- 
ing these  interactions  and  investigating  the  con- 
sequences of  the  phosphorylation  process. 

The  ability  of  the  growth  factor  EGF  to  increase 
the  tyrosine  kinase  activity  of  its  receptor  is 
blocked  when  cells  are  incubated  with  a  tumor 
promoter  or  with  other  growth  factors.  Under 
these  conditions,  the  EGF  receptor  is  itself  phos- 
phorylated  at  multiple  serine  and  threonine  resi- 
dues. We  are  investigating  the  significance  of  this 
phosphorylation.  Our  approach  is  to  construct 
receptors  with  point  mutations  at  the  sites  of 
phosphorylation,  using  recombinant  DNA  tech- 
nology. These  studies  have  demonstrated  that  the 
phosphorylation  of  a  single  threonine  residue 
blocks  the  ability  of  EGF  to  stimulate  the  recep- 
tor's tyrosine  kinase  activity.  Phosphorylation 
also  alters  the  internalization  of  the  receptor.  We 
are  investigating  the  structural  basis  for  the  ef- 
fects of  phosphorylation  on  the  regulation  of  EGF 
receptor  function. 

Signaling  by  the  EGF  Receptor 

A  principal  question  that  we  must  answer  in 
order  to  understand  the  mechanism  of  signal 
transduction  by  the  EGF  receptor  is  how  a  signal 
that  is  initiated  at  the  cell  surface  can  be  transmit- 
ted to  the  nucleus  to  cause  DNA  replication.  One 
class  of  regulatory  molecules  that  could  account 
for  this  process  is  the  protein  kinases.  We  have 
recently  identified  a  novel  protein  kinase  activity 
that  is  acutely  regulated  by  growth  factors.  This 
protein  kinase  is  located  in  both  the  nuclear  and 
cytosolic  compartments  of  cells.  Substrates  that 
we  have  identified  include  the  EGF  receptor  and 
the  nuclear  transcription  factors  expressed  by  the 
proto-oncogenes  c-myc  and  cfun.  We  are  inves- 
tigating the  structure  of  this  protein  kinase  by  mo- 
lecular cloning,  and  examining  the  role  of  this 
enzyme  during  signal  transduction. 

Tissue  Specificity  of  Tumor  Induction 

The  gene  for  the  EGF  receptor  is  a  frequent  site 
of  integration  by  avian  leukosis  viruses.  Insertion 
of  the  virus  causes  the  expression  of  a  truncated 
EGF  receptor.  The  formation  of  a  virus  containing 
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a  copy  of  the  truncated  receptor  gene  can  also 
occur.  This  truncated  gene  is  the  dominantly  ac- 
tive oncogene  erbB.  The  primary  disease  asso- 
ciated with  erbB  is  erythroblastosis.  However, 
mutations  in  the  carboxyl  terminus  of  erbB  that 
occur  during  viral  replication  cause  additional 
tumors — fibrosarcomas  and  angiosarcomas.  We 
are  studying  the  molecular  basis  for  the  change  in 
tissue  specificity  of  the  erbB  oncogene. 


The  approach  we  are  taking  is  to  construct  re- 
combinant viruses  containing  erbB.  The  advan- 
tage of  this  procedure  is  that  we  can  undertake  a 
systematic  analysis  of  the  effects  of  erbB  muta- 
tions on  the  tissue  specificity  of  tumor  formation. 
The  results  should  help  explain  why  erbB  causes 
tumors  in  one  tissue  but  not  in  others.  This  infor- 
mation will  be  useful  in  designing  strategies  for 
clinical  intervention  in  tumor  development. 
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WE  determine  and  study  the  three-dimen- 
sional structures  of  proteins  in  order  to  un- 
derstand their  folding,  structural  stability,  and 
function.  We  are  particularly  interested  in  pro- 
tein-pigment complexes  catalyzing  photochemi- 
cal energy  conversion,  energy  transfer,  and  elec- 
tron transfer.  Some  of  the  most  fascinating 
members  of  this  group  are  membrane-spanning 
and  membrane-associated  proteins. 

One  of  the  major  obstacles  on  the  way  to  a 
structure  analysis  by  x-ray  diffraction  is  the  ne- 
cessity to  grow  big,  well-ordered  single  crystals 
from  pure  samples  of  the  macromolecule  under 
investigation.  Although  numerous  proteins  have 
been  crystallized,  the  crystallization  process  is 
still  not  well  understood.  Successful  crystalliza- 
tions can  provide  guidelines,  but  every  different 
protein  presents  a  completely  new  problem,  and 
there  is  no  guarantee  that  good  crystals  will  grow. 

This  is  especially  true  for  proteins  that  are  em- 
bedded in  biological  membranes.  For  crystalliza- 
tion they  have  to  be  removed  from  their  natural 
environment  with  detergents.  The  choice  of  de- 
tergent is  critical;  moreover,  the  most  successful 
crystallizations  of  membrane  proteins  were  per- 
formed in  the  presence  of  a  detergent  mixed  with 
a  so-called  small  amphiphile.  Thus,  when  trying 
to  crystallize  membrane  proteins,  we  have  to  ma- 
nipulate a  complex  mixture  of  aqueous  buffer, 
detergents,  small  amphiphiles,  precipitant,  and 
our  protein.  This  complexity  is  one  of  the  reasons 
the  three-dimensional  analysis  of  membrane  pro- 
teins is  progressing  so  slowly. 

Cytochrome  bcj  Complexes 

The  chemical  reactions  during  photosynthesis 
can  be  divided  into  "light"  and  "dark,"  with  the 
former  reactions  capturing  light  energy  and  stor- 
ing it,  mainly  in  the  form  of  a  proton  gradient 
across  a  photosynthetic  membrane.  This  gradient 
is  then  used  for  making,  e.g.,  ATP,  the  universal 
"energy  currency"  of  living  cells. 

The  photosynthetic  light  reactions  in  purple 
bacteria  require  three  types  of  molecules:  photo- 


synthetic reaction  centers  (RCs),  cytochrome  be, 
complexes,  and  cytochrome  C2.  The  first  two  are 
integral  membrane  proteins,  and  the  third  is  a 
small  soluble  electron-transport  protein. 

The  RCs  perform  the  first  electron  transfer  step 
of  the  light  reactions.  A  pair  of  chlorophyll  mole- 
cules absorb  a  photon  and  subsequently  release 
one  of  its  electrons,  which  moves  within  the  RC 
through  the  membrane  to  a  quinone  molecule. 
This  electron  transfer  process  must  happen  twice 
in  succession  so  that  the  quinone  receives  two 
electrons.  The  charged  quinone  picks  up  two 
protons  and  dissociates  from  the  RC  as  a  quinol, 
which  migrates  along  the  membrane  to  the  cy- 
tochrome be,  complex. 

This  complex  removes  the  electrons  and  pro- 
tons from  the  quinol  and  transfers  them,  with  two 
additional  protons  from  the  cytoplasm,  back 
through  the  membrane.  The  electrons  are  shut- 
tled back  to  the  RC,  where  they  can  again  enter 
the  cyclic  light-driven  transfer  process;  the  pro- 
tons accumulate  on  the  outside  of  the  membrane, 
building  up  the  desired  proton  gradient.  The 
structure  analysis  at  2.3  A  resolution  of  the  RC 
from  the  purple  bacterium  Rhodopseudomonas 
viridis  has  contributed  significantly  to  our  un- 
derstanding of  the  first  half  of  this  cyclic  process. 
Determination  of  the  three-dimensional  structure 
of  a  cytochrome  bcj  complex  would  similarly 
elucidate  the  second  half. 

Complexes  of  the  bcj  type  play  a  crucial  role 
not  only  in  photosynthesis  but  also  in  cell  respira- 
tion. For  example,  the  mitochondria  of  all  higher 
organisms  have  such  complexes  in  their  inner 
membrane.  The  photosynthetic  purple  bacteria 
have  the  simplest  bcj  complexes,  consisting  of 
only  three  different  protein  subunits,  with  three 
heme  groups  and  an  iron-sulfur  cluster  as  cofac- 
tors.  They  occur  in  the  bacterial  inner  membrane 
at  a  concentration  significantly  lower  than  that  of 
RCs,  and  are  therefore  more  difficult  to  isolate. 
The  research  group  of  David  Knaff  at  Texas  Tech- 
nical University,  Lubbock,  recently  succeeded  in 
isolating  the  be,  complex  from  Rhodospirillum 
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rubrum.  In  collaboration  with  Knalf  s  group,  we 
prepared  this  complex  to  high  purity  and  in  suffi- 
cient quantity  to  start  crystallization  experi- 
ments, which  are  now  in  progress. 

DNA  Photolyase 

Light  energy  plays  an  important  role  in  reac- 
tions other  than  photosynthesis.  An  example  of 
the  use  of  light  energy  to  drive  a  chemical  reac- 
tion is  found  in  the  molecular  machinery  that  en- 
ables cells  to  repair  DNA  damaged  by  ultraviolet 
light.  One  type  of  damage  frequently  caused  by 
ultraviolet  irradiation  is  the  crosslinking  of  two 
neighboring  thymine  bases  in  a  DNA  strand.  Most 
of  these  crosslinks  are  in  the  form  of  a  cyclobu- 
tane  ring  connecting  four  carbon  atoms,  two  from 
each  thymine  base.  An  enzyme,  DNA  photolyase, 
can  locate  and  bind  to  such  lesions  and,  upon 
input  of  light  of  suitable  wavelength,  cleave  the 
carbon-carbon  bonds  between  the  bases,  repair- 
ing the  damage. 

DNA  photolyase  has  been  found  in  prokaryotes, 
eukaryotes,  and  archaebacteria.  The  enzyme  from 
Escherichia  coli  has  been  sequenced,  overex- 
pressed,  and  purified  in  the  laboratory  of  Aziz 
Sancar  at  the  University  of  North  Carolina,  Chapel 
Hill.  It  consists  of  a  single  polypeptide  chain  of 
471  amino  acids  and  two  cofactors — a  flavin- 
adenine  dinucleotide  (FAD)  and  5, 10-methenyl- 
tetrahydrofolate.  The  FAD  cofactor  fully  reduced 
to  FADH2  is  essential  for  the  enzyme's  function; 
the  folate's  role  appears  to  be  that  of  a  light- 
harvesting  antenna. 

Despite  significant  problems  with  the  en- 
zyme's tendency  to  denature,  we  were  able  to 
crystallize  DNA  photolyase  from  E.  coli.  The  crys- 
tals, obtained  after  repeated  seeding  of  crystalli- 
zation solutions,  are  plates  less  than  0.05  mm 
thick,  but  they  diffract  x-rays  to  at  least  2 .8  A  reso- 
lution. That  should  suffice  to  construct  an  atomic 
model  of  this  interesting  enzyme  after  the  phase 


problem  has  been  solved.  Collection  of  x-ray  dif- 
fraction intensity  data  and  experiments  to  bind 
heavy  atom  compounds  to  the  protein  in  the  crys- 
tal are  under  way.  We  also  are  trying  to  crystallize 
the  enzyme  in  complex  with  a  substrate,  a  five- 
nucleotide  piece  of  single-stranded  DNA  contain- 
ing a  crosslinked  pair  of  thymine  bases.  These  ex- 
periments have  to  be  done  in  the  dark,  or  under 
yellow  or  red  light,  to  prevent  the  enzyme  from 
repairing  and  releasing  the  substrate. 

Since  light  is  an  essential  ingredient  of  the  enzy- 
matic reaction  of  DNA  photolyase,  the  enzyme- 
substrate  complex  will  be  a  suitable  system  in 
which  to  study  the  time  course  of  the  reaction, 
using  the  Laue  technique.  This  utilizes  the  broad 
spectrum  of  x-ray  wavelengths  in  a  synchrotron's 
powerful  beam  to  record  within  a  very  short  time 
a  sufficient  fraction  of  a  crystal's  diffraction  pat- 
tern for  structural  interpretation.  In  such  an  ex- 
periment the  crystal  is  irradiated  with  a  light 
pulse  suitable  to  trigger  the  enzymatic  reaction, 
and  diffraction  patterns  are  recorded  at  different 
times  after  the  pulse.  Data  from  such  an  experi- 
ment can  provide  snapshots  of  the  structural  rear- 
rangements during  the  reaction,  thus  contribut- 
ing to  the  understanding  of  the  enzyme's 
mechanism. 

Other  Projects 

In  addition  to  the  two  projects  described 
above,  we  are  doing  purification  and  crystalliza- 
tion experiments  on  several  other  proteins.  The 
subjects  of  these  studies  include  the  catalytic  do- 
main of  human  HMG-CoA  reductase,  a  key  en- 
zyme in  the  synthesis  of  cholesterol  and  a  likely 
target  for  drugs;  human  synapsin  I,  a  protein  bind- 
ing to  synaptic  vesicles  and  mediating  their  re- 
lease; the  small  GDP-binding  protein  smgp25A 
from  bovine  brain;  the  DNA-binding  protein  myo- 
genin;  the  SecA  protein  from  E.  coli,  one  of  the 
key  parts  of  the  protein  export  system;  and  mam- 
malian phosphofructokinase. 
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ONE  remarkable  feature  of  the  immune  sys- 
tem is  its  ability  to  recognize  and  respond  to 
an  extraordinarily  large  variety  of  foreign  mole- 
cules, or  antigens.  This  exquisite  specificity  is 
mediated  through  protein  receptors  that  bind 
tightly  to  specific  antigens.  Such  receptors  are 
found  on  the  surfaces  of  two  types  of  immune 
cells:  B  cells  and  T  cells. 

Antibody  molecules,  or  immunoglobulins,  rep- 
resent one  type  of  antigen  receptor.  The  site  on 
the  antibody  molecule  that  binds  to  a  specific  an- 
tigen is  genetically  encoded  by  multiple,  short 
segments  of  DNA.  At  the  onset  of  development  of 
the  immune  system,  these  DNA  segments  are  lo- 
cated at  separate  places  in  the  genome;  during 
the  maturation  of  antibody-producing  cells  (B 
cells) ,  segments  are  joined  to  form  intact  immu- 
noglobulin genes.  In  addition  to  antibody,  there 
is  another  class  of  antigen  receptor  that  is  found 
on  the  surfaces  of  cells  that  mediate  cellular  im- 
munity (T  cells).  The  T  cell  receptor's  antigen- 
binding  site,  like  that  of  the  antibody  molecule,  is 
encoded  by  multiple  DNA  segments  that  are 
brought  together  during  T  cell  maturation.  After 
their  genes  are  assembled,  these  receptor  mole- 
cules are  expressed  at  the  surfaces  of  B  and  T 
cells,  where  specific  interactions  between  recep- 
tor and  antigen  trigger  division  and  further  matu- 
ration of  B  and  T  cells. 

The  total  number  of  immunoglobulin  or  T  cell 
receptor  gene  segments  is  large,  but  when  any 
particular  immunoglobulin  or  T  cell  receptor 
gene  is  assembled,  only  a  handful  of  segments  are 
selected  and  joined.  As  a  result,  many  different 
combinations  of  segments  are  possible.  It  is  this 
shuffling  of  small  bits  of  DNA  that  generates 
much  of  the  diversity  of  the  immune  response. 

Antibody  Gene  Rearrangement 

The  rearrangement  of  antibody  genes  does  not 
occur  haphazardly  during  development,  but  is  a 
well-orchestrated  process  in  which  some  seg- 
ments are  joined  first  and  others  later.  The  avail- 
able evidence  suggests  that  this  orchestration  is 
accomplished  by  mechanisms  related  to  those 
that  regulate  gene  expression.  Antibodies  are 


made  of  two  kinds  of  protein  chains — heavy  and 
light.  The  genes  that  encode  the  heavy  chains  are 
split  into  three  segments — V,  D,  and  J — that  are 
brought  together  by  two  rearrangements.  First  a  D 
segment  joins  to  a  J  segment;  then  a  V  segment  is 
fused  to  the  DJ  element. 

It  has  been  known  for  several  years  that  incom- 
pletely assembled  antibody  genes,  the  DJ  ele- 
ments, are  transcribed  into  RNA.  We  have  re- 
cently worked  out  some  of  the  details  of  this 
process.  Each  D  segment  carries  a  DNA  sequence 
that  supports  initiation  of  transcription,  a  pro- 
moter sequence.  The  promoter  is  inactive,  how- 
ever, unless  it  is  near  a  second  regulatory  ele- 
ment, an  enhancer.  When  a  D  segment  joins  to  a  J 
segment,  the  D  promoter  is  brought  near  the  en- 
hancer region,  and  transcription  is  initiated.  The 
promoters  that  drive  expression  of  completely  as- 
sembled antibody  genes — the  promoters  that  lie 
upstream  of  V  segments — are  activated  in  much 
the  same  way,  but  the  V  and  D  promoters  are 
structurally  quite  dissimilar.  Thus,  at  the  level  of 
DNA-protein  interactions,  the  D  promoters  are 
likely  to  be  regulated  differently  from  the  V  pro- 
moters. An  understanding  of  these  differences 
will  shed  light  on  regulatory  mechanisms  that  are 
in  force  during  the  earliest  stages  of  B  cell 
development. 

Although  specific  DNA  rearrangements  play  a 
central  role  in  the  development  of  the  immune 
system,  we  know  little  about  the  mechanics  of 
this  process.  We  have  taken  two  approaches  to 
the  problem.  One  has  been  to  analyze  the  prod- 
ucts of  rearrangement  and  the  DNA  sequences 
that  mediate  the  reaction.  The  second  has  been  to 
determine  how  these  DNA  sequences  are  recog- 
nized by  the  machinery  that  carries  out 
recombination. 

To  facilitate  our  study  of  antibody  gene  rear- 
rangement, we  have  made  artificial  DNA  mole- 
cules that  contain  unrecombined  antibody  gene 
segments.  By  simple  tests,  we  can  detect  rear- 
rangements of  these  artificial  substrates  after  they 
are  introduced  into  immature  B  cells.  By  follow- 
ing the  fates  of  these  engineered  molecules  we 
have  been  able  to  infer  some  general  features  of 
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antibody  gene  assembly.  Rearrangement  of  anti- 
body and  T  cell  receptor  genes  is  mediated  by 
conserved  DNA  sequences  (signal  sequences) 
that  lie  next  to  the  gene  segments,  near  the  sites 
of  recombination.  It  appears  that  during  the  ini- 
tial stages  of  rearrangement,  two  antibody  seg- 
ments are  paired  and  then  cleaved  to  form  an  in- 
termediate in  which  four  DNA  ends  (the  ends  of 
the  gene  segments  and  the  ends  of  their  flanking 
signal  sequences)  are  held  near  each  other  by  the 
recombinational  machinery.  One  of  our  aims  is  to 
test  this  idea  by  recovering  such  intermediates 
from  B  cells  that  carry  model  recombinational 
substrates. 

Essential  to  an  understanding  of  B  and  T  cell 
development  is  a  molecular  description  of  anti- 
body and  T  cell  receptor  gene  rearrangement. 
Our  approach  to  this  problem  began  with  the 
guess  that  the  recombinational  signal  sequences 
represent  parts  of  a  scafi'old  on  which  the  recom- 
binational machinery  is  assembled.  We  examined 
extracts  of  immature  B  and  T  cells  for  proteins 
that  could  bind  specifically  to  these  signals.  Our 
search  uncovered  a  specific  DNA-binding  protein 
whose  target  coincides  precisely  with  one  of  the 
recombinational  signals:  a  conserved  9-base  pair 
(bp)  DNA  segment  that  is  needed  for  efficient 
rearrangement  of  antibody  genes.  At  the  same 
time,  other  laboratories  identified  at  least  three 
genes  whose  products  likely  play  a  role  in  rear- 
rangement. We  are  now  examining  the  relation- 
ship between  these  gene  products  and  the  DNA- 
binding  protein  that  we  have  purified,  with  the 
goal  of  understanding  what  the  recombinational 
machinery  is  and  how  it  works. 

Lymphocyte  Activation 

A  second  area  of  work  in  the  laboratory  ad- 
dresses a  different  problem:  the  processes  by 
which  specific  antigen  and  growth  factors  trigger 
the  activation  of  B  and  T  cells.  We  have  set  out  to 
identify  molecules  that  participate  in  these  pro- 
cesses, with  the  goal  of  understanding  how  these 
molecules  function  in  the  generation  of  an  im- 
mune response.  We  have  begun  by  focusing  on  a 
group  of  enzymes  called  tyrosine  kinases.  Bio- 
chemical evidence  has  long  suggested  that  these 
enzymes  might  be  intimately  involved  in  B  and  T 
cell  activation,  but  the  molecules  themselves 
have  proven  elusive.  Among  genes  expressed  in 


interacting  B  and  T  cells  we  have  found  one  that 
encodes  a  novel  tyrosine  kinase  that  is  homolo- 
gous to  the  cellular  proto-oncogene  c-src.  Unlike 
c-src,  however,  this  gene  shows  a  strikingly  re- 
stricted pattern  of  expression,  being  transcribed 
preferentially  in  B  cells  and  in  developmental 
precursors  of  B  cells.  We  have  accordingly  called 
this  gene  blk  (B  lymphoid  kinase) .  The  blk  gene 
is  activated  early  in  B  cell  development  and  is 
expressed  along  with  several  proteins  that  are 
known  to  transmit  signals  across  the  B  cell  mem- 
brane, including  the  antibody  heavy  chain.  When 
B  cells  develop  into  antibody-secreting  cells  or 
plasma  cells,  they  no  longer  express  these  signal 
transmission  proteins  and  also  stop  expressing 
blk.  This  suggests  that  the  product  of  the  blk  gene 
interacts  with  a  receptor  that  spans  the  B  cell 
membrane  and  that  senses  the  presence  of  anti- 
gen or  a  specific  growth  signal. 

How  might  extracellular  signals  activate  the 
blk  kinase?  Our  recent  results  suggest  that  the  blk 
kinase  is  inactivated  by  addition  of  a  chemical 
group  to  a  specific  site  on  the  kinase  molecule; 
the  enzyme  is  activated  when  this  chemical 
group  is  removed.  If  the  target  site  on  the  blk 
kinase  is  mutated  so  that  it  cannot  accept  the  in- 
activating chemical  group,  the  enzyme  is  locked 
into  the  "on"  state.  Expression  of  this  mutant  ki- 
nase in  certain  types  of  cells  leads  to  alterations  in 
growth.  By  concentrating  on  the  specific  extra- 
cellular signals  that  turn  the  blk  kinase  on,  we 
hope  to  identify  the  pathways  in  which  blk 
functions. 

Aside  from  its  likely  role  in  B  cell  signaling,  blk 
provides  a  window  into  how  gene  expression  is 
restricted  to  specific  cell  types  during  develop- 
ment. To  date,  most  work  on  regulated  expres- 
sion in  B  cells  has  focused  on  the  immunoglobu- 
lin genes.  The  pattern  of  expression  of  blk  among 
B  cell  lines  differs  from  the  pattern  of  expression 
of  immunoglobulin  genes.  The  blk  gene  occu- 
pies approximately  30  kbp  of  DNA  on  mouse 
chromosome  14.  Its  unusual  structure  carries 
two  transcriptional  promoters.  One  of  these  is 
preferentially  used  early  in  B  cell  development, 
while  both  promoters  are  used  at  later  stages.  We 
expect  that  this  temporal  regulation  reflects  dif- 
ferences in  the  arrays  of  DNA-protein  complexes 
that  drive  transcription  of  blk  in  immature  and 
mature  B  cells,  and  we  now  wish  to  explore  these 
differences  in  detail. 
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Double  staining  of  wild- type  and  hunchback-mw^aw?  embryos, 
using  an  anti-Fushi  tarazu  (Ftz)  antibody  and  a  DGG-labeled 
probe  for  lacZ  mRNA. 

1.  The  wild-type  embryo  exhibits  a  normal  seven-stripe  pat- 
tern of  Ftz  expression  and  shows  the  expression  of  a  reporter 
lacZ  gene  in  the  anterior  part  of  the  embryo.  This  reporter  gene 
is  controlled  by  the  hsplO  TATA  box  promoter  containing  arti- 
ficial binding  sites  for  both  the  bicoid  and  hunchback  gene 
products. 

2.  In  the  hunchback- mwtow?  embryo,  the  pattern  o/fushi  tar- 
azu is  disrupted  in  the  anterior  part,  while  the  reporter  is  not 
affected.  As  discussed  in  the  text,  this  points  to  a  synergistic 
activation  of  the  reporter  gene  by  the  product  of  the  maternal 
gene  bicoid  and  the  maternal  component  o/hunchback. 

Research  of  Claude  Desplan. 
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ALL  information  required  for  the  develop- 
ment of  a  fertilized  egg  into  a  complete  or- 
ganism is  contained  in  its  own  genetic  material, 
contributed  by  both  parents,  and  in  products  pro- 
vided by  the  mother  as  the  egg  is  formed.  Genetic 
studies  on  the  fruit  fly  Drosophila  have  identi- 
fied most  of  the  genes  involved  in  the  process  of 
pattern  formation.  Many  of  these  genes  appear  to 
encode  transcription  factors  that  contain  con- 
served domains  such  as  homeodomains  or  zinc 
finger  domains.  We  believe  the  zygotic  genome 
responds  to  maternal  organizing  factors  through  a 
network  of  transcriptional  regulators  to  set  up  the 
body  pattern  of  the  embryo.  Since  several  genes 
expressed  during  mammalian  embryogenesis 
share  homology  with  developmental  genes  of  the 
fly,  it  is  likely  that  the  mechanisms  uncovered  are 
of  general  significance  for  the  development  of 
multicellular  organisms. 

The  goal  of  our  laboratory  is  to  understand  the 
mechanisms  involved  in  these  regulatory  interac- 
tions. In  light  of  the  complexity  of  the  system,  a 
productive  approach  is,  first,  to  characterize  the 
molecular  interactions  in  vitro  and,  second,  to 
design  in  vivo  systems  to  test  models  of  regula- 
tory interactions  consistent  with  properties  un- 
covered through  the  first  approach.  We  are  inves- 
tigating the  mechanism  of  action  of  two  classes  of 
developmental  gene  products  containing  either  a 
homeodomain  (HD)  or  zinc  finger  DNA-binding 
motifs.  These  motifs  are  known  to  bind  to  DNA 
and  to  target  regulatory  proteins  to  their  down- 
stream genes. 

Interactions  Among  Gap  Genes 

The  first  zygotic  genes  to  be  expressed  in  the 
Drosophila  embryo  are  the  gap  genes.  Their  role 
is  to  read  and  interpret  coarse  positional  informa- 
tion deposited  in  the  egg  by  the  mother  and  to 
refine  this  information  by  cross-regulatory  inter- 
actions. Three  gap  genes,  Kriippel  (Kr),  hunch- 
back {hb),  and  knirps  {kni),  have  been  analyzed 
molecularly.  All  contain  zinc  finger  motifs  char- 
acteristic of  many  eukaryotic  DNA-binding  pro- 
teins. This  homology  suggests  that  they  act  as  reg- 
ulators of  transcription. 


A  relatively  simple  circuit  appears  to  control 
expression  of  hb.  It  is  activated  in  response  to  the 
homeodomain-containing  product  of  the  mater- 
nal gene  bicoid  (bed)  and  is  repressed  by  the 
neighboring  gap  gene  Kr.  The  hb  promoter  con- 
tains DNA-binding  sites  for  the  products  of  the 
bed,  Kr,  and  hb  genes  (Bed,  Kr,  and  Hb),  which 
have  been  defined  in  vitro.  Another  zygotic  gene 
involved  early  in  the  development  of  the  head  is 
orthodentiele  (otd) .  Its  product  (Otd)  contains 
an  HD  that  recognizes  the  same  sequence  as  the 
Bed  protein  and  may  be  part  of  the  same  regula- 
tory network  in  the  anterior  part  of  the  embryo. 

Hb,  Kr,  Otd,  and  Bed  act  very  early,  when  the 
fly  embryo  is  still  a  syncytium  and  when  few 
other  proteins  are  expressed.  To  study  the  molec- 
ular interactions  among  the  genes  of  origin,  we 
have  introduced  into  transgenic  flies  artificial 
genes  containing  combinations  of  the  binding 
sites  for  Kr,  Hb,  or  Bcd/Otd  in  promoters  driving 
the  reporter  gene  jS-galactosidase.  The  patterned 
expression  of  the  reporter  gene  leads  to  the  con- 
clusion that  Bed  is  not  sufficient  for  expression  in 
the  anterior  part  of  the  embryo,  despite  strong 
Bed-binding  sites.  Addition  of  Hb-binding  sites  is 
required  to  obtain  a  promoter  whose  expression 
is  maintained  in  the  anterior  part,  mimicking  that 
of  hb.  Our  work  points  to  a  synergistic  activation 
of  the  promoter  by  Bed  and  the  maternal  compo- 
nent of  hb. 

The  DNA-binding  Specificity  of  the 
Homeodomain 

Over  20  genes  involved  in  development  en- 
code HD-containing  proteins.  The  HD  includes  a 
domain  similar  to  the  helix-turn-helix  motif  of 
many  prokaryotic  DNA-binding  proteins.  Our 
analysis  of  the  function  of  the  HD  has  led  us  to 
describe  how  we  can  change  the  DNA-binding 
specificity  of  one  HD  to  another,  simply  by  re- 
placing a  single  amino  acid  at  the  critical  position 
9  of  the  recognition  helix.  As  we  proposed,  struc- 
tural analysis  of  HD-DNA  complexes  later  showed 
that  the  interaction  between  DNA  and  the  HD  is 
not  analogous  to  that  of  the  prokaryotic  helix- 
turn-helix  proteins,  but  instead  involves  a  new 
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mode  of  recognition.  In  this  mode  amino  acid  9 
fits  in  the  major  groove  of  DNA,  in  close  contact 
with  the  base  pairs  recognized. 

The  Paired  Gene  Encodes  a  Multifunctional 
Transcription  Factor 

'-rThe  product  of  the  pair-rule  gene  paired  (Prd) 
contains  not  only  an  HD  but  also  other  domains 
conserved  in  evolution  that  are  found  in  develop- 
mental genes.  We  have  shown  that  the  128- 
amino  acid  Paired  domain  is  also  DNA  binding, 
making  the  Prd  protein  a  bifunctional  DNA-bind- 
ing  transcription  factor.  Although  both  the  Paired 
domain  and  the  HD  can  bind  to  DNA  indepen- 
dently, the  promoter  of  another  pair-rule  gene 
has  revealed  sites  that  are  bound  cooperatively  by 
Prd  when  both  domains  are  present  in  the  same 
protein.  Extensive  mutagenesis  analysis  of  the 
protein  has  allowed  the  definition  of  at  least  four 
subfunctions  in  Prd. 

Transcriptional  Functions  of  the 
Homeodomain  Proteins  Fushi  tarazu 
and  Engrailed 

In  order  to  understand  the  mechanism  by 
which  proteins  with  the  same  DNA-binding  speci- 
ficity can  have  different  developmental  func- 
tions, we  have  analyzed  the  roles  of  two  proteins 
binding  to  the  same  target  site  in  an  in  vitro  tran- 
scription system.  We  have  shown  that  the  Fushi 
tarazu  (Ftz)  protein  activates  in  vitro  transcrip- 
tion in  a  binding  site-dependent  manner.  This 
activation  is  prevented  by  the  addition  of  En- 


grailed (En) ,  which  competes  with  Ftz  for  bind- 
ing to  the  same  sites.  When  more  En  protein  is 
added,  a  further  repression  is  mediated  by  En 
binding  to  the  basal  promoter.  We  have  shown 
that,  in  this  case.  En  acts  by  binding  to  the  TATA 
box  and  competing  with  the  TATA  box-binding 
factor  TFIID.  Formation  of  a  committed  complex 
by  preincubation  of  the  promoter  with  TFIID 
prevents  both  the  repression  by  En  and  its  bind- 
ing to  the  TATA  box. 

Targets  of  the  Homeodomain 
Developmental  Gene  engrailed 

The  HD  transcription  factor  Engrailed  belongs 
to  a  small  set  of  genes  required  for  pattern  forma- 
tion within  each  embryonic  segment  and  within 
imaginal  discs.  We  are  using  the  "enhancer  trap" 
technique  and  a  +/—  screen  for  cDNA  from  undif- 
ferentiated imaginal  disc  cells  in  culture  to  iden- 
tify target  genes  affected  by  engrailed.  In  this  ap- 
proach, we  want  to  obtain  the  effector  genes 
essential  to  the  definition  of  cellular  and  com- 
partmental  identity  in  discs.  These  genes  may 
prove  to  be  components  of  signaling  pathways 
relevant  to  both  early  and  late  development. 

We  have  investigated  both  the  general  tran- 
scription functions  carried  out  by  the  HD  and 
zinc  finger  proteins  and  the  particular  properties 
of  each  gene  product.  Our  attention  is  now  fo- 
cused on  the  combinatorial  effects  of  such  factors 
in  vitro  and  in  vivo,  and  we  expect  to  acquire 
insight  into  the  mechanisms  controlling  the  coor- 
dinate expression  of  developmental  genes. 
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MORE  than  a  billion  people  in  tropical  areas 
of  the  world  endure  parasitic  infections 
during  most  of  their  life.  The  various  protozoan 
and  helminthic  parasites  that  cause  these  in- 
fectious diseases  have  evolved  a  variety  of  mecha- 
nisms for  evading  the  immune  response  of  their 
hosts.  Our  laboratory  studies  the  interactions  be- 
tween several  of  these  parasites  and  the  immune 
system,  in  anticipation  that  a  more  detailed  un- 
derstanding of  these  immune  evasion  mecha- 
nisms will  suggest  ways  in  which  to  combat  or 
prevent  the  infections.  The  parasites  that  cause 
three  of  these  tropical  diseases  are  described 
here. 

Leishmaniasis 

Leishmania  parasites  are  protozoan  pathogens 
found  in  many  tropical  countries,  where  they 
cause  a  spectrum  of  diseases  that  include  cutane- 
ous, mucocutaneous,  and  visceral  leishmaniasis. 
The  parasites  have  a  two-stage  life  cycle  that  oc- 
curs in  the  sandfly  vector  and  a  mammalian  host. 
Within  sandflies  they  exist  as  uniflagellar  pro- 
mastigotes  and  develop  from  a  less  infectious 
form  to  the  final  highly  infectious  form.  This  de- 
velopmental process  can  be  mimicked  during 
cultivation  of  promastigotes  in  liquid  medium. 
After  transmission  from  the  sandfly  to  the  mam- 
malian host,  promastigotes  are  phagocytosed  by 
host  macrophages,  where  they  reside  as  spherical 
amastigotes  in  acidic  phagolysosomes.  Thus  the 
parasites  evade  the  immune  response  by  "hid- 
ing" inside  macrophages — one  of  several  cell 
types  of  the  immune  system  that  normally  help  to 
destroy  foreign  pathogens  and  substances.  When 
a  sandfly  ingests  amastigote-laden  macrophages 
during  a  blood  meal,  the  life  cycle  is  completed. 
We  are  studying  how  these  organisms  are  able  to 
survive  in  hostile  environments  as  diverse  as  a 
sandfly  midgut  and  an  acidic  macrophage 
phagolysosome . 

A  major  glycoprotein  on  the  surface  of  both 
promastigotes  and  amastigotes  is  a  metallopro- 
tease  of  about  63  kDa  (gp63)  This  enzyme  proba- 
bly participates  in  the  uptake  of  the  promasti- 
gotes by  macrophages  and  contributes  to  the 


survival  of  the  amastigotes  within  them.  The 
amount  of  gp63  on  the  surface  of  promastigotes 
increases  about  10-fold  as  the  parasites  develop 
into  the  highly  infectious,  virulent  form  during 
growth  in  culture.  We  have  recently  discovered 
that  three  different  RNA  species  encoding  gp63 
occur  during  cultured  promastigote  growth.  One 
RNA  species  is  present  in  the  organisms  only  dur- 
ing the  early,  less  infectious  stages  of  cultured 
growth,  when  little  gp63  is  on  the  promastigote 
surface.  This  gp63  RNA  disappears  and  another 
gp63  RNA  species  appears  as  the  promastigotes 
become  more  infectious.  The  amount  of  this  sec- 
ond RNA  correlates  with  the  increased  amount  of 
gp63  protein.  The  third  gp63  RNA  species  is  con- 
tinuously present  at  a  low  level  throughout  cul- 
tured growth.  The  three  RNAs  encode  similar 
gp63  proteins  and  difl'er  primarily  in  the  nucleo- 
tide sequences  of  their  3'-untranslated  regions. 
They  are  derived  from  diff'erent  genes,  and  we  are 
determining  the  molecular  mechanisms  that  regu- 
late their  differential  expression.  In  addition,  we 
are  introducing  recombinant  DNAs  into  leish- 
mania to  amplify,  alter,  or  delete  individual  gp63 
genes  so  that  we  may  determine  the  functions  of 
each  of  the  three  gp63  proteases  during  the  pro- 
mastigote and  amastigote  stages. 

Trypanosomiasis 

African  trypanosomes  are  protozoan  parasites 
that  cause  sleeping  sickness  or  trypanosomiasis 
throughout  equatorial  Africa.  They  are  transmit- 
ted from  tsetse  flies  to  the  mammalian  blood- 
stream, where  they  continually  confront  the  hu- 
moral and  cellular  immune  systems.  Each 
trypanosome  in  the  bloodstream  is  covered 
by  about  10  million  copies  of  a  single  protein, 
the  variant  surface  glycoprotein  (VSG).  The  try- 
panosome population  survives  the  continuous 
immune  assault  because  individual  parasites  oc- 
casionally switch  spontaneously  from  the  expres- 
sion of  one  VSG  to  another — a  process  called  an- 
tigenic variation.  A  new  immune  response  must 
be  mounted  against  the  VSG  of  the  switched  para- 
site and  its  descendants,  enabling  the  trypano- 
some population  as  a  whole  to  stay  "one  step 
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ahead"  of  the  host  immune  defenses.  We  are  ex- 
amining the  events  at  the  DNA  and  RNA  level  that 
are  responsible  for  this  antigenic  variation. 

We  know  that  the  trypanosome  genome  con- 
tains about  1,000  different  genes  encoding  anti- 
genically  distinct  VSGs.  Usually  one,  and  only 
one,  of  these  VSG  genes  is  expressed  at  a  time. 
Rearrangements  and  duplications  of  these  genes 
are  partly  responsible  for  the  selection  of  the  VSG 
gene  to  be  transcribed  and  for  the  switch  event 
itself. 

The  rearrangements  maneuver  specific  VSG 
genes  into  and  out  of  special  chromosomal  loca- 
tions, called  expression  sites,  where  transcrip- 
tion occurs.  These  expression  sites  are  always  ad- 
jacent to  the  ends  of  the  chromosomes — the 
chromosomal  telomeres.  The  process  is  compli- 
cated by  the  fact  that  several,  and  perhaps  many, 
potential  expression  sites  exist  in  the  genome, 
yet  only  one  is  normally  activated  at  any  time. 
Our  goal  is  to  understand  this  activation  mecha- 
nism at  the  molecular  level.  In  one  project  we 
have  identified  a  protein  that  specifically  binds  to 
a  region  upstream  of  a  VSG  gene  in  an  expression 
site  activated  during  the  final  developmental 
stage  of  the  parasite  in  the  tsetse  fly,  i.e.,  the 
metacyclic  stage.  This  protein  may  contribute  to 
the  developmental  regulation  of  that  VSG  gene's 
transcription.  In  another  project  we  have  de- 
tected a  high  rate  of  mutation  in  the  telomere- 
linked  VSG  genes,  which  may  increase  the  effec- 


tiveness of  antigenic  variation  still  further.  A  third 
project  involves  characterization  of  a  group  of 
small  chromosomes  that  are  unique  to  African 
trypanosomes  and  contain  many  of  the  VSG  genes 
destined  for  sequential  expression. 

Onchocerciasis 

Onchocerca  volvulus  is  a  filarial  nematode 
that  causes  onchocerciasis,  or  river  blindness,  in 
Africa  and  Latin  America.  Female  O.  volvulus 
worms  grow  to  50  cm  in  length  and  reside 
throughout  the  body  within  nodules.  Here  they 
produce  thousands  of  microfilariae  each  day  that 
migrate  throughout  the  body  and  enter  the  eyes, 
where  they  produce  lesions  that  can  lead  to 
blindness.  It  is  not  known  how  any  of  the  develop- 
mental stages  of  the  parasite  in  the  host  evade  the 
immune  response.  The  parasites  are  difficult  to 
study  in  the  laboratory,  because  there  is  no  good 
experimental  animal  model;  they  infect  only  hu- 
mans and  chimpanzees.  We  have  constructed 
cDNA  expression  libraries  of  the  mRNAs  from  the 
infective  L3  stage  larvae  of  the  parasite,  and  from 
the  L4  stage  larvae  that  quickly  develop  after  in- 
fection, and  are  using  specific  cDNA  clones  to 
overproduce  parasite  antigens  from  both  stages 
that  are  recognized  by  antisera  from  onchocercia- 
sis patients.  This  approach  has  revealed  several 
proteins  that  are  unique  to  the  parasite  and  may 
be  valuable  for  improved  diagnosis,  treatment, 
and  prevention  of  the  disease. 
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Localization  by  immunofluorescence  microscopy  of  the  hnRNP  L  protein  on  the  lampbrush  chro- 
mosomes of  the  North  American  newt  Notophthalmus  viridescens.  The  intense  yellow  staining 
represents  a  high  concentration  of  the  L  protein  on  the  landmark  giant  loops.  This  is  the  first 
protein  localized  to  these  mysterious  segments  of  the  chromosome.  The  immunofluorescence  was 
performed  with  the  monoclonal  antibody  4D1 1,  in  collaboration  with  foe  Gall  of  the  Carnegie 
Institution. 

From  Pinol-Roma,  S.,  Swanson,  M.S.,  Gall,  f.G.,  and  Dreyfuss,  G.  1989.  J  Cell  Biol  109: 
2575-2587. 
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MESSENGER  RNAs  (mRNAs),  the  functional 
translatable  intermediates  of  gene  expres- 
sion, are  formed  in  the  nuclei  of  eukaryotic  cells 
by  extensive  and  tightly  regulated  post-transcrip- 
tional processing  of  primary  RNA  polymerase  II 
transcripts.  These  transcripts  are  termed  hetero- 
geneous nuclear  RNAs  (hnRNAs),  which  de- 
scribes their  variable  size  and  cellular  localiza- 
tion. It  is  possible  that  only  a  subset  of  hnRNAs 
are  actually  precursors  of  mRNA  and  that  the  rest 
turn  over  in  the  nucleus. 

The  collective  term  for  the  proteins  that  bind 
hnRNAs,  and  that  are  not  stable  components  of 
other  classes  of  rihonucleoprotein  (RNP)  com- 
plexes, is  hnRNP  proteins.  They  are  significant  in 
that  they  are  bound  to  hnRNAs  and  thus  influence 
their  structure,  fate,  and  processing  into  mRNAs. 
The  hnRNP  proteins  are  also  as  abundant  in  grow- 
ing vertebrate  cells  as  histones,  and  their  com- 
plexes with  hnRNA  are  also  of  interest  as  major 
nuclear  structures. 

Once  formed,  the  mRNAs  are  transported  to  the 
cytoplasm  via  nuclear  pores — a  little-known  pro- 
cess that  appears  to  be  one  of  the  most  important 
regulatory  steps  in  the  post-transcriptional  path- 
way of  gene  expression.  In  the  cytoplasm,  mRNP 
proteins  are  likely  to  be  involved  in  regulating 
the  translation,  stability,  and  localization  of 
mRNAs.  Our  goal  is  to  understand,  in  molecular 
detail  and  cellular  architecture,  how  the  post- 
transcriptional  portion  of  the  pathway  of  gene 
expression  operates  in  the  cell.  To  do  so  we  in- 
vestigate the  structure,  function,  and  localization 
of  the  hnRNP  and  mRNP  proteins  and  the  RNP 
complexes. 

We  have  identified  the  hnRNP  and  mRNP  pro- 
teins by  photochemical  RNA-protein  crosslinking 
in  intact  cells  and  have  produced  monoclonal  an- 
tibodies to  many  of  them.  The  antibodies  were 
used  to  develop  an  immunopurification  proce- 
dure for  hnRNA-hnRNP  complexes  from  verte- 
brate cells  and  to  begin  characterizing  the  pro- 
teins. Human  cells  have  provided  the  most 
detailed  picture  of  the  protein  composition  of 
these  complexes.  Considerable  information  is 
also  becoming  available  for  invertebrates,  partic- 


ularly Drosophila.  Immunopurified  complexes 
contain  large  RNA  of  up  to  10,000  nucleotides 
and  at  least  20  major  proteins,  designated  A-U,  in 
the  range  of  34-120  kDa.  There  are  also  many 
lower-abundance  hnRNP  proteins,  and  these  ap- 
pear to  bind  only  to  specific  subsets  of  hnRNAs. 
Nuclease  digestion  experiments  indicate  that  the 
hnRNP  proteins  are  not  simply  dispersed  on  the 
hnRNAs,  but  rather  are  organized  into  interacting 
units. 

The  molecular  cloning  and  sequencing  of 
cDNAs  for  several  RNP  proteins  led  to  the  discov- 
ery of  a  conserved  RNA-binding  domain  (RBD) 
and  a  rihonucleoprotein  consensus  sequence 
(RNP-CS).  The  RNP-CS  has  subsequently  been 
found  in  RNP  proteins  from  many  sources.  This 
octapeptide  sequence,  Lys/Arg-Gly-Phe/Tyr-Gly/ 
Ala-Phe-Val-X-Phe/Tyr,  is  the  most  highly  con- 
served segment  in  a  generally  conserved  region  of 
about  90-100  amino  acids  present  at  least  once 
in  many  RNA-binding  proteins  of  the  nucleus  and 
cytoplasm  in  organisms  as  evolutionarily  diver- 
gent as  yeast,  plants,  and  humans.  This  region, 
because  of  its  general  conservation  as  a  unit,  was 
considered  to  be  an  RNA-binding  domain,  and  the 
RNP-CS  was  suggested  to  be  an  important  ele- 
ment in  the  RNA-binding  activity  of  the  RBD. 
These  predictions  have  turned  out  to  be  correct, 
and  the  predictive  value  of  consensus  sequences 
for  the  identification  of  proteins  as  having  RNA- 
binding  activities  has  proved  very  useful. 

We  have  produced  the  94-amino  acid  RBD  of 
the  hnRNP  CI  protein  and  purified  it.  The  three- 
dimensional  structure  of  this  domain  and  of  its 
complex  with  RNA  oligonucleotide  substrate  are 
being  determined. 

Experiments  on  mitotic  cells  unexpectedly 
provided  important  insights  into  the  assembly 
and  general  nature  of  hnRNP  complexes.  In  mito- 
sis, as  the  nuclear  envelope  breaks  down,  hnRNP 
proteins  disperse  throughout  the  cell,  but  they 
remain  associated  in  complexes  with  RNA.  After 
mitosis,  once  the  nuclear  envelope  reforms, 
preexisting  hnRNP  proteins  return  to  the  nu- 
cleus. Double-label  immunofluorescence  micros- 
copy with  monoclonal  antibodies  to  various 
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hnRNP  proteins  on  postmitotic  cells,  however, 
revealed  that  at  the  end  of  mitosis  the  hnRNP 
complexes  dissociate  in  the  cytoplasm  and  the 
different  proteins  are  transported  to  the  nucleus 
separately.  Some,  including  CI,  C2,  and  U,  like 
snRNP  proteins  and  lamins,  are  transported  imme- 
diately (early  group) ,  v^^hile  others,  including  Al , 
A2,  Bl,  B2,  E,  G,  H,  and  L,  are  transported  later 
(late  group).  Thus  immediately  following  reas- 
sembly of  the  nuclear  envelope  at  the  end  of  mi- 
tosis, pairs  of  cells  are  detected  in  which  some 
hnRNP  proteins  are  in  the  nucleus  and  others  in 
the  cytoplasm.  These  observations  show  that 
hnRNP  complexes  are  dynamic  structures  in  that 
hnRNP  proteins  can  dissociate  from  the  com- 
plexes and  return  to  the  nucleus  separately. 

It  is  possible  that  the  association  and  dissocia- 
tion of  the  complexes  are  controlled  by  post- 
translational  modifications.  An  interesting  candi- 
date effector  of  such  modifications  is  a  cell 
cycle-regulated  protein  kinase  that  phosphory- 
lates  the  hnRNP  C  proteins.  The  phosphorylation 
results  in  the  appearance  of  novel,  electrophoreti- 
cally  distinct  forms  of  the  proteins.  These  slower- 
migrating  forms,  which  we  term  Cs  proteins,  are 
readily  detectable  by  immunoblotting  with  anti- 
C  protein  monoclonal  antibodies. 

We  partially  purified  the  kinase  (termed  Cs  ki- 
nase) that  phosphorylates  the  C  proteins  to  form 
the  Cs  proteins  (several  hundredfold)  by  high- 
performance  liquid  chromatography  (HPLC). 
The  activity  of  this  novel  cell  cycle-regulated 
protein  kinase  is  increased  during  mitosis.  It  is 
distinct  from  the  histone  HI  kinase  component  of 
maturation-promoting  factor  (MPF/c<^c2) .  In  ad- 
dition to  the  Cs  kinase,  two  other  protein  kinases 
were  found  to  phosphorylate  the  C  proteins.  One 
of  these  is  the  previously  identified  casein  kinase 
type  II.  The  phosphorylated  residues  are  serines, 
but  the  specific  sites  on  the  C  proteins  that  are 


phosphorylated  by  the  three  kinases  have  not  yet 
been  determined. 

Cs  kinase  activity  was  also  detected  in  extracts 
of  the  yeast  Schizosaccheromyces  pombe.  This 
should  help  in  determining  the  relationship  be- 
tween Cs  kinase  and  MPY/cdc2  and  in  under- 
standing the  function  of  the  Cs  kinase.  It  may  also 
make  it  possible  to  identify  the  C  proteins  of  S. 
pombe.  Among  the  likely  possible  functions  of 
the  phosphorylations  of  the  C  proteins  are  a  role 
in  the  assembly/disassembly  of  hnRNP  com- 
plexes, an  effect  on  the  RNA-binding  activity  of 
these  proteins,  and  a  role  in  the  transport  of  the  C 
proteins  to  the  nucleus. 

Immunofluorescence  microscopy  with  mono- 
clonal antibodies  to  hnRNP  proteins  and  photo- 
chemical crosslinking  of  RNAto  proteins  in  living 
cells  by  ultraviolet  light  indicated  that  the  A,  B,  C, 
L,  M,  and  U  proteins  are  nucleoplasmic — that  is, 
they  are  nonchromatin  proteins  that  are  confined 
to  the  nucleus  and  excluded  from  nucleoli.  The 
mRNP  proteins  that  have  been  characterized  so 
far  are  different  from  the  known  hnRNP  proteins 
and  are  confined  to  the  cytoplasm.  Therefore  the 
mRNAs  must  exchange  most,  if  not  all,  of  the  pro- 
teins with  which  they  are  associated  in  the  nu- 
cleus as  they  are  transported  to  the  cytoplasm. 

The  dissociation  of  these  proteins  from  the  nu- 
clear mRNA  and  the  subsequent  binding  of  mRNP 
proteins  must  be  an  important  aspect  of  nucleo- 
cytoplasmic  transport.  Understanding  the  mecha- 
nism that  regulates  the  disassembly  of  hnRNP 
complexes  prior  to  the  transport  of  their  proteins 
to  the  nucleus  following  mitosis  may  also  be  im- 
portant for  understanding  this  nucleocytoplas- 
mic  transport  process.  The  most  abundant  pro- 
tein that  is  bound  to  mRNA  in  the  cytoplasm  is  the 
poly  (A) -binding  protein.  We  have  isolated  cDNA 
clones  for  it  from  several  organisms,  determined 
its  sequence,  and  produced  antibodies  to  it.  We 
are  currently  studying  its  function. 
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THE  response  of  cells  to  the  polypeptide  hor- 
mone insulin  begins  with  the  binding  of  insu- 
lin to  a  specific  cell  surface  receptor;  this  results 
in  the  rapid  autophosphorylation  of  the  cytoplas- 
mic protein  tyrosine  kinase  domain  of  the  recep- 
tor specifically  on  several  tyrosine  residues.  Al- 
though the  postreceptor  mechanisms  involved 
in  the  mediation  of  the  multitude  of  physiologi- 
cal responses  to  insulin  remain  largely  obscure, 
this  initial  transmembrane  signaling  event  is  re- 
quired for  all  known  insulin-dependent  cellular 
responses. 

To  begin  to  relate  the  structural  features  of  the 
insulin  receptor  protein  to  the  details  of  its  func- 
tion, we  have  established  two  experimental  ap- 
proaches. First,  the  availability  of  cDNAs  that  en- 
code the  insulin  receptor  protein,  together  with 
methods  to  introduce  mutations  into  receptor 
coding  sequences  and  to  express  these  altered  re- 
ceptors in  heterologous  cell  expression  systems, 
has  provided  the  tools  with  which  to  test  ideas 
concerning  the  functional  roles  of  particular  re- 
ceptor residues.  Second,  the  size  and  membrane- 
associated  nature  of  the  protein  have  made  it 
possible  to  exploit  the  relatively  simple  trans- 
membrane topology  of  the  receptor  to  study  its 
two  functional  domains  individually  as  soluble 
molecules:  the  extracellular  domain  is  secreted 
as  a  heterotetramer  that  binds  insulin  with  high 
affinity,  and  the  cytoplasmic  domain  is  an  active 
protein  tyrosine  kinase. 

The  extracellular  ligand-binding  domain  of  the 
insulin  receptor  is  a  complex  molecule:  each 
half- receptor  comprises  929  residues  derived 
from  both  a-  (735  amino  acids)  and  (8-subunits 
(194  amino  acids)  and  includes  16  potential 
asparagine-linked  glycosylation  sites  and  4 1  cys- 
teines. There  is  little  information  at  present  as  to 
how  this  domain  folds  during  biosynthesis  or 
how  it  interacts  with  insulin.  The  study  of  an  ex- 
tensive series  of  deletion  mutants  has  revealed 
sites  within  the  receptor  primary  sequence  at 
which  truncation  results  in  the  generation  of  in- 
dependently folded  soluble  subdomains.  These 
sites  now  provide  landmarks  that  guide  further 
biochemical  and  molecular  dissections  of  this 


complex  domain.  Furthermore,  the  establish- 
ment of  a  heterologous  cell  expression  system 
that  provides  tens  of  milligrams  of  a  secreted  solu- 
ble derivative  of  the  extracellular  domain  pro- 
vides sufficient  protein  for  its  further  biochemi- 
cal dissection  and  renders  feasible  efforts  to 
crystallize  this  functional  domain  of  the  receptor, 
a  prerequisite  for  the  elucidation  of  its  three-di- 
mensional structure  by  x-ray  crystallography,  in 
collaboration  with  Wayne  Hendrickson  (HHMI, 
Columbia  University  College  of  Physicians  and 
Surgeons) . 

The  cytoplasmic  protein  tyrosine  kinase  do- 
main of  the  receptor  has  been  expressed  indepen- 
dently of  the  cell  membrane  and  as  a  soluble 
monomeric  insulin-independent  enzyme  (401 
amino  acids) .  To  be  relevant  to  the  study  of  the 
function  of  the  kinase  in  the  wild-type  mem- 
brane-associated insulin  receptor,  the  soluble  ki- 
nase must  exhibit  functional  properties  charac- 
teristic of  the  wild-type  receptor.  In  fact,  this 
soluble  enzyme  is  recognized  by  a  panel  of  con- 
formation-sensitive antireceptor  monoclonal  anti- 
bodies, it  exhibits  a  low  level  of  basal  activity 
toward  exogenous  substrates  that  increases  upon 
autophosphorylation,  and  the  sites  of  autophos- 
phorylation of  the  enzyme  are  typical  of  those 
observed  following  insulin-dependent  autophos- 
phorylation of  the  wild-type  receptor  in  intact 
cells.  Thus,  although  synthesized  free  of  the  cell 
membrane  and  now  insulin-independent,  this  sol- 
uble derivative  of  the  protein  tyrosine  kinase  do- 
main exhibits  the  two  functional  states  observed 
for  the  kinase  in  the  context  of  the  wild-type 
transmembrane  receptor. 

An  efficient  heterologous  cell  expression  sys- 
tem renders  feasible  the  use  of  biophysical  meth- 
ods for  functional  studies  of  this  soluble  enzyme, 
which  are  not  yet  possible  for  the  wild-type 
membrane-associated  insulin  receptor.  For  exam- 
ple, by  the  use  of  nuclear  magnetic  resonance 
(NMR)  spectroscopy  (in  collaboration  with  Barry 
Levine,  Oxford  University),  we  have  begun  to  ex- 
plore the  interaction  of  small  molecules  (metal 
ions,  ATP,  peptide  substrates)  with  the  enzyme 
and  to  study  catalysis  by  the  kinase  in  solution  in 
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real  time:  we  can  now  follow  the  phosphoryla- 
tion of  individual  tyrosine  residues  of  such  pep- 
tides (especially  the  order  of  phosphorylation  of 
multiple  tyrosines) ,  as  well  as  examine  the  con- 
tribution of  individual  amino  acid  residues  to  the 
kinetics  of  tyrosine  phosphorylation  of  the  pep- 
tide. Thus  it  should  be  possible  to  determine 


directly  the  stereochemical  requirements  and 
dynamics  of  efficient  exogenous  substrate 
phosphorylation  by  this  tyrosine  kinase.  These 
studies  complement  efforts  to  obtain  the  three- 
dimensional  structure  of  the  enzyme  through 
x-ray  crystallography  (in  collaboration  with 
Wayne  Hendrickson) . 
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LIFE  in  all  large  animals  requires  the  capacity 
to  ship  oxygen  and  food  to  all  organs  and  the 
cells  within  the  organs.  Blood  serves  this  capac- 
ity, but  the  development  of  such  a  delivery  sys- 
tem is  not  without  its  inherent  problems.  The 
blood  vessels  that  carry  the  blood  are  subject  to 
injury.  When  cut,  bleeding  is  a  complication.  For- 
tunately, a  complex  system  exists  that  allows  the 
blood  to  clot  at  a  wound  site  and  prevent  fatal 
bleeding  in  all  but  the  most  severe  wounds.  Rela- 
tively few  people  are  troubled  with  an  inability  to 
clot  blood.  In  contrast,  the  formation  of  un- 
wanted blood  clots  either  causes  or  contributes 
to  nearly  half  of  all  deaths  in  developed  coun- 
tries. The  major  problems  occur  in  heart  attacks, 
strokes,  pulmonary  embolisms,  and  septic  shock. 
The  system  that  leads  to  blood  clots  involves 
more  than  1 0  separate  proteins  and  several  differ- 
ent cells.  To  prevent  unwanted  clots,  an  equally 
complex  collection  of  systems,  referred  to  as  anti- 
coagulant complexes,  has  evolved  to  block  or 
limit  the  blood  clotting  process.  Our  working  hy- 
pothesis is  that  changes  in  the  activity  of  these 
anticoagulant  complexes  may  contribute  to  the 
formation  of  lethal  blood  clots. 

Protein  C,  protein  S,  and  thrombomodulin  con- 
stitute one  of  the  natural  anticoagulant  com- 
plexes that  prevents  unwanted  blood  clots.  We 
have  focused  on  this  system  because  we  have 
been  able  to  identify  patients  with  a  history  of 
unwanted  blood  clots  who  have  abnormal  pro- 
tein C,  protein  S,  or  thrombomodulin.  To  under- 
stand how  the  system  functions,  it  is  useful  to 
review  the  function  of  the  components.  Throm- 
bomodulin is  found  primarily  on  the  surface  of 
endothelial  cells,  the  cells  that  line  the  blood 
vessels.  Thrombin,  the  enzyme  that  causes  blood 
to  clot,  can  bind  to  thrombomodulin;  when 
thrombin  is  bound,  it  no  longer  clots  the  blood 
but  instead  converts  protein  C  into  the  active 
blood  clotting  inhibitor,  activated  protein  C.  Ac- 
tivated protein  C  then  binds  to  protein  S  on  the 
surface  of  platelets  (small  cells  in  the  blood)  or 
endothelial  cells,  where  it  functions  as  an  antico- 
agulant. The  activated  protein  C-protein  S  com- 
plex works  as  an  anticoagulant  by  cutting  up  and 


inactivating  two  of  the  clotting  proteins,  factor 
VIII  (the  protein  missing  in  hemophilia)  and  fac- 
tor V. 

This  broad  outline  of  how  the  system  functions 
fails  to  tell  us  much  about  where,  when,  or  how 
the  system  might  function  in  human  disease  pro- 
cesses. This  knowledge  is  important  both  in 
terms  of  understanding  the  basic  properties  of  the 
system  and  in  the  design  of  new  therapeutic  ap- 
proaches to  the  diagnosis  and  prevention  of 
blood  clots.  Of  particular  interest,  and  still  unex- 
plained, is  the  observation  that  administration  of 
activated  protein  C  at  levels  that  can  prevent  un- 
wanted blood  clots  does  not  increase  blood  loss 
at  surgical  sites.  This  contrasts  with  available  anti- 
coagulants, such  as  heparin,  which  block  un- 
wanted clot  formation  but  also  dramatically  in- 
crease blood  loss  at  surgical  sites.  One  of  our 
goals  is  to  understand  how  this  natural  anticoagu- 
lant can  accomplish  this  remarkable  specificity. 
New  therapeutic  agents  with  these  properties 
could  greatly  decrease  morbidity  and  mortality 
associated  with  thrombotic  complications. 

Most  healthy  individuals  have  adequate 
amounts  of  protein  C  and  the  other  components 
of  the  system  that  control  blood  clot  formation 
under  normal  circumstances.  When  people  be- 
come sick,  unwanted  clotting  is  often  a  problem. 
Studies  from  other  laboratories  have  shown  that 
protein  S  circulates  in  humans  both  free  and 
bound  to  an  inhibitor  of  the  complement  system 
(the  system  that  helps  protect  from  infection), 
called  C4b-binding  protein  (C4bBP). 

We  found  that  only  the  free  form  of  protein  S 
could  work  to  form  the  anticoagulant.  Patients 
with  clinical  conditions  known  to  cause  an  in- 
creased risk  of  blood  clots  also  had  reduced  lev- 
els of  free  protein  S  and  more  C4bBP-protein  S 
complex.  Families  with  inherited  thrombotic 
complications  were  identified  in  which  the  fam- 
ily members  who  developed  blood  clots  had  high 
levels  of  the  complex.  These  observations  sug- 
gested that  alteration  in  the  levels  of  free  protein 
S  might  contribute  to  the  clotting  complications 
observed  in  these  patients.  To  test  this  hypothe- 
sis, we  developed  some  animal  models  of  throm- 
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bosis  and  examined  whether  elevation  of  the 
C4bBP  levels  to  those  found  in  patients  would 
increase  the  risk  of  thrombotic  complications 
(unwanted  blood  clots) .  This  was  exactly  what 
was  observed.  More  importantly  from  a  therapeu- 
tic point  of  view,  simultaneously  increasing  pro- 
tein,S  prevented  the  thrombotic  complications. 
These  results  suggest  that  if  we  could  block  the 
decrease  in  free  protein  S,  the  risk  of  thrombosis 
would  be  reduced.  This  approach  could  poten- 
tially return  the  patient  to  normal  status  without 
significantly  increasing  the  risk  of  bleeding. 

Recently  it  has  become  clear  that  clot  forma- 
tion in  the  arteries  of  the  heart  is  responsible  for 
most  heart  attacks.  Damage  that  results  from  the 
attack  may  involve  injury  to  the  small  vessels  that 
feed  the  heart.  When  we  examined  whether  the 
protein  C  system  helps  prevent  this  damage,  we 
found  that  protein  C  is  activated  following  a  short 
(2  min  or  less)  blockage  of  an  artery  in  the  heart. 
When  this  activation  is  prevented,  the  heart  re- 
covers less  completely,  if  at  all.  These  experi- 
ments imply  that  patients  with  decreased  func- 
tion of  the  protein  C  system  may  have  increased 
damage  to  the  heart  if  they  do  have  a  heart  attack 
and  that  activated  protein  C  may  prove  a  useful 
agent  in  preventing  some  aspects  of  this  damage 
to  the  heart.  Clinical  trials  will  test  these 
concepts. 

A  major  interest  in  our  laboratory  has  been  to 
understand  how  thrombomodulin  causes  throm- 
bin to  change  from  a  clotting  to  a  clot-inhibiting 
enzyme.  To  understand  this  question,  we  have 


made  mutants  of  human  thrombin,  because  the 
binding  of  thrombin  to  thrombomodulin  proba- 
bly changes  thrombin's  shape.  This  shape  change 
probably  accounts  for  how  the  complex  acceler- 
ates protein  C  activation.  One  possibility  is  that 
thrombin  has  one  or  two  specific  amino  acids  that 
hit  protein  C  in  such  a  way  that  it  cannot  "dock" 
or  bind  with  thrombin  effectively.  If  this  were  the 
case,  then  altering  the  amino  acids  that  block  the 
docking  event  could  make  thrombin  a  better  acti- 
vator. We  have  examined  two  classes  of  such 
changes.  In  proteins,  likes  repel  and  unlikes  at- 
tract. In  the  case  of  protein  C,  very  close  to  the 
place  where  activation  occurs,  there  is  a  nega- 
tively charged  amino  acid.  There  is  also  a  nega- 
tive amino  acid  in  the  region  of  thrombin  very 
near  where  this  negatively  charged  amino  acid 
must  dock.  If  the  slow  activation  of  protein  C  by 
thrombin  is  related  to  the  charge  repulsion  (it  is 
difficult  to  get  negative  charges  together),  then 
removing  this  negative  charge  from  either  throm- 
bin or  protein  C  should  increase  the  rate  of  pro- 
tein C  activation  in  a  manner  analogous  to  forma- 
tion of  the  thrombin-thrombomodulin  complex. 
When  either  the  negative  charge  on  protein  C  or 
on  thrombin  is  removed,  protein  C  activation  is 
increased  5-20  times.  This  change  in  thrombin 
does  not  influence  the  ability  to  clot  fibrinogen. 
These  studies  provide  insights  into  how  regula- 
tory proteins  such  as  thrombomodulin  may  func- 
tion and  also  provide  the  framework  for  the 
design,  by  selective  mutation,  of  new  anticoagu- 
lants that  alter  the  function  of  normal  clotting 
proteins. 
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AN  understanding  of  the  mechanisms  by 
which  apparently  distinct  regulatory  sys- 
tems integrate  to  modulate  body  function  and  be- 
havior poses  one  of  the  most  important  chal- 
lenges of  modern  biology.  Hence  we  have 
focused  our  attention  on  the  action  of  steroid, 
retinoid,  and  thyroid  hormones  in  regulatory  cel- 
lular and  organ  physiology.  This  field  has  under- 
gone an  extraordinary  development  in  the  last 
several  years  as  a  consequence  of  the  cloning  and 
sequencing  of  the  genes  encoding  the  receptors 
for  these  hormones  in  target  cells. 

It  has  been  demonstrated  that  these  receptors 
are  all  structurally  related  and  constitute  a  super- 
family  of  nuclear  regulatory  proteins  that  are  ca- 
pable of  modulating  gene  expression  in  a  ligand- 
dependent  fashion.  One  challenge  is  to  define 
each  receptor's  molecular  properties  that  deter- 
mine its  interactions  with  the  transcription  ma- 
chinery regulating  gene  expression.  Another 
challenge  is  to  elucidate  the  contributions  of  indi- 
vidual regulatory  systems  to  the  integrated  and 
complex  processes  associated  with  cell  growth, 
differentiation,  and  organ  function. 

The  Glucocorticoid  Regulatory  Network 

The  glucocorticoid  receptor  (GR)  is  well  char- 
acterized as  a  potent  activator  of  transcription. 
More  recently  the  GR  has  also  been  shown  to  re- 
press expression  of  a  variety  of  genes  in  response 
to  hormones.  Glucocorticoids,  in  addition  to 
their  inductive  effects  on  specific  genes,  inhibit 
proliferation  of  a  variety  of  cultured  cell  lines 
and  oppose  the  action  of  lymphokines  in  the  acti- 
vation of  T  cells.  T  cell  activation  is  in  part  me- 
diated by  activation  of  Jun/AP-1,  which  is  a 
member  of  the  class  of  nuclear  proteins  encoded 
by  proto-oncogenes  and  a  factor  in  diverse 
aspects  of  cell  growth,  differentiation,  and 
development. 

We  have  provided  evidence  that  the  GR  and 
Jun/AP-1  can  reciprocally  repress  each  other's 
transcriptional  activation  and  that  this  inhibition 
is  a  consequence  of  protein-protein  interaction. 
The  interaction  of  two  distinct  regulatory  pro- 
teins, referred  to  as  "cross-coupling,"  has  several 


important  implications.  First,  it  provides  a  molec- 
ular framework  for  understanding  the  opposing 
effects  of  growth  factors  and  steroid  hormones. 
Growth  factors  stimulate  cellular  division, 
whereas  steroid  hormones  inhibit  cell  division 
while  promoting  differentiation. 

Perhaps  the  most  intriguing  aspect  of  cross- 
coupling  is  that  it  allows  interaction  of  distinct 
classes  of  transcription  factors  prior  to  their  in- 
teraction with  target  DNA  sequences.  Thus  the 
GR  prevents  AP- 1  from  binding  to  DNA  and  acti- 
vating a  downstream  set  of  target  genes.  Accord- 
ingly, glucocorticoids  indirectly  regulate  the 
AP- 1  gene  network  by  modulating  the  DNA-bind- 
ing  properties  of  Jun/AP-1.  Although  glucocorti- 
coids are  thought,  classically,  to  regulate  target 
genes  by  promoting  the  receptor  to  bind  their 
DNA,  cross-coupling  allows  steroid  regulation  of 
the  AP- 1  -responsive  genes,  but  in  a  DNA-indepen- 
dent  fashion.  This  DNA-dependent  pathway  is  a 
novel  mechanism  for  steroid  control  of  gene 
expression. 

A  Novel  Retinoic  Acid  Response  Pathway 

The  retinoids  make  up  a  group  of  compounds 
that  include  retinoic  acid,  retinol  (vitamin  A), 
and  a  series  of  natural  and  synthetic  derivatives 
that  exen  profound  effects  on  development  and 
differentiation  in  a  wide  variety  of  systems.  Reti- 
noic acid  has  also  been  shown  to  induce  the  tran- 
scription of  several  genes,  supporting  the  hy- 
pothesis that  it  has  functions  analogous  to  those 
of  steroid  and  thyroid  hormones.  In  previous  stud- 
ies we  described  the  cloning  and  characterization 
of  a  retinoic  acid-dependent  transcription  factor, 
referred  to  as  RARa.  Additional  RAR-related  genes 
have  been  isolated,  and  at  least  three  different 
RAR  subtypes  (a,  ^,  and  7)  are  now  known  in 
mice  and  humans. 

Retinoic  acid  receptors  share  homology  with 
the  superfamily  of  steroid  and  thyroid  hormone 
receptors  and  have  been  shown  to  regulate  spe- 
cific gene  expression  by  a  similar  ligand-depen- 
dent  mechanism.  Complicating  these  observa- 
tions is  our  recent  identification  of  a  group  of 
receptors  termed  RXRs  (retinoid  X  receptors). 
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which  are  only  distantly  related  to  the  RARs.  At 
least  three  RXR-related  genes  (a,  |8,  and  7)  are 
now  known  to  be  located  at  genetically  distinct 
loci.  Each  is  capable  of  transactivating  through 
cis-DNA-binding  sequences  similar  to  those  of  the 
RARs.  Northern  blot  analyses  of  the  RXRs  indicate 
that  each  isoform  has  a  unique  pattern  of  expres- 
sion in  adult  tissue  and  is  temporally  and  spatially 
expressed  in  the  embryo.  Binding  experiments 
demonstrate  that  the  RXR  protein  has  low  affinity 
for  retinoic  acid  and,  taken  together  with  the 
transactivation  studies,  indicate  that  the  RXR  li- 
gand  may  be  a  metabolite  of  retinoic  acid.  Thus  a 
function  of  the  RXRs  may  both  overlap  and  be 
unique  to  that  of  the  RARs.  These  studies  suggest 
a  role  for  RXRs  in  adult  physiology  and  embry- 
onic development. 

The  discovery  in  RXR  of  a  second  transduction 
pathway  with  distinct  pharmacological  proper- 
ties may  lead  to  a  better  understanding  of  how 
retinoids  can  elicit  such  an  enormous  diversity  of 
biological  responses  and  suggests  that  retinoid 
metabolism  may  provide  important  clues  to  the 
identification  of  new  ligands. 


In  view  of  the  homology  of  the  RARs  and  RXRs, 
their  genes  would  seem  to  have  arisen  indepen- 
dently during  evolution,  possibly  to  cope  with 
the  increasing  demands  of  ever-more-complex 
evolving  organisms.  An  understanding  of  the  ori- 
gins of  these  two  gene  families  may  afford  in- 
sights to  their  modern  roles.  Thus,  in  search  of 
invertebrate  homologues,  we  have  screened  a 
Drosophila  genomic  library  and  identified  the 
product  of  a  novel  steroid-receptor-like  gene — 
ultraspiracle  (usp) — that  bears  striking  resem- 
blance to  RXR  in  both  its  DNA-  and  ligand-bind- 
ing  domains. 

Genetic  analysis  indicates  that  the  usp  gene 
product  has  both  maternal  and  zygotic  functions. 
In  the  germline,  it  is  important  for  formation  of 
the  embryonic  cuticle  choriogenesis  and  fertiliza- 
tion. In  the  embryo,  it  is  necessary  for  larval  pu- 
pation. Recent  functional  studies  indicate  that 
the  usp  gene  product  does  not  respond  to  reti- 
noic acid,  which  suggests  that  a  nonretinoid  li- 
gand  may  exist  in  Drosophila.  This  is  particularly 
interesting  in  that  its  structure  may  provide  clues 
to  the  identification  of  the  vertebrate  usp  ligand. 
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THE  major  objective  of  my  laboratory  is  to  elu- 
cidate the  mechanisms  of  action  of  hor- 
mones, neurotransmitters,  and  other  agents  that 
transmit  information  in  the  nervous  system  and 
other  organ  systems  by  altering  membrane  lipids 
and  increasing  the  concentration  of  calcium  ions 
in  their  target  cells.  A  large  number  of  agents  act 
this  way.  They  include  regulators  of  heart  func- 
tion and  blood  flow,  such  as  epinephrine,  nor- 
epinephrine, acetylcholine,  angiotensin,  and 
vasopressin;  other  neurotransmitters,  such  as  se- 
rotonin, neurotensin,  and  substance  P;  and  agents 
that  control  certain  pituitary  and  pancreatic  se- 
cretions, food  digestion,  bladder  and  uterine  con- 
traction, platelet  aggregation,  and  certain  re- 
sponses to  trauma  and  infection. 

We  initially  established  that  many  actions  of 
hormones  and  neurotransmitters  are  not  due  to 
changes  in  the  cellular  levels  of  cyclic  nucleotide 
second  messengers  but  result  from  an  increase 
in  the  level  of  calcium  ions.  The  next  phase  of 
the  work  involved  the  demonstration  that  these 
calcium-mediated  agents  act  both  by  mobilizing 
calcium  ions  from  internal  stores  and  by  stimulat- 
ing the  inflow  of  these  ions  across  the  cell  mem- 
brane. We  also  demonstrated  that  the  receptors 
for  these  agents  are  located  on  the  outer  surface 
of  their  target  cells.  Thus  these  cells  must  have 
some  means  of  signaling  from  the  receptors  to 
their  internal  calcium  stores. 

Efforts  were  directed  toward  elucidating  the 
signaling  mechanisms.  Initially  we  tested  the  hy- 
pothesis that  the  signal  was  generated  by  the 
breakdown  of  phosphatidylinositol,  a  phospho- 
lipid in  the  cell  membrane.  However,  this  break- 
down was  too  slow  to  account  for  the  changes  in 
calcium.  A  related  phospholipid,  phosphatidyl- 
inositol  4,5-bisphosphate  (PIP2),  broke  down 
more  rapidly  in  response  to  hormones,  but  the 
change  was  transient.  The  situation  became 
clearer  when  inositol  1 ,4,5-trisphosphate  (IP3) 
was  identified  as  the  signaling  molecule  for  cal- 
cium. This  compound  is  generated  when  PIP2  is 
broken  down  by  the  enzyme  phospholipase  C. 
The  other  compound  produced  is  1,2-diacylgIy- 


cerol  (DAG),  which  is  also  a  signaling  molecule, 
since  it  activates  a  specific  protein  kinase  (pro- 
tein kinase  C) . 

The  present  activities  of  the  laboratory  encom- 
pass three  major  research  areas.  The  first  involves 
elucidating  how  calcium-mediated  agents  stimu- 
late the  breakdown  of  PIP2  .  A  major  discovery  has 
been  the  finding  that  a  G  protein  (a  regulatory 
protein  that  binds  the  nucleotide  GTP)  is  in- 
volved in  coupling  the  receptors  for  these  agents 
to  the  phospholipase  C  enzyme  that  breaks  down 
PIP2.  Our  work  has  involved  purifying  and  char- 
acterizing the  relevant  G  protein  from  liver  cell 
membranes  and  reconstituting  it  with  other  com- 
ponents of  the  signaling  system.  Reconstitution 
of  the  G  protein  with  PIP2  phospholipase  C  has 
been  achieved,  and  the  system  has  been  used  to 
purify  the  protein  to  homogeneity  in  both  the 
complete  form  (^a^y  heterotrimer)  and  in  the 
form  of  the  free  a-subunit.  The  a-subunit  (molec- 
ular weight  42,000)  has  been  shown  immuno- 
logically to  be  a  member  of  the  Gq  family  of  G 
proteins. 

The  a-subunit  activated  by  a  GTP  analogue  has 
been  used  to  identify  the  specific  isozyme  form  of 
the  PIP2  phospholipase  C  controlled  by  the  G 
protein.  In  both  liver  and  brain,  it  is  the  l48-kDa 
18-isozyme.  Evidence  that  the  purified  G  protein  is 
coupled  to  at  least  one  receptor  for  a  calcium- 
mobilizing  agent  has  been  obtained  by  the  dem- 
onstration that  binding  of  a  GTP  analogue  to  the 
G  protein  heterotrimer  is  enhanced  by  its  recon- 
stitution with  the  Ml  (calcium-mobilizing)  mus- 
carinic receptor  for  acetylcholine,  but  not  with 
the  M2  form  of  this  receptor.  In  addition,  three 
calcium-mediated  agents  (vasopressin,  epineph- 
rine, and  angiotensin)  specifically  stimulate  the 
labeling  of  two  G  proteins  by  a  radioactive,  light- 
reactive  analogue  of  GTP,  azidoanilido  GTP,  in 
liver  cell  membranes.  These  proteins  (apparent 
molecular  weights  42,000  and  43,000)  have  also 
been  shown  immunologically  to  be  members  of 
the  Gq  family  of  G  proteins.  It  is  likely  that  the 
smaller  protein  is  identical  to  the  a-subunit 
(42,000  molecular  weight)  that  activates  PIP2 
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phospholipase  C.  Possible  functions  of  the  larger 
protein  are  being  examined. 

The  second  major  research  area  arose  from  the 
observation  that  hormones  and  neurotransmitters 
that  break  down  PIP2  in  their  target  cells  also 
break  down  another  cell  membrane  phospho- 
lipid, phosphatidylcholine  (PC).  This  break- 
down yields  DAG  and  phosphatidic  acid  (PA)  and 
involves  two  other  phospholipases  (C  and  D), 
which  are  being  purified  and  characterized.  Fur- 
ther work  has  shown  that  PC  breakdown  is  regu- 
lated in  several  ways:  by  mechanisms  involving  G 
proteins,  Ca^^,  protein  kinase  C,  and  tyrosine  ki- 


nases. In  vivo  and  in  vitro  approaches  are  being 
used  to  explore  these  mechanisms. 

The  third  major  research  area  involves  studies 
of  the  actions  and  metabolism  of  PA  in  cells.  This 
work  has  shown  that  high  concentrations  of  PA 
accumulate  rapidly  in  cells  stimulated  with  many 
hormones,  neurotransmitters,  and  growth  fac- 
tors. Although  it  is  widely  believed  that  PA  acts  as 
a  second  messenger,  its  cellular  targets  and  func- 
tions remain  unknown.  We  have  recently  identi- 
fied a  protein  kinase  that  is  activated  by  PA  and 
may  therefore  mediate  its  cellular  effects.  This 
en2yme  is  being  purified  and  characterized. 
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OUR  laboratory  is  studying  the  molecular 
basis  of  human  cancer.  In  particular  we  are 
attempting  to  elucidate  the  earliest  events  that 
convert  a  normal  cell  to  a  malignant  cell  and  the 
role  of  gene  inactivation  in  cancer  development. 

The  Earliest  Events  in  Malignant 
Transformation 

We  have  found  that  human  cancer  involves  al- 
terations of  many  genes,  even  within  the  same 
tumor.  Most  of  these  changes  occur  relatively  late 
and  probably  do  not  play  a  causal  role  in  initia- 
tion of  malignancy.  Cancer  can  require  several 
decades  to  develop,  including  a  long  premalig- 
nant  or  benign  phase.  For  example,  in  colorectal 
cancer,  the  time  course  of  premalignant  disease 
(adenomas  or  polyps)  can  be  as  long  as  10  years. 
The  only  genetic  change  we  have  found  in  early 
adenomas  is  an  alteration  in  DNA  methylation — a 
tissue-specific  modification  of  cytosine,  one  of 
the  four  DNA  nucleotides.  Hypomethylation,  the 
loss  of  methyl  groups  from  cytosine,  appears  to 
play  an  important  role  in  normal  gene  activation 
or  expression,  and  thus  altered  DNA  methylation 
could  contribute  to  the  abnormal  gene  expres- 
sion that  characterizes  cancer. 

In  addition  to  studying  the  problem  of  DNA 
methylation  in  cancer,  we  would  like  to  know 
which  genes  are  expressed  specifically  in  cells 
that  are  transforming.  By  the  time  a  cell  is  fully 
malignant,  more  than  1,000  genes  have  under- 
gone changes  in  their  expression.  If  we  could  de- 
termine which  genes  are  activated  in  premalig- 
nant cells,  we  might  better  understand  how 
structural  changes  in  the  genome  mediate  the  dra- 
matic changes  in  cell  behavior  that  define  malig- 
nancy, such  as  unregulated  growth,  tissue  inva- 
sion, and  metastasis. 

To  approach  these  two  problems,  we  have  de- 
veloped a  novel  experimental  system.  We  have 
captured  cells  in  culture  that  have  been  treated 
with  a  carcinogen  that  causes  hypomethylation  of 
DNA.  We  have  devised  a  way  to  tell  which  cells 
treated  with  this  drug  will  become  malignantly 
transformed,  and  which  will  not,  before  they 
have  undergone  any  of  the  changes  in  growth 


properties  or  appearance  that  define  malignancy. 
In  this  manner,  we  have  isolated  cells  that  look 
normal  but  are  committed  to  malignant  transfor- 
mation, as  well  as  cells  that  have  been  handled  in 
the  same  manner  but  will  not  become  malignant 
(the  ideal  control). 

By  preparing  cDNA  libraries  (large  collections 
of  the  expressed  genes) ,  we  have  identified  a  rela- 
tively small  number  that  are  specifically  asso- 
ciated with  commitment  to  neoplastic  transfor- 
mation. We  have  also  identified  several  genes  that 
are  turned  off  early  in  that  process.  We  are  now 
determining  the  identity  and  function  of  these 
genes,  one  of  which  appears  to  be  a  novel  onco- 
gene with  the  ability  to  transform  cells  when  in- 
troduced into  them. 


The  Role  of  Gene  Inactivation  in 
Cancer  Development 

One  of  the  most  important  areas  of  cancer  ge- 
netics is  the  ascertainment  and  characterization 
of  tumor-suppressor  genes — genes  whose  inacti- 
vation contributes  to  cancer.  Since  almost  all 
genes  are  present  in  two  copies  in  the  cell,  inacti- 
vation of  one  copy  could  be  transmitted  in  fami- 
lies from  parent  to  child.  Individuals  inheriting 
one  bad  copy  of  the  gene  would  be  at  increased 
risk  of  developing  cancer.  Cancer  would  develop 
from  deletion  or  inactivation  of  both  copies  of 
these  suppressor  genes. 

In  the  1970s  it  was  discovered  that  some  chil- 
dren with  Wilms'  tumor,  a  childhood  kidney 
cancer,  are  missing  a  large,  microscopically  visi- 
ble portion  of  chromosome  1 1  (in  band  1  lpl3). 
Later  we  found  that  more  subtle  gene  deletions 
on  chromosome  1 1  could  be  detected  in  Wilms' 
tumors  indirectly,  using  restriction  fragment 
length  polymorphisms  (RFLPs) ,  which  can  distin- 
guish the  maternal  and  paternal  alleles,  or  copies, 
of  a  given  gene  (one  on  each  chromosome) . 

During  the  past  year,  we  have  cloned  750  kilo- 
bases  of  DNA  from  band  1  lpl3  of  chromosome 
1 1,  using  yeast  artificial  chromosomes  (YACs),  in 
collaboration  with  David  Schlessinger  and  May- 
nard  Olson  (HHMI,  Washington  University)  and 
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Bryan  Williams  (Hospital  for  Sick  Children,  To- 
ronto). Within  this  chromosomal  region,  we 
found  a  cluster  of  genes  that  appear  to  be  in- 
volved in  both  childhood  kidney  cancer  and  nor- 
mal kidney  development.  In  one-half  of  the  tu- 
mors we  examined,  at  least  two  of  these 
candidate  tumor-suppressor  genes  were  turned 
oflfr  Remarkably,  the  tumors  in  which  the  genes 
were  turned  off  also  contained  tissues  not  nor- 
mally found  in  the  kidney,  such  as  muscle,  sug- 
gesting that  the  same  genes  are  involved  in  both 
tumor  development  and  normal  kidney  develop- 
ment. We  now  need  to  find  out  what  role  these 
genes  play  in  kidney  development  and  why  they 
are  turned  off  in  cancer. 

A  surprising  result  from  the  cloning  and  map- 
ping of  this  large  genomic  region  was  the  identi- 
fication of  an  "archipelago"  of  cytosine-guanine 
islands,  or  clusters  of  CG  nucleotides  that  are  of- 
ten near  genes.  Several  of  the  CG  islands  in  the 
Wilms'  tumor  gene  region  were  methylated,  a 
feature  normally  characterizing  the  inactive  X 
chromosome.  This  offers  a  possible  mechanism 
for  inactivation  of  tumor-suppressor  genes,  as 


well  as  an  unexpected  link  to  our  research  on 
DNA  methylation. 

In  addition  to  the  known  gene  on  chromosome 
1 1  that  predisposes  to  Wilms'  tumor,  we  have 
discovered  another  gene  at  a  different  location  on 
chromosome  11  (band  Hp  15)  that  also  predis- 
poses to  Wilms'  tumor  but  is  involved  in  bladder, 
breast,  and  lung  cancer  as  well.  Loss  of  one  copy 
of  this  gene  in  the  germline  appears  to  cause 
Beckwith-Wiedemann  syndrome,  a  disorder  of 
organ  overgrowth  and  malignancy.  We  are  now 
attempting  to  clone  this  tumor-suppressor/Beck- 
with-Wiedemann  syndrome  gene.  In  this  eff'ort 
we  are  using  a  strategy  we  have  developed  to  iso- 
late tumor-suppressor  genes  directly.  We  frag- 
ment human  chromosomes  into  subfragments  of 
2-10  million  nucleotides  in  a  way  that  allows  us 
to  transfer  these  into  tumor  cells.  This  novel  tech- 
nique will  allow  us  to  bridge  a  gap  in  cloning 
methods  between  chromosome-size  pieces  (aver- 
age 100  million  nucleotides)  and  yeast  artificial 
chromosomes  (average  300,000  nucleotides), 
and  it  may  have  general  application  to  cloning  a 
wide  variety  of  genes. 
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HISTOCOMPATIBILITY  antigens  are  cell  sur- 
face molecules  that,  when  foreign,  lead  to 
the  rejection  of  grafted  tissues  and  organs  by  the 
vertebrate  immune  system.  Because  they  form  a 
major  obstacle  to  clinical  transplantation,  H  anti- 
gens have  been  studied  for  over  50  years.  They 
are  complexes  of  a  small  peptide  ligand  and  an 
MHC  molecule  (encoded  by  genes  of  the  major 
histocompatibility  complex).  A  given  individual 
has  MHC  molecules  of  a  few  different  kinds,  each 
of  which  can  present  to  the  immune  system  a 
large  variety  of  peptides  on  the  surface  of  cells. 
These  peptides  might  be  derived  from  proteins 
produced  by  intracellular  parasites,  bacteria,  or 
viruses  or  by  the  body's  own  cells,  such  as  tumor- 
specific  or  minor  H  antigens.  The  amino  acid  side 
chains  that  line  the  peptide-binding  groove  of  an 
MHC  molecule  determine  which  peptides  that 
molecule  can  bind,  and  therefore  what  antigens 
can  be  presented  to  induce  an  immune  response 
in  the  individual  with  this  MHC. 

The  immune  system  is  capable  of  recognizing  a 
difference  in  either  of  the  H  antigens'  two  parts.  A 
difference  in  the  MHC  molecule  itself  will  alter 
many  complexes  and  induce  a  strong  immune  re- 
sponse, hence  the  term  "major"  H  antigen.  By 
contrast,  a  difference  in  a  peptide  alters  only  one 
of  many  kinds  of  complexes  and  induces  a  weaker 
response,  hence  the  term  "minor"  H  antigen.  Un- 
like the  major  H  antigens,  human  minor  H  anti- 
gens remain  ill-defined.  In  the  mouse,  however, 
more  than  50  genes  that  encode  minor  H  antigens 
have  been  mapped,  and  almost  every  chromo- 
some, including  the  mitochondrial  genome,  car- 
ries at  least  one. 

The  Maternally  Transmitted  Antigen 

The  sixth  amino  acid  in  the  mouse  mitochon- 
drial protein  NDl  is  polymorphic.  When  cells 
with  one  form  are  transplanted  to  a  mouse  with 
another  form,  the  amino-terminal  peptide  of  NDl 
will  act  as  a  transplantation  antigen,  called  Mta. 
This  peptide  is  presented  on  the  cell  surface  by 
an  MHC  class  I  molecule  called  M3.  Mta  was  the 
first  minor  H  antigen  to  be  analyzed  in  molecular 


detail.  It  has  turned  out  to  be  an  excellent  model, 
although  discovered  by  virtue  of  its  unusual  fea- 
tures of  mitochondrial,  hence  maternal,  inheri- 
tance and  its  presentation  by  a  novel,  highly  con- 
served MHC  molecule. 

M3  only  binds  the  NDl  peptide  when  the  me- 
thionine at  the  end  carries  a  formyl  group,  and 
M3  can  also  bind  other  peptides  with  a  formyl- 
methionine.  This  is  characteristic  of  the  amino 
terminus  of  mitochondrial  and  bacterial  proteins, 
and  distinguishes  them  from  proteins  made  in  the 
cytoplasm  of  mammalian  cells.  It  is  of  great  inter- 
est to  understand  how  the  formyl-methionine 
peptides  are  bound  by  M3  and  which  amino  acids 
in  M3  are  important  for  this  specificity. 

We  have  now  cloned  the  gene  M3  from  mice, 
and  considerable  effort  has  been  spent  during  the 
past  year  on  various  systems  for  deriving  M3  in 
amounts  sufficient  for  a  structural  analysis.  We 
can  express  the  M3  gene  in  insect  cells,  which 
make  milligrams  of  the  protein  in  a  single  culture 
flask.  However,  the  protein  is  not  expressed  on 
the  surface  of  these  cells,  probably  because  it  is 
folded  incorrectly,  and  is  detectable  only  with 
rabbit  antisera  against  short  fragments  of  it.  The 
heavy  chain  of  MHC  class  I  molecules  is  known  to 
be  unstable  in  the  properly  folded  conformation 
at  body  temperature  unless  binding  the  ^2'^^' 
croglobulin  light  chain  as  well  as  a  peptide.  We 
are  now  trying  to  achieve  correct  folding  by  mix- 
ing the  M3  heavy  chain  from  a  cell  extract  with 
(82-microglobulin  light  chains  and  Mta. 

We  can  express  M3  in  mouse  fibroblasts,  where 
Mta  can  be  detected  by  killer  T  lymphocytes.  The 
M3  protein  is  therefore  folded  correctly,  but  in 
amounts  too  small  for  biochemical  analysis.  This 
system  does  allow  us,  however,  to  change  single 
amino  acids  in  the  protein  by  introducing  muta- 
tions at  specific  sites  in  the  gene.  By  testing 
whether  killer  T  cells  still  recognize  the  mutant 
protein,  we  can  assay  whether  it  presents  Mta. 
From  sequencing  naturally  occurring  variants  of 
the  M3  gene  from  wild  mice,  we  are  learning 
more  about  which  amino  acids  are  essential  for 
the  ability  to  present  the  peptide. 
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RMA-S  Mutant  Cells 

The  RMA-S  mutant  cell  line  has  been  of  funda- 
mental importance  in  understanding  peptide  an- 
tigen presentation.  The  cells  make  MHC  class  I 
heavy  chains  and  /52-microglobulin,  but  do  not 
display  them  on  their  surface.  The  defect  can  be 
circumvented  by  adding  synthetic  peptides  to  the 
cells.  It  is  thought  to  affect  either  an  enzyme  that 
cleaves  proteins  into  peptides,  or  a  protein  that 
helps  transport  peptides  from  the  cytoplasm  into 
the  endoplasmic  reticulum,  where  MHC  class  I 
molecules  fold. 

We  have  confirmed  that  these  cells  are  devoid 
of  classical  MHC  class  I  molecules  that  can  be 
recognized  by  antibodies  or  killer  T  lymphocytes. 
But  to  our  surprise,  they  still  display  detectable, 
albeit  reduced,  amounts  of  other  MHC  class  I  an- 
tigens, such  as  Mta  and  Qa- 1 .  We  infer  that  either 
these  molecules  bind  their  selected  peptides 
with  much  higher  affinity  than  the  promiscuous 
classical  MHC  I  molecules,  or  that  these  peptides 
are  brought  to  the  endoplasmic  reticulum  by  a 
special  pathway. 

Most  mitochondrial  proteins  are  made  in  the 
cytoplasm,  and  much  is  known  about  how  the 
mitochondria  import  them,  but  nothing  is  known 
about  how  proteins  or  peptides  escape  from  the 
mitochondria.  The  Mta  antigen  is  proof  that  mito- 
chondrial peptides  make  it  to  the  cell  surface.  We 
are  now  exploring  mechanisms  that  may  increase 
the  production  or  release  of  mitochondrial  pro- 
teins and  peptides.  We  use  both  standard  bio- 
chemical assays  to  test  for  the  release  of  proteins 


from  purified  mitochondria  and  immunological 
assays  on  intact  RMA-S  cells  to  test  specifically  for 
the  NDl  peptide.  Antibodies  against  mitochon- 
drial proteins  are  often  present  in  diseases  with 
an  autoimmune  component,  such  as  primary  cir- 
rhosis of  the  liver.  We  hope  to  learn  what  brings 
about  the  initial  exposure  of  these  mitochondrial 
self-antigens  to  the  immune  system. 

Mta  in  the  Rat 

To  understand  the  importance  and  evolution  of 
the  Mta  system,  we  look  for  parallels  in  other  spe- 
cies. We  have  shown  previously  that  the  NDl 
peptide  of  the  rat  can  be  recognized  by  mouse 
killer  T  lymphocytes  if  presented  by  mouse  M3. 
We  now  know  that  the  rat  has  its  own  maternally 
transmitted  antigen  system,  which  again  involves 
a  mitochondrial  peptide.  It  differs  in  two  impor- 
tant respects:  the  peptide  is  not  from  the  amino 
terminus  of  NDl ,  and  it  is  presented  by  the  classi- 
cal RTIA  of  the  rat,  not  by  a  special  MHC  class  I 
molecule. 

The  rat,  however,  does  have  an  expressed  MHC 
class  I  gene  like  the  mouse  M3.  We  have  cloned 
and  sequenced  it.  The  rat  and  mouse  M3  genes 
are  more  similar  to  each  other  than  they  are  to  the 
other  MHC  class  I  genes  of  their  own  species.  This 
is  particularly  striking  in  the  rat,  where  the  class  I 
genes  are  otherwise  very  similar.  These  observa- 
tions suggest  that  the  specialized  function  of  M3 
evolved  long  ago  in  a  species  from  which  both 
rats  and  mice  are  descended,  and  that  it  has  been 
conserved  in  both  species  during  their  separate 
evolution. 
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Borrelia  burgdorferi,  the  Lyme  disease  agent,  in  heart  tissue  of  infected  mice.  These  rod-like  bacte- 
ria are  in  the  center  and  bottom  of  the  picture  and  appear  as  a  thin,  dark  line. 
Research  of  Richard  A.  Plavell. 
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MY  laboratory  has  concerned  itself  for  many 
years  with  the  expression  of  the  genes  of 
the  murine  MHC  (major  histocompatibility  com- 
plex) .  In  the  mouse  these  genes  are  encoded  on 
chromosome  17,  and  prior  work  has  shown  that 
there  are  approximately  25  class  1-related  genes 
and  a  handful  of  class  II  genes.  Class  I  genes  en- 
code a  protein  of  approximately  45,000  molecu- 
lar weight  that  is  found  in  association  with  a  small 
subunit,  ;82'"^icroglobulin.  Together  this  com- 
plex forms  a  symmetrical  molecule  consisting  of 
four  extracellular  globular  domains  anchored 
through  the  cell  membrane  with  the  transmem- 
brane segment  and  having  a  short  stretch  of 
amino  acids  that  extend  into  the  cytoplasm.  Class 
II  molecules  achieve  a  similar  symmetry,  but 
with  two  polypeptide  chains,  a  and  /3,  each  of 
which  has  two  extracellular  domains  and  a  trans- 
membrane and  cytoplasmic  segment. 

Both  class  I  and  class  II  gene  products  serve  as 
recognition  elements,  which  bind  antigenic  pro- 
tein fragments  and  present  them  to  T  cells.  In  the 
case  of  class  I  genes,  the  presentation  is  to  T  cells 
carrying  the  CDS  co-receptor  molecule.  These 
cells  are  usually  cytotoxic  T  cells,  whose  role  is 
to  destroy  cells  that  are  virally  infected.  In  the 
case  of  class  II  molecules,  it  is  commonly  soluble 
antigen  that  is  presented,  this  time  to  helper  or 
inflammatory  T  cells  that  carry  the  CD4  co-recep- 
tor. Both  types  of  T  cells  secrete  hormone-like 
molecules  called  lymphokines,  which  in  turn  act 
on  other  cell  types — for  example,  on  B  cells, 
which  are  stimulated  to  multiply  and  to  make 
antibody. 

Much  of  our  work  since  joining  HHMI  has  cen- 
tered on  the  regulation  of  expression  of  the  class 
II  genes  and  the  study  of  their  biological  role  in 
transgenic  animals.  Class  I  and  class  II  genes  are 
both  regulated  in  vivo  by  various  lymphokines. 
For  example,  interferon-7  secreted  by  activated  T 
cells  stimulates  the  synthesis  of  MHC  class  I  and  II 
molecules  and,  as  a  result,  presumably  renders  a 
cell  better  able  to  present  antigen  and  thus  to 
potentiate  an  immune  response. 


In  the  past  year  we  have  taken  a  genetic  ap- 
proach to  attempt  to  understand  how  interferon- 
7  activates  the  synthesis  of  these  MHC  molecules. 
A  new  approach  has  been  used  to  isolate  a  series 
of  mutant  cell  lines  that  are  not  capable  of  re- 
sponding to  interferon-7.  These  cell  lines  appear 
to  have  a  series  of  different  defects,  which  we  are 
attempting  to  elucidate.  This  genetic  approach 
should  help  us  understand  how  this  important 
hormone  regulates  gene  expression  in  this  situa- 
tion and  elsewhere. 

In  studying  the  expression  of  class  II  genes,  we 
have  also  analyzed  class  II  protein  function  in  an 
animal  setting,  employing  the  strategies  of  molec- 
ular genetics  to  perturb  that  expression  and  then 
observing  the  efl'ect  on  immune  function.  A  key 
issue  in  the  functioning  of  the  immune  system  is 
how  the  body  discriminates  its  own  tissues  (self) 
from  foreign  components  such  as  pathogens.  An 
organism's  inability  to  destroy  self  tissues  is 
known  as  immune  tolerance.  Tolerance  is  gener- 
ally believed  to  occur  during  the  production  of 
new  T  cells  in  the  thymus  by  a  process  of  clonal 
deletion;  that  is,  self-reactive  T  cells  are  de- 
stroyed at  the  site  of  synthesis. 

We  have  been  interested  in  determining  the 
mechanisms  of  tolerance  to  those  components  of 
the  body  that  are  never  found  in  the  thymus. 
Transgenic  mice  can  be  used  to  study  this  pro- 
cess, since  the  expression  of  a  given  gene — and 
hence  the  protein  encoded  by  that  gene — can  be 
directed  to  the  tissue  of  choice  by  linking  the 
gene  for  the  desired  protein  to  the  regulatory  sig- 
nals that  function  in  that  specific  tissue.  We  have 
previously  performed  such  experiments  by  di- 
recting the  synthesis  of  MHC  class  II  proteins  to 
the  pancreatic  ^  cells  of  transgenic  mice.  In  these 
experiments  the  mice  were  indeed  found  to  be 
tolerant  to  the  MHC  antigens,  even  though  they 
appeared  to  be  expressed  specifically  at  these 
sites  very  distant  from  the  thymus.  Interestingly, 
it  appears  that  T  cells  that  would  normally  react 
with  this  MHC  product  are  not  eliminated,  which 
is  what  would  be  found  if  the  class  II  antigen  is 
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expressed  in  thymic  tissues.  Instead,  these  T  cells 
are  present  but  have  been  inactivated  in  some 
way,  such  that  they  are  no  longer  able  to  respond 
to  the  MHC  antigen,  either  in  the  animal  or  in 
test-tube  experiments.  As  a  result,  no  destruction 
of  the  pancreatic  tissue  occurs. 

To  determine  whether  the  above-mentioned 
mechanism  applies  to  the  more  general  situation 
of  immune  tolerance — namely  tolerance  to  pro- 
tein antigens  themselves — we  have  generated 
transgenic  mice  that  express  the  T  cell  receptor 
reactive  with  a  specific  antigen,  the  T  antigen  of 
SV40  (simian  virus  40).  We  have  obtained  from 
several  collaborators  transgenic  mice  that  ex- 
press SV40  T  antigen  in  various  peripheral  tissues 
in  the  body,  including  the  jS  cells  of  the  pancreas, 
the  secretory  tissues  of  the  pancreas  that  produce 
digestive  enzymes,  and  the  lens  of  the  eye.  In  the 
former  two  cases  these  experiments  are  analo- 
gous to  our  previous  studies,  except  that  now  we 
are  able  to  study  directly  tolerance  to  a  protein 
antigen  measured  by  its  effects  on  a  monoclonal 
population  of  T  cells  obtained  by  the  use  of  T  cell 
receptor  transgenic  mice.  In  the  case  of  the  lens 
of  the  eye,  antigen  is  expressed  in  a  compartment 
of  the  body  traditionally  believed  to  be  an  im- 
munoprivileged  site  (inaccessible  to  the  immune 
system).  Tolerance  in  this  compartment  has  not 
been  previously  studied.  We  have  now  bred  the  T 
cell  receptor  transgenic  mice  with  those  mice  ex- 
pressing antigens  in  these  respective  tissues  and 
are  studying  tolerance  mechanisms. 

We  have  also  initiated  research  to  study  the  re- 


sponse of  the  immune  system  to  the  spirochete 
Borrelia  burgdorferi,  which  causes  Lyme  dis- 
ease, the  notorious  inflammatory  disease.  We 
have  produced  recombinant  proteins  for  several 
of  the  outer-surface  components  of  this  spiro- 
chete and  have  immunized  mice  with  these  pro- 
teins. A  mouse  model  has  been  developed  by  Ste- 
phen Barthold  (Yale  University)  that  exhibits 
many  of  the  symptoms  of  the  human  disease,  in- 
cluding arthritis  and  inflammation  of  the  tissues 
surrounding  the  heart.  Mice  immunized  with 
these  proteins  were  challenged  by  infection  with 
the  Lyme  disease  agent.  Strikingly,  mice  immu- 
nized with  recombinant  outer-surface  protein  A 
(OspA)  were  fully  protected  from  infection  with 
the  spirochete.  Both  the  protection  and  the  immu- 
nity of  the  mouse  to  this  organism  appear  to  be 
long-lived.  These  encouraging  results  suggest 
that,  administered  appropriately,  recombinant 
OspA  could  serve  as  a  vaccine  for  protection  of 
humans  and  animals  against  the  Lyme  disease 
agent.  Vaccination  may  well  be  a  desired  ap- 
proach, since  this  disease  is  widespread,  causes 
serious  chronic  symptoms,  and  frequently  goes 
undetected  in  early  stages.  We  have  also  charac- 
terized the  response  of  the  human  immune  sys- 
tem to  outer-surface  proteins  of  this  organism. 
The  human  immune  response  to  OspA  is  qualita- 
tively similar  to  the  mouse  response,  suggesting 
that  it  should  be  possible  to  obtain  a  protective 
response  in  humans.  Humans,  however,  respond 
to  OspA  later  in  infection  than  do  mice,  suggest- 
ing that  further  work  may  be  necessary  before  an 
effective  human  vaccine  is  obtained. 
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ALTHOUGH  the  information  that  directs  the 
folding  of  a  protein  molecule  into  a  defined 
three-dimensional  structure  is  genetically  en- 
coded, the  mechanisms  and  pathways  of  the  fold- 
ing process  are  poorly  understood.  One  approach 
to  this  problem  is  an  analysis  of  partially  struc- 
tured folding  intermediates,  combined  with  a 
mutational  analysis.  We  use  nuclear  magnetic  res- 
onance (NMR)  spectroscopy  and  chemical  meth- 
ods to  probe  for  structural  and  kinetic  interme- 
diates in  the  folding  process. 

Many  polypeptide  sequences  adopt  a  common 
folded  motif,  but  they  frequently  differ  in  the  de- 
tailed arrangement  or  conformation  of  structural 
elements  in  ways  that  are  functionally  significant. 
Certain  loops  of  the  immunoglobulins  (antibod- 
ies) are  examples.  We  are  working  to  understand 
the  manner  in  which  the  amino  acid  sequence  of 
a  secondary  structural  element  dictates  its  de- 
tailed conformation  in  the  context  of  a  folded 
protein  molecule,  using  staphylococcal  nuclease 
as  a  model  protein  system.  We  plan  to  test  the 
generality  of  our  observations  and  conclusions  by 
examining  the  relationship  between  the  amino 
acid  sequence  of  immunoglobulin  loops  and 
their  structure  and  the  resulting  ligand  affinity. 

We  combine  a  number  of  methodologies  to  ad- 
dress these  aspects  of  protein  structure  and  fold- 
ing, including  x-ray  crystallography,  NMR  spec- 
troscopy, and  molecular  biology. 


are  used  as  a  test  system.  A  polar  chelator  has  been 
designed  and  synthesized  that  can  be  specifically 
attached  to  a  cysteine  residue  engineered  into  the 
protein  chain.  When  this  chelator  is  loaded  with 
iron  it  can  be  used  to  generate  hydroxyl  radicals, 
which  in  turn  cleave  peptide  bonds  at  positions 
in  the  protein  chains  in  proximity  to  the  chelator. 
The  cleavage  sites  can  be  determined  by  peptide 
mapping  and  protein  sequencing.  Experiments 
designed  to  characterize  the  system  and  map  dis- 
tances in  the  unfolded  chain  are  in  progress. 

Analysis  of  Protein  Folding  Using 
NMR  Spectroscopy 

In  collaboration  with  Christopher  Dobson's 
laboratory  in  Oxford,  we  have  developed  a  series 
of  NMR  experiments  to  characterize  the  equilib- 
rium-folding kinetics  of  staphylococcal  nuclease. 
These  methods  have  been  used  to  investigate  mul- 
tiple folded  and  unfolded  states  of  the  protein. 
Applied  to  nuclease  variants  that  differ  in  thermal 
stability,  the  methods  have  allowed  us  to  discrimi- 
nate between  mutants  that  influence  the  protein 
folding  pathway  and  those  that  modify  the  stabil- 
ity of  the  tertiary  structure.  By  combining  site- 
directed  mutagenesis  with  NMR  spectroscopy, 
we  have  been  able  to  quantitate  slow  intramolec- 
ular equilibria,  which  should  serve  as  a  basis  for 
the  experimental  quantitation  of  the  forces  that 
stabilize  protein  molecules. 


Mapping  Structure  in  the  Unfolded 
State  of  Proteins 

Protein  molecules  in  the  unfolded  and  molten 
globular  states  are  often  more  compact  than 
would  be  expected  for  a  true  random  coil  confor- 
mation. If  this  conformational  bias  is  toward  that 
of  the  folded  structure,  it  may  explain  the  rapid 
rate  at  which  proteins  fold.  We  have  developed  a 
chemical  approach  to  map  close  contacts  be- 
tween a  variable  reporter  residue  site  and  all 
other  residues  of  a  protein  chain  in  these  states. 
Staphylococcal  nuclease  variants  and  fragments 


Genetic  Analysis  of  a  |S-Tum 

A  sharp  change  in  the  trajectory  of  a  polypep- 
tide chain  between  secondary  structure  elements 
in  a  globular  protein  has  been  defined  as  a  reverse 
turn  or  /3-turn.  These  structures  occur  in  a  num- 
ber of  defined  geometric  types  and  frequently 
contribute  side  chains  to  the  active  site  of  the 
enzymes,  such  as  staphylococcal  nuclease  or  the 
combining  site  of  binding  proteins  such  as  the 
immunoglobulins.  We  wish  to  determine  the  se- 
quence requirements  for  the  formation  of  differ- 
ent (8-turn  types  to  understand  better  the  detailed 
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structure  of  globular  proteins  and  to  define  de- 
sign principles  for  protein  engineering. 

We  have  developed  a  genetic  system  to  deter- 
mine which  amino  acid  sequences  are  consistent 
with  a  particular  jS-turn  structure  in  staphylococ- 
cal nuclease.  Each  member  of  our  gene  library 
contains  a  unique  sequence  at  this  ^-turn.  Only  a 
small  fraction  of  the  sequences  examined  are 
consistent  with  an  enzymatically  active  and 
stable  protein  in  Escherichia  coli.  The  i8-turn 
under  consideration  is  well  removed  from  the  ac- 
tive site,  suggesting  that  the  modulation  in  the 
observed  enzyme  activity  is  due  to  changes  in  the 
stability  of  the  protein.  There  are  strong  biases  in 
the  amino  acids  occurring  at  each  position  in  the 
i8-tum.  Recently,  a  statistical  analysis  of  the  ge- 
netic data  has  led  to  a  predictive  model  for  this 
j8-turn  type  in  all  globular  proteins.  This  ap- 
proach may  be  of  general  use  in  defining  other 
sequence-secondary  structure  relationships. 

Crystal  structure  determinations  of  several 
mutants  derived  from  a  genetic  analysis  of  this 
i8-turn  site  indicate  that  substitutions  can  be  made 
without  greatly  influencing  the  tertiary  structure. 
In  contrast,  crystallographic  analyses  of  several 
point  mutants  in  another  (8-turn  near  the  active 
site  demonstrate  that  single-amino  acid  substitu- 
tions can  result  in  a  change  from  one  ;8-turn  type 
to  another,  causing  a  significant  rearrangement  in 
the  local  protein  structure.  We  are  currently 
working  to  understand  the  physical  bases  for  the 
selections  observed  in  the  genetic  experiment 
and  the  conformational  alteration  they  impart  to 
the  protein.  Results  of  this  and  related  experi- 
ments should  provide  insight  into  the  relation- 
ship between  amino  acid  sequence  and  structure 


required  for  the  rational  design  and  engineering 
of  protein  molecules. 

Structural  Basis  of  Immunoglobulin 
Maturation 

The  sequence  diversity  found  in  immunoglobu- 
lin molecules  is  generated  at  several  different  lev- 
els. As  B  cells  develop,  combinatorial  variability 
arises  from  the  rearrangement  of  germline  V,  D, 
and  J  gene  segments.  When  a  B  cell  recognizes  an 
antigen,  somatic  mutation  of  the  variable  region 
of  the  immunoglobulin  genes  is  stimulated,  add- 
ing further  diversity  to  the  immune  response.  Al- 
though these  mechanisms  have  been  well  charac- 
terized, the  structural  basis  by  which  sequence 
differences  modify  antibody  affinities  remains  rel- 
atively unexplored. 

Harden  McConnell  and  his  laboratory  at  Stan- 
ford University  have  prepared  a  panel  of  1 2  mono- 
clonal antibodies  to  a  particular  hapten  (a  small 
molecule  that  interacts  with  antibody  but  does 
not  itself  elicit  an  immune  response)  and  have 
sequenced  the  heavy-  and  light-chain  cDNAs.  The 
antibody  panel  provides  an  opportunity  to  inves- 
tigate the  structural  role  of  such  rearrangements 
in  determining  antibody-hapten  affinity. 

We  have  crystallized  a  fragment  of  one  of  these 
monoclonal  antibodies,  with  and  without  bound 
hapten,  and  have  solved  the  structure  of  the  com- 
plex, in  collaboration  with  Axel  Briinger  (HHMI, 
Yale  University).  The  crystal  structure  provides 
an  opportunity  to  assess  the  effect  of  antigen  bind- 
ing on  antibody  structure  and  will  serve  as  a  basis 
for  understanding  the  influence  on  hapten  affin- 
ity of  sequence  variants  resulting  from  gene  rear- 
rangement or  somatic  mutation. 
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Fluorescent  in  situ  hybridization  to  answer  a  clinical  question.  This 
shows  chromosomes  of  a  black  man  with  tyrosinase-negative  albinism. 
Since  only  one  mutation  in  the  tyrosinase  gene  was  identified  and  the 
man  was  also  mentally  retarded,  the  question  arose  whether  the  other 
gene  was  lost  through  a  submicroscopic  deletion.  In  situ  hybridization 
of  a  biotin- labeled  cosmid  clone  containing  the  human  tyrosinase  gene 
revealed  a  signal  (large yellow  spots)  on  both  chromosomes  11,  where 
the  tyrosinase  genes  reside,  thus  ruling  out  a  deletion  and  forcing  the 
conclusion  that  he  is  homozygous  for  the  rare  mutation. 

Cells  and  cosmid  provided  by  Richard  Spritz.  Experiment  done  by 
Chih-Lin  Hsieh,  HHMl  Associate  at  Stanford.  Imaging  on  a  Phoibos 
confocal  laser  scanning  microscope. 

From  Spritz,  R.A.,  Strunk,  K.M.,  Hsieh,  C.  L.,  Sekhon,  G.S.,  and 
Francke,  U.  1991.  Am  J  Hum  Genet  48:318-324.  ®  1991  by  the  Ameri- 
can Society  of  Human  Genetics. 
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THE  physical  and  genetic  maps  of  the  human 
and  mouse  have  undergone  spectacular 
growth  and  development  in  recent  years.  Almost 
500  homologous  loci  have  been  mapped  in  both 
species,  and  nearly  60  chromosome  regions  have 
been  delineated  that  contain  conserved  syntenic 
groups  of  genes,  through  the  efforts  of  many  labo- 
ratories, including  ours.  Thus  it  has  become  possi- 
ble, after  mapping  a  gene  in  one  species,  to  pre- 
dict the  location  of  the  homologue  in  the  other 
species.  Comparative  mapping  information  is 
also  used  to  evaluate  the  possibility  of  a  mouse 
mutation  being  a  true  model  of  a  human  genetic 
disorder. 

Our  laboratory  is  employing  chromosomal  in 
situ  hybridization  and  somatic  cell  genetic  ap- 
proaches to  locate  cloned  genes  of  know^n  func- 
tion on  human  and  mouse  chromosomes.  We  are 
using  this  information  to  define  candidate  genes 
for  human  inherited  disorders  or  for  phenotypic 
mutations  in  mice  and  to  delineate  regions  that 
contain  homologous  genes  in  both  species.  Our 
goal  is  to  identify  genes  involved  in  producing 
phenotypic  abnormalities  in  chromosomal  imbal- 
ance syndromes  and  in  inherited  disorders,  to  un- 
derstand their  function,  and — based  on  this 
knowledge — to  devise  precise  diagnostic  tests 
and  rational  treatment  strategies. 

Rett  Syndrome 

Rett  syndrome  is  a  neurodegenerative  disorder 
that  affects  1  in  15,000  girls.  The  affected  girl  is 
usually  the  only  affected  person  in  her  family. 
Although  she  appears  normal  at  birth  and  in  her 
early  infancy,  her  development  slows  and  re- 
gresses in  early  childhood.  The  disorder  is  char- 
acterized by  stereotypic  hand  movements,  loss  of 
speech,  and  profound  mental  retardation  with 
seizures  and  microcephaly. 

Although  the  progression  of  this  disorder  has 
been  clinically  delineated,  the  diagnosis  remains 
tentative  in  the  first  few  years.  Pathophysiologi- 
cal and  biochemical  investigations  have  been 
unrevealing.  The  clinical  picture  is  most  consis- 
tent with  progressive  neuronal  loss  during  early 
childhood,  reaching  a  stable  state  that  continues 


into  adulthood.  A  genetic  basis  for  this  disorder 
has  been  established  by  concordance  in  identical 
twins  and  discordance  in  fraternal  twins.  A  new 
mutation  in  a  neuron-specific  cis-acting  gene  on 
the  X  chromosome,  which  would  be  lethal  in  a 
male  but  allows  females  to  survive  due  to  their  X 
chromosome  inactivation  mosaicism,  appears  to 
be  the  most  plausible  (although  not  the  only  pos- 
sible) explanation  of  the  underlying  defect. 

We  are  defining  possible  locations  of  the  re- 
sponsible gene  by  identifying  the  regions  of  the 
maternal  X  chromosome  shared  by  affected  half- 
sisters  who  have  a  common  mother.  To  find  can- 
didate genes,  we  are  carrying  out  chromosomal 
mapping  studies  of  neuron-specific  genes  as  they 
are  cloned  and  become  available.  Genes  that  map 
to  the  X  chromosome  in  a  region  not  yet  ex- 
cluded from  containing  the  Rett  syndrome  locus 
(such  as  genes  encoding  synapsin  I,  synaptophy- 
sin,  and  serotonin  receptor  Ic)  are  being  tested  in 
25  unrelated  Rett  families  for  structural  rear- 
rangements and  sequence  changes.  Establishing 
the  molecular  defect  in  Rett  syndrome  will  pro- 
vide not  only  a  much  needed  diagnostic  test  but 
also  information  about  the  role  of  the  specific 
gene  in  the  development  and  function  of  the  hu- 
man nervous  system. 

Inherited  Dwarfism 

Individuals  with  an  autosomal  recessive  form 
of  growth  failure,  despite  having  high  levels  of 
circulating  growth  hormone,  are  diagnosed  as  suf- 
fering from  Laron  dwarfism  (LD)  when  they  lack 
the  activity  of  a  specific  growth  hormone-bind- 
ing protein  in  serum.  They  are  also  characterized 
by  low  levels  of  insulin-like  growth  factors  that 
are  normally  produced  by  liver  cells  in  response 
to  growth  hormone.  Therefore  the  underlying  de- 
fect in  LD  is  thought  to  involve  the  grov^h  hor- 
mone receptor  (GHR) .  This  receptor  is  encoded 
by  a  single  gene,  previously  mapped  by  us  to  the 
proximal  short  arm  of  human  chromosome  5  and 
to  mouse  chromosome  15.  The  sequence  of  the 
growT:h  hormone-binding  protein  is  identical  to 
that  of  the  extracellular  domain  of  the  GHR.  Rear- 
rangements of  the  GHR  gene  have  been  observed 
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only  rarely  in  LD  patients,  which  indicates  molec- 
ular genetic  heterogeneity  of  this  disorder,  and 
point  mutations  are  yet  to  be  identified. 

Our  laboratory  is  engaged  in  a  collaborative 
study  of  a  population  in  the  mountains  of  south- 
em  Ecuador  that  is  derived  from  early  Spanish 
settlers  and  suffers  from  a  high  incidence  of  LD. 
E>etailed  clinical  and  biochemical  studies  of  af- 
fected individuals  by  Arlan  L.  Rosenbloom  (Uni- 
versity of  Florida,  Gainesville)  and  his  colleagues 
have  revealed  the  classical  features  of  LD,  in  addi- 
tion to  other  manifestations  not  previously  re- 
ported. We  have  established  lymphoblastoid  cell 
lines  from  several  affected  individuals  and  family 
members  from  different  geographic  locations. 
Because  no  gross  rearrangements  of  the  GHR 
gene  have  been  identified,  we  have  designed 
primers  to  use  in  the  PGR  (polymerase  chain  reac- 
tion) amplification  of  each  of  the  1 0  exons  of  this 
gene  based  on  the  published  sequence.  Single 
nucleotide  changes  in  the  amplified  fragments 
are  detected  by  denaturing  gradient  gel  electro- 
phoresis and  confirmed  by  sequencing.  A  poly- 
morphism in  a  GHR  exon  that  cosegregates  with 
the  disease  allele  in  this  population  has  been  de- 
tected and  is  currently  used  to  develop  a  PGR- 
based  allele-specific  oligonucleotide  hybridiza- 
tion test  for  the  eventual  detection  of  gene 
carriers.  Our  search  for  the  molecular  defect  that 
renders  the  GHR  gene  nonfunctional  is  continu- 
ing. We  are  especially  interested  in  learning 
whether  a  single  mutation  is  responsible  or 
whether  more  than  one  mutation  contributes  to 
the  high  incidence  of  LD  in  this  population. 

Another  form  of  inherited  dwarfism,  autosomal 
recessive  hypopituitarism,  is  due  to  a  combined 
defect  in  three  hormones  produced  by  distinct 
cell  types  in  the  anterior  pituitary  gland:  growth 


hormone,  thyroid-stimulating  hormone,  and  fol- 
licle-stimulating hormone.  Besides  failing  to 
grow,  these  individuals  fail  to  mature  sexually 
and  may  suffer  from  hypothyroidism.  We  have 
begun  a  collaborative  study  of  an  inbred  Hispanic 
family  with  several  affected  members.  Our  ap- 
proach to  unraveling  the  molecular  defect  is 
based  on  the  human-mouse  comparative  map  and 
on  candidate  genes.  Two  mutations  in  the  mouse 
are  phenotypically  similar  to  this  disorder:  Ames 
dwarf  {df)  on  mouse  chromosome  1 1  and  Snell 
dwarf  {dw)  on  chromosome  16.  By  studying  the 
inheritance  of  genetic  markers  on  the  human 
chromosome  regions  that  are  homologous  to  the 
relevant  regions  of  mouse  chromosomes  1 1  and 
16,  we  hope  to  establish  whether  df  or  dw  is  a 
model  for  this  disorder. 

The  candidate  gene  approach  suggested  that 
Pit- 1 ,  a  pituitary  transcription  factor,  may  be  a 
likely  candidate.  Indeed,  recent  studies  from  Mi- 
chael Rosenfeld's  laboratory  (HHMI,  University 
of  Galifornia,  San  Diego)  have  documented  a  de- 
fect in  the  Pit-1  gene  in  two  different  mutant  al- 
leles at  the  dw  locus.  We  have  ruled  out  a  struc- 
tural rearrangement  of  the  Pit-1  gene  in  our 
hypopituitarism  family  and  have  mapped  the  Pit- 
1  locus  to  a  region  of  human  chromosome  3.  If 
the  human  disease  gene  cosegregates  with 
markers  in  this  region,  we  will  analyze  the  Pit-1 
gene  for  mutations.  Alternatively,  if  the  mutation 
is  homologous  to  df,  it  should  be  linked  to 
markers  on  either  human  chromosome  1 7  or  5q, 
and  the  responsible  gene  will  be  identified  after 
it  has  been  localized.  This  project  exemplifies 
the  goals  of  this  laboratory — the  generation  of 
gene  mapping  information  that  is  then  used  to 
identify  and  test  candidate  genes  for  human  in- 
herited disorders  or  mouse  mutation  models. 
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EXTENSIVE  studies  of  humans  and  other  organ- 
isms have  suggested  that  body  weight,  body 
composition  (percent  body  fat) ,  and  food  intake 
are  under  strict  physiologic  control.  The  set 
point  hypothesis  predicts  that  both  energy  intake 
and  expenditure  are  physiologically  regulated  in 
a  particular  individual  to  maintain  a  predeter- 
mined body  weight.  Implicit  in  this  hypothesis  is 
the  notion  that  signal  molecules  that  reflect  the 
nutritional  state  of  an  individual  are  synthesized 
in  the  body  (the  periphery) .  The  levels  of  these 
signals  are  sensed  by  control  centers  in  the  brain 
so  that  an  appropriate  response  is  generated  to 
maintain  a  stable  body  weight.  Studies  in  which 
separate  regions  of  the  rodent  brain  have  been 
lesioned  suggest  that  this  feeding  control  center 
resides  at  least  in  part  in  the  hypothalamus.  The 
site  of  synthesis  of  the  molecules  that  signal  nu- 
tritional state  is  unknown,  although  fat  cells,  in- 
testinal cells,  or  other  cells  of  the  gastrointestinal 
tract  have  been  suggested.  A  knowledge  of  the 
site  of  synthesis  and  the  molecular  nature  of  the 
signals  that  aff'ect  brain  centers  controlling 
appetite  could  have  important  implications  for 
our  understanding  of  nutritional  disorders  in 
humans. 

To  understand  more  about  these  signaling 
mechanisms,  we  have  been  utilizing  genetic  and 
molecular  biologic  approaches  to  study  the  role 
of  specific  gene  products  in  the  control  of  feed- 
ing behavior  and  body  composition.  (The  studies 
that  aim  to  clone  the  mutant  obesity  genes  ob  and 
db  from  mice  have  been  supported  by  the  Na- 
tional Institute  of  Diabetes,  Digestive,  and  Kid- 
ney Diseases.) 

Molecular  Basis  of  Obesity  in  ob/ob 
and  db/db  Mice 

If  one  wishes  to  understand  the  basis  for  differ- 
ences in  the  complicated  system  of  energy  ho- 
meostasis, there  are  a  number  of  experimental 
advantages  to  studying  mutant  mice,  including 
the  ability  to  control  the  animals'  environment 
and  to  set  up  genetic  crosses.  In  collaboration 
with  Rudolph  Leibel,  we  have  begun  a  study  of 
mice  carrying  recessive  mutations  that  result  in 


profound  obesity.  At  least  four  different  muta- 
tions in  mice  that  cause  obesity  are  available  for 
study:  obese  {ob),  diabetes  {db),  fatty  (fat),  and 
tubby  {tub).  In  each  case,  a  mouse  becomes 
obese  because  of  a  defect  in  a  single  gene. 

We  have  focused  on  the  ob  and  db  mutations 
for  several  reasons.  Mutant  ob  and  db  mice  be- 
come profoundly  obese,  often  weighing  three 
times  as  much  as  normal  mice.  As  in  humans,  the 
obese  phenotype  in  mutant  animals  appears  to 
result  from  both  increased  food  intake  and  dimin- 
ished energy  expenditure.  Furthermore,  Douglas 
Coleman  at  the  Jackson  Laboratory  has  suggested 
that  ob  mice  are  missing  a  circulating  factor  that 
suppresses  appetite  and  that  db  mice,  which  are 
unable  to  respond  to  the  ob  factor,  may  be  miss- 
ing the  receptor  for  this  factor. 

Current  techniques  in  molecular  genetics, 
such  as  Southern  blots  and  chromosome  walking, 
make  it  possible  to  clone  genes,  such  as  ob  and 
db,  whose  function  is  known  on  the  basis  of 
a  mutant  phenotype  but  whose  gene  product  is 
unknown.  This  approach  makes  extensive  use 
of  restriction  fragment  length  polymorphisms 
(RFLPs),  which  are  genetic  markers  defined  by 
specific  cloned  pieces  of  DNA.  The  first  step  in 
this  approach  makes  use  of  genetic  crosses  be- 
tween obese  (or  diabetic)  mice  and  normal  mice 
in  which  the  pattern  of  inheritance  of  the  obese 
phenotype  is  compared  with  that  of  RFLPs.  Spe- 
cific RFLPs  that  are  inherited  along  with  the 
obese  phenotype  are  said  to  be  linked  genetically 
to  the  obesity  gene. 

By  performing  this  analysis  on  several  hundred 
mice  with  several  dozen  different  RFLPs  we  have 
been  able  to  identify  a  series  of  different  RFLPs 
that  are  tightly  linked  to  the  ob  and  db  genes. 
RFLPs  that  are  genetically  linked  to  these  muta- 
tions are  in  physical  proximity  to  ob  and  db  and 
can  be  used  as  starting  points  to  characterize  the 
adjacent  DNA  and  clone  the  mutant  genes.  In  the 
case  of  ob  mice,  we  have  used  three  different 
RFLPs  for  several  genes,  including  the  met  onco- 
gene, a  pancreatic  enzyme  carboxypeptide  A,  the 
irp  gene,  and  the  cystic  fibrosis  gene,  to  generate 
a  detailed  genetic  map  around  the  ob  locus.  Simi- 
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larly  the  mouse  db  gene  has  been  mapped  rela- 
tive to  RFLPs  for  interferon-a  and  a  complement 
gene. 

To  identify  other  RFLPs  that  are  more  tightly 
linked  than  these  probes,  we  have  used  the  chro- 
mosomal microdissection  technique:  small  slices 
of  individual  chromosomes  are  dissected  and 
tKen  cloned.  Separate  libraries  have  been  made 
from  proximal  chromosome  6,  where  ob  maps, 
and  from  mid  chromosome  4,  where  db  maps. 
Two  probes  from  the  chromosome  4  library  that 
flank  db  are  ~  1  cM  apart,  and  two  probes  that 
flank  ob  are  ~1.5  cM  apart.  The  proximity  of 
these  markers  and  their  density  will  enable  us  to 
use  evolving  techniques  in  an  attempt  to  clone 
the  ob  and  db  genes.  Techniques  such  as  pulsed- 
field  gel  electrophoresis  and  the  cloning  of  large 
fragments  in  yeast  artificial  chromosomes  are 
currently  being  used.  The  cloning  of  these  genes 
should  further  our  understanding  of  the  mecha- 
nisms that  control  food  intake  and  body  weight. 

Cholecystokinin  Regulation,  Function,  and 
Expression  in  Human  Tumors 

The  hormone  cholecystokinin  (CCK)  was  origi- 
nally found  in  the  small  intestine  by  virtue  of  its 
ability  to  stimulate  gallbladder  contraction  and 
pancreatic  secretion  in  response  to  feeding.  High 
levels  of  CCK  have  also  been  found  in  neurons  of 
the  mammalian  brain,  where  it  functions  as  a  neu- 
rotransmitter. The  first  demonstration  that  CCK 
could  affect  behavior  was  reported  by  Gerry 
Smith  and  Dick  Gibbs,  who  showed  that  peripher- 
ally administered  CCK  had  an  appetite-suppress- 
ing effect  on  rats.  It  has  also  been  demonstrated 
that  CCK  antagonists  increase  feeding  behavior  in 
rodents.  These  observations  suggest  that  the  regu- 
lation and  function  of  this  gene  are  important  in 
the  control  of  appetite. 

We  have  been  conducting  experiments  to  elu- 
cidate the  molecular  mechanisms  controlling  the 
expression  of  the  CCK  gene  in  both  brain  and 
intestine.  In  addition,  we  are  using  a  variety  of 
techniques  to  explore  the  function  (s)  of  this  hor- 
mone, with  particular  reference  to  the  role  of  the 
CCK  gene  in  the  control  of  feeding  behavior. 

To  define  DNA  elements  involved  in  the  regula- 
tion of  the  CCK  gene,  it  was  necessary  to  identify 
cultured  cell  lines  that  express  this  gene.  Tumor 
cell  lines  derived  from  peripheral  neuroepithe- 
lioma (a  rare  pediatric  nerve  tumor  that  usually 


develops  in  the  chest  wall)  were  found  to  synthe- 
size CCK.  Cell  lines  derived  from  another  pediat- 
ric tumor,  Ewing's  sarcoma  of  bone,  also  express 
CCK  mRNA,  and  subsets  of  other  pediatric  tu- 
mors, including  rhabdomyosarcoma  (a  malignant 
muscle  tumor) ,  also  appear  to  make  CCK. 

Characterization  of  these  tumor  cell  lines  also 
suggested  that  there  is  a  class  of  pediatric  tumors 
that  express  this  hormone  and  that  synthesis  of 
this  peptide  may  be  of  diagnostic  and  prognostic 
value  in  pediatric  solid  tumors.  Of  note,  how- 
ever, is  that  biologically  active  CCK  is  generated 
only  after  enzymatic  cleavage  of  a  precursor  mole- 
cule (CCK  prohormone)  to  an  active  form.  The 
tumors  that  we  have  analyzed  appear  to  synthe- 
size the  precursor  but  lack  the  ability  to  cleave  it. 
The  failure  to  note  an  association  between  these 
tumors  and  the  expression  of  the  CCK  gene  previ- 
ously was  likely  a  consequence  of  the  failure  of 
these  tumors  to  synthesize  processed  CCK.  Since 
the  tumors  do  synthesize  CCK  mRNA  and  also  se- 
crete the  CCK  precursor,  we  have,  together  with 
Bruce  Schneider,  developed  a  novel  radioimmu- 
noassay that  specifically  detects  the  CCK  precur- 
sor. In  preliminary  experiments  it  appears  that 
the  blood  levels  of  the  CCK  precursor  are  ele- 
vated in  patients  with  these  tumors. 

The  identification  of  human  tumor  cell  lines 
that  express  the  CCK  gene  will  make  it  possible 
to  identify  the  DNA  sequences  required  for  cell- 
specific  expression  of  this  gene.  These  observa- 
tions also  suggest  that  CCK  overproduction  could 
in  some  cases  have  pathophysiologic  effects  in 
humans. 

To  describe  the  possible  effects  of  ectopic  CCK 
production  on  the  control  of  feeding  behavior, 
we  have,  in  collaboration  with  Richard  Palmiter 
(HHMI,  University  of  Washington)  and  Ralph 
Brinster  (University  of  Pennsylvania),  artificially 
expressed  high  levels  of  this  hormone  in  rodents 
by  fusing  the  metallothionein  (MT)  promoter  to 
the  CCK-coding  sequence  and  introducing  the 
MT-CCK  transgene  into  mice.  Transgenic  mice 
that  express  high  levels  of  the  CCK  precursor  in 
liver,  plasma,  and  elsewhere  are  now  available 
and  have  been  characterized.  Most  of  the  tissues 
that  expressed  the  CCK  gene  were  similar  to  the 
human  tumor  cell  lines,  in  that  none  of  the  ex- 
pressing tissues  processed  the  CCK  precursor. 
Experiments  to  target  CCK  expression  to  tissues 
(such  as  stomach  and  pituitary)  that  can  process 
the  precursor  to  active  forms  are  under  way. 
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Cultured  epidermal  cells  from  a  transgenic  mouse  expressing  a  mutant  keratin  gene.  The  cells 
were  stained  with  an  antibody  specific  to  the  mutant  keratin  protein.  Note  the  presence  of  small 
balls,  or  aggregates,  at  the  ends  of  the  filaments.  These  structures  are  aberrancies  in  the  filament 
network  caused  by  the  presence  of  the  mutant  keratin. 
Research  of  Elaine  V.  Fuchs. 
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THE  long-range  objective  of  my  research  is  to 
understand  the  biochemical  mechanisms  that 
operate  and  regulate  the  expression  of  human 
genes  during  development  and  differentiation  in 
skin.  Our  present  knowledge  of  the  biochemistry 
of  human  skin  and  its  diseases  is  limited.  Al- 
though dermatologists  have  always  directed  their 
interest  toward  human  skin  biology  and  skin  dis- 
eases, the  field  of  molecular  biology  has  only  re- 
cently approached  a  level  of  understanding  that 
permits  probing  the  complex  biochemistry  of 
human  skin.  A  major  factor  facilitating  such  stud- 
ies is  the  ability  to  grow  human  skin  cells — in- 
cluding epidermal  cells,  dermal  fibroblasts,  mela- 
nocytes, and  dermal  papillae  cells — in  tissue 
culture.  These  culture  systems  provide  essential 
experimental  models  for  studying  many  genetic 
skin  diseases  and  skin  cancers. 

Much  of  our  research  on  human  skin  has  fo- 
cused on  the  epidermis.  The  epidermis  com- 
prises about  20  cell  layers,  of  which  the  outer- 
most is  the  skin  surface.  Only  the  inner  or  basal 
layer  of  the  epidermis  is  truly  living  and  under- 
goes DNA  synthesis  and  cell  division.  Under  an  as 
yet  unidentified  trigger,  a  basal  cell  ceases  to  di- 
vide and  makes  a  commitment  to  differentiate  ter- 
minally. As  the  cell  moves  outward  to  the  skin 
surface,  it  undergoes  a  variety  of  morphological 
and  biochemical  changes.  The  most  pronounced 
of  these  changes  is  the  production  of  a  dense  net- 
work of  keratin  filaments,  which  are  tough,  resil- 
ient protein  fibers.  Many  skin  diseases  of  the  epi- 
dermis, including  psoriasis  and  basal  and 
squamous  cell  carcinomas,  involve  a  malfunc- 
tioning of  the  diflferentiative  process  that  is  fre- 
quently associated  with  some  abnormality  in  the 
production  or  organization  of  these  keratin  fila- 
ments. Our  investigation  is  focused  on  the  regula- 
tion of  the  expression  of  keratin  proteins  and 
their  genes  in  human  epidermis  and  in  epidermal 
cells  differentiating  in  tissue  culture. 

The  keratins  are  a  group  of  10-20  closely  re- 
lated proteins  (40-70  kDa)  that  form  the  10-nm 
keratin  filaments  in  the  cytoplasm  of  epidermal 
cells.  Only  a  subset  (typically  2-6)  of  the  kera- 
tins are  ever  expressed  at  any  one  time.  As  a  nor- 


mal epidermal  cell  differentiates,  it  changes  the 
subset  of  keratins  that  it  makes.  In  addition,  the 
cell  increases  its  keratin  synthesis,  leaving  the 
fully  differentiated  epidermal  cell  with  85  per- 
cent of  its  total  protein  as  keratins.  In  diseases  of 
the  skin  involving  epidermal  hyperproliferation, 
including  psoriasis  and  squamous  cell  carci- 
nomas, a  new  subset  of  keratins  not  normally 
made  in  the  epidermis  is  produced.  This  new  set 
of  keratins  is  diagnostic  for  hyperproliferative  ab- 
normalities of  the  skin. 

A  coordinated  genetic  and  biochemical  ap- 
proach is  necessary  to  determine  the  regulation 
of  the  multiple  keratins  and  to  decipher  their 
structural  and  functional  roles  in  the  differentiat- 
ing epidermal  cell.  A  number  of  years  ago,  we 
showed  that  expression  of  different  subsets  of 
epidermal  keratins  is  due  to  changes  in  the  syn- 
thesis of  different  keratin  mRNAs.  We  used  DNA 
recombinant  technology  to  show  that  the  epider- 
mal keratin  mRNAs  are  encoded  by  about  20  dif- 
ferent genes  and  that  these  genes  are  of  two  dis- 
tinct types.  Type  I  encodes  small  keratins  (40-53 
kDa);  type  II  encodes  larger  keratins  (53- 
67  kDa). 

Keratins  are  expressed  as  specific  pairs  of  type 
I  and  II  proteins.  In  the  past  year,  we  showed  that 
the  basic  subunit  of  keratin  filaments  is  a  hetero- 
dimer,  composed  of  one  molecule  of  each  of  the 
two  keratin  types.  Approximately  20,000  hetero- 
dimers  are  necessary  to  form  a  single  1 0-nm  fila- 
ment; the  assembly  process  is  energy-indepen- 
dent and  does  not  appear  to  require  any  auxiliary 
proteins  or  factors. 

Using  DNA  sequencing,  we  have  determined 
the  amino  acid  sequences  for  several  different 
keratin  pairs.  The  sequences  that  are  likely  to  be 
involved  in  the  assembly  process  are  highly  con- 
served among  keratin  pairs,  whereas  the  se- 
quences that  protrude  along  the  surface  of  the 
keratin  filament  are  different  for  different  pairs  of 
keratins.  Hence  the  structures  of  keratin  fila- 
ments assembled  from  different  protein  pairs  are 
similar,  but  they  are  coated  with  different  se- 
quences and  are  likely  to  interact  with  different 
proteins  inside  the  cell.  In  this  way  the  cytoskele- 
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tal  architecture  of  keratin  filaments  may  be  specif- 
ically tailored  to  suit  the  particular  structural 
needs  of  each  epidermal  cell  at  various  stages  of 
differentiation  and  development. 

To  determine  the  molecular  details  of  the  fila- 
ment assembly  process  and  to  investigate  the  na- 
ture of  interactions  of  keratin  filaments  with 
other  proteins  and  organelles,  we  have  used  dele- 
tion and  site-directed  mutagenesis  to  engineer  al- 
terations in  the  coding  sequences  of  K5  and  Kl4, 
the  keratin  pair  expressed  in  the  living  cells  of 
the  epidermis.  To  generate  substantial  quantities 
of  keratins  for  filament  assembly  studies,  we  used 
genetic  engineering  to  overexpress  wild-type  and 
mutant  human  keratins  in  bacteria,  which  do  not 
have  keratin  filament  networks.  We  purified 
these  keratins  and  examined  the  consequences  of 
mutations  and  deletions  on  keratin  filament  as- 
sembly in  vitro.  We  have  begun  to  identify  those 
sequences  involved  in  filament  elongation  and 
those  that  are  more  important  for  lateral  associa- 
tions. In  addition,  we  are  using  gene  transfection 
of  human  tissue  culture  cells  to  examine  1)  the 
dynamics  of  keratin  filament  assembly  in  vivo 
and  2)  how  the  abnormal  expression  of  keratins 
in  malignant  and  hyperproliferating  epidermal 
cells  might  change  their  biology  and  cytoskeletal 
architecture.  The  results  of  these  studies  have  be- 
gun to  yield  valuable  insights  into  the  complex 
assembly  process  of  keratin  filaments  and  into  the 
function  that  these  filaments  perform  in  provid- 
ing a  relatively  indestructible  skin  surface. 

In  addition  to  examining  the  structure  of  kera- 
tin filaments,  we  have  also  been  interested  in  de- 
termining whether  there  might  be  natural  muta- 
tions in  human  epidermal  keratin  genes  that 
could  lead  to  a  genetic  skin  disease.  We  had  previ- 
ously shown  that  elevated  epidermal  keratin  gene 
expression  is  a  relatively  late  event  in  develop- 
ment and  that  certain  keratin  mutants  have  a  dom- 
inant effect  in  cultured  cells  (i.e.,  they  integrate 
into  and  disrupt  the  endogenous  keratin 
filament  network).  In  the  past  year,  we  made 
transgenic  mice  and  used  epidermal  keratin  pro- 
moters to  target  expression  of  some  of  these  domi- 


nant mutant  keratins  to  the  mouse  epidermis.  Un- 
expectedly, the  transgenic  mice  exhibited  a 
phenotype  that  strongly  resembles  that  of  pa- 
tients with  epidermolysis  bullosa  simplex  (EBS), 
a  dominant  and  sometimes  devastating,  blistering 
human  skin  disease  of  unknown  etiology.  We  are 
currently  examining  patients  with  EBS  to  deter- 
mine whether  they  have  mutations  in  their  epi- 
dermal keratin  genes. 

We  are  also  analyzing  the  complexity  of  the 
multiple  keratin  genes  in  the  human  genome  and 
the  mechanisms  underlying  their  differential  reg- 
ulation. We  have  already  shown  that  keratin  ex- 
pression is  controlled  at  the  transcriptional  level 
and  that  it  is  regulated  by  vitamin  A.  The  two 
genes  expressed  in  the  basal  layer  of  the  epider- 
mis have  been  isolated  and  characterized,  and  we 
are  focusing  on  how  these  genes  are  regulated 
during  development  and  how  their  expression  is 
influenced  by  retinoids.  In  the  past  year,  we  iden- 
tified proximal  and  distal  domains  that  synergisti- 
cally  act  to  regulate  expression  of  one  of  these 
basal  genes.  Our  goal  is  to  identify  the  sequences 
and  transcription  factors  involved  in  the  regula- 
tion of  these  genes.  This  analysis  of  the  factors 
controlling  the  cell's  major  structural  genes 
should  lead  us  to  the  factors  involved  in  deter- 
mining keratinocyte  fate.  Unraveling  the  nature 
of  promoter  and  enhancer  sequences  involved  in 
regulating  epidermal  genes  will  not  only  be  im- 
portant for  understanding  epidermal  develop- 
ment but  may  also  be  useful  for  targeting  prod- 
ucts to  the  epidermis,  e.g.,  in  drug  therapy. 
Because  epidermal  cells  can  be  removed  from  a 
patient,  cultured  in  vitro,  and  grafted  back  onto 
the  patient,  such  techniques  are  potentially  use- 
ful for  medical  research. 

Knowledge  of  the  normal  regulation  processes 
of  epidermal  and  hair  differentiation  will  be  es- 
sential to  identify  the  points  at  which  these  pro- 
cesses go  awry  in  different  genetic  skin  diseases 
and  skin  cancers.  Elucidating  the  molecular  na- 
ture of  the  normal  and  abnormal  programs  of  dif- 
ferentiation in  the  skin  should  lead  to  new  and 
badly  needed  treatments  for  dermatological 
diseases. 
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THE  focus  of  the  research  in  this  laboratory 
centers  on  the  mechanisms  by  which  cells 
communicate  with  each  other — specifically  the 
mechanisms  by  which  sea  urchin  or  mammalian 
spermatozoa  detect  signals  from  the  egg.  These 
studies  have  turned  out  to  be  applicable  to  so- 
matic cells  and  have  led  to  the  identification  of  a 
new  cell  surface  receptor  family  in  humans  and 
other  mammals.  Members  of  this  family  serve  as 
receptors  for  molecules  that  regulate  blood  pres- 
sure, as  well  as  a  large  number  of  other  physiolog- 
ical processes. 

Around  1981  our  laboratory  reported  the  puri- 
fication of  a  peptide  that  can  stimulate  sperm  mo- 
tility. This  peptide  was  derived  from  media  in 
which  sea  urchin  eggs  had  been  allowed  to  stand. 
Subsequently  it  was  demonstrated  that  different 
species  of  sea  urchins  contain  different  peptides 
and  that  the  molecules  from  one  species  do  not 
necessarily  stimulate  the  sperm  cells  from  an- 
other. In  later  research  it  became  clear  that  sperm 
cells  detect  higher  concentrations  of  peptide  and 
swim  toward  them.  Since  the  highest  concentra- 
tions are  around  the  peptide-releasing  egg,  the 
sperm  cell  swims  directly  toward  the  egg  under 
normal  conditions. 

This  laboratory  set  out  to  determine  how  the 
cell  detects  the  egg  peptide.  It  was  found  that  a 
particular  protein  on  the  sperm  plasma  mem- 
brane specifically  bound  the  egg  peptide.  This 
protein  appeared  to  serve  as  the  detector  or  re- 
ceptor molecule,  but  how  it  signaled  to  the 
sperm  cell  that  a  specific  egg  peptide  had  been 
bound  was  not  clear. 

To  help  resolve  the  question,  we  purified  the 
receptor  protein  on  the  plasma  membrane.  It  was 
identified  as  the  enzyme  guanylyl  cyclase,  which 
catalyzes  the  formation  of  cyclic  GMP.  The  small 
molecule  cGMP  causes  a  change  in  the  behavior 
of  many  different  cells.  It  seemed  possible  that 
the  receptor  is  situated  with  part  of  it  outside  the 
cell,  where  it  could  bind  the  egg  peptide,  and  the 
other  part  inside,  where  it  forms  cGMP.  The 
cGMP  thus  formed  would  then  serve  as  a  signal 
that  egg  peptide  is  being  detected,  lending 
greater  speed  and  direction  to  the  spermatozoon. 


To  provide  evidence  that  the  membrane  recep- 
tor protein  is  in  fact  guanylyl  cyclase,  we  isolated 
complementary  DNA  clones  for  this  enzyme. 
Such  clones  allow  one  not  only  to  predict  the 
primary  structure  of  a  protein,  the  receptor  in 
this  case,  but  to  direct  protein  synthesis  in  quan- 
tity. Unfortunately,  the  sea  urchin  sperm  receptor 
was  not  formed  in  the  proper  manner,  and  re- 
search continues  on  its  expression. 

Under  appropriate  conditions,  DNA  will  bind 
(hybridize)  to  closely  related  DNA.  Therefore, 
sea  urchin  DNA  was  used  to  determine  whether 
mammals  contain  a  related  protein  that  might 
serve  as  a  receptor.  Clones  containing  comple- 
mentary DNA  were  isolated  from  rat  brain.  The 
DNA  sequence  revealed  that  a  rat  brain  guanylyl 
cyclase  is  a  component  of  the  plasma  membrane, 
with  approximately  one-half  of  the  protein  out- 
side and  one-half  inside  the  cell.  The  intracellu- 
lar region  of  the  sea  urchin  sperm  enzyme  is  very 
similar  to  the  part  of  the  rat  brain  enzyme  inside 
the  cell.  In  the  regions  outside  the  cell,  however, 
the  two  proteins  show  little  similarity.  This 
would  be  expected  if  guanylyl  cyclase  serves  as  a 
cell  surface  receptor  for  peptides  and  the  peptide 
of  mammals  is  different  from  that  of  the  sea  ur- 
chin. That  is,  the  detector  part  of  the  molecule 
would  need  to  change  to  recognize  a  different 
peptide,  but  intracellular  regions  of  the  receptor 
could  remain  unaltered. 

Subsequent  binding  studies  demonstrated  that 
the  cloned  rat  brain  guanylyl  cyclase  could  specif- 
ically bind  and  be  activated  by  certain  peptides 
synthesized  in  the  heart  and  brain.  These  atrial 
natriuretic  peptides  (ANPs)  regulate  blood  pres- 
sure, as  well  as  various  other  physiological 
events. 

Since  sea  urchin  sperm  cells  respond  to  pep- 
tides that  do  not  resemble  ANP,  other  animals 
may  be  expected  to  contain  yet  other  peptides 
that  interact  with  guanylyl  cyclase.  In  addition, 
multiple  membrane  forms  of  guanylyl  cyclase 
may  exist  within  mammals.  We  have  used  the 
DNA  that  encodes  the  ANP  receptor  to  determine 
whether  other  guanylyl  cyclase  receptors  exist. 
Another  receptor  with  properties  similar  to  the 
one  described  above  has  been  identified.  This 
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guanylyl  cyclase  receptor  is  similar  to  the  first 
ANP  receptor  within  intracellular  regions  but  is 
only  43  percent  identical  in  the  extracellular,  li- 
gand-binding  region.  Although  this  receptor  also 
will  bind  ANP,  high  concentrations  of  the  pep- 
tide appear  required  for  enzyme  activation, 
which  suggests  the  existence  of  a  more  potent 
natural  peptide. 

We  also  have  recently  discovered  the  existence 
of  a  third  guanylyl  cyclase  receptor.  This  receptor 
binds  small  peptides  released  from  various  bacte- 
ria that  cause  acute  diarrhea.  This  form  of  diar- 
rhea (often  referred  to  as  traveler's  diarrhea)  is 
prevalent  in  infant  humans  and  young  domestic 


animals.  The  receptor  has  the  same  general  fea- 
tures as  the  two  ANP  receptors  described  above 
but  has  a  markedly  different  amino  acid  sequence 
within  the  toxin-binding  region.  Whether  a  regu- 
latory molecule  not  of  bacterial  origin  normally 
exists  that  binds  to  this  receptor  is  not  yet  known. 

These  results  suggest  that  drugs  specific  to  dif- 
ferent forms  of  guanylyl  cyclase  may  result  in  spe- 
cific physiological  effects.  For  example,  one  re- 
ceptor may  control  smooth  muscle  relaxation  and 
be  principally  involved  in  the  regulation  of  blood 
pressure,  another  may  be  involved  in  neural 
functions,  and  yet  another  may  regulate  epithe- 
lial cell  secretion. 
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THE  immune  system  functions  to  rid  the  body 
of  foreign  objects  such  as  bacteria,  viruses, 
tumors,  and  transplants.  The  portion  of  such  mat- 
ter that  is  recognized  as  foreign  by  the  immune 
system  is  called  an  antigen.  Antigens  that  are 
found  on  cells  are  "presented"  to  the  immune 
system  by  cell  surface  molecules  called  histocom- 
patibility molecules  (also  called  HLA  molecules 
in  humans  and  H-2  molecules  in  the  mouse).  His- 
tocompatibility molecules  are  able  to  bind  anti- 
genic fragments  of,  for  example,  viruses,  and 
stimulate  white  blood  cells  (lymphocytes)  to  at- 
tack the  virus-infected  cell,  thereby  limiting  the 
spread  of  infection.  Without  these  antigen- 
presenting  molecules  the  host  would  be  unable 
to  mount  an  immune  response  against  pathogens 
and  would  not  survive. 

Different  H-2  molecules  can  bind  and  present 
different  types  of  antigens.  Because  inbred  mice 
have  about  three  different  types  of  H-2  molecules 
on  their  cells,  they  can  bind  and  present  a  large, 
but  limited,  number  of  antigens  to  the  immune 
system. 

To  ensure  the  survival  of  the  species,  it  is  bene- 
ficial that  many  varieties  of  H-2  molecules  be 
present  in  the  population.  In  this  way  there  will 
always  be  some  portion  of  the  population  that 
will  mount  an  immune  response  to  a  given  anti- 
gen. An  extraordinary  number  of  different  histo- 
compatibility molecules  have  been  found  in  al- 
most all  species  investigated.  In  humans,  the 
large  variety  of  HLA  molecules  ensures  our  sur- 
vival but  is  the  major  obstacle  confounding  tissue 
transplantation.  Our  research  interest  lies  in  the 
genetic  mechanism  that  generates  the  different 
histocompatibility  genes  in  mice  and  other 
species. 

The  H-2  genes  of  the  mouse  are  part  of  the 
larger  major  histocompatibility  complex  class  I 
multigene  family.  This  gene  family  also  contains 
genes  that  are  structurally  similar  to  H-2  genes 
and  have  unknown  functions.  The  genetic  mecha- 
nism that  generates  variety  in  H-2  genes  is  the 
microrecombination  process,  which  reassorts 
DNA  among  H-2  genes  and  other  related  class  I 
genes.  By  substituting  small  segments  of  class  I 


gene  sequences  into  H2  genes,  the  microrecom- 
bination process  can  create  new  H-2  molecules 
that  have  different  antigen-presenting  capabili- 
ties, thereby  expanding  the  immune  responsive- 
ness of  the  population. 

Our  interest  is  to  understand  better  the  mecha- 
nism underlying  the  microrecombination  pro- 
cess. This  process  has  previously  been  studied  by 
identifying  microrecombinant  mice  that  differed 
from  their  otherwise  identical  siblings  by  altered 
H-2  genes.  Since  microrecombinant  H-2  mole- 
cules elicit  skin  graft  rejection,  these  studies 
were  accomplished  by  testing  thousands  of  mice 
with  skin  grafts.  The  rejection  of  a  skin  graft  by  a 
sibling  mouse  signaled  an  alteration  in  H-2  mole- 
cules. These  labor-intensive  studies  found  that, 
on  the  average,  one  microrecombinant  mouse 
was  detected  for  every  5,000  skin  grafts 
performed. 

To  gain  further  insight  into  the  microrecom- 
bination process,  we  are  using  a  novel  approach 
to  detect  microrecombinant  H-2  genes.  Mice 
with  microrecombinant  H-2  genes  arise  from 
germ  cells  (sperm  or  egg)  that  contain  microre- 
combinant H-2  genes  formed  in  the  gonads  (ova- 
ries or  testes)  of  normal  mice.  Therefore,  if  one 
looks  in  the  sperm  or  eggs  of  normal  mice,  one 
should  find  microrecombinant  H-2  genes.  We  are 
using  this  approach  to  investigate  the  microre- 
combination process.  Since  we  can  obtain  thou- 
sands of  eggs  or  millions  of  sperm  from  each 
mouse,  we  can  analyze  the  equivalent  of  millions 
of  mice  for  microrecombinant  H-2  genes.  We  are 
using  the  polymerase  chain  reaction  (PGR)  to 
amplify  specific  H-2  genes  and  then  clone  the 
genes.  Small  DNA  probes  are  being  utilized  to 
identify  the  clones  containing  microrecombinant 
H-2  genes. 

We  are  presently  analyzing  the  microrecom- 
bination frequency  in  ovarian  cells,  as  data  from 
previous  studies  indicate  that  microrecombina- 
tions  occur  in  female  germ  cells.  We  also  hope  to 
determine  whether  microrecombinations  occur 
in  sperm  cells  as  well,  and  at  what  frequency. 
(Evidence  from  skin  graft  studies  suggested  that 
microrecombination  may  occur  only  in  female 
mice.)  Some  strains  of  mice  may  undergo  micro- 


153 


Detection  of  Genetic  Recombination  in  Germ  Cells 


recombinations  more  frequently  than  others.  Our 
goal  is  to  determine  microrecombination  fre- 
quencies in  several  mouse  strains  and  perhaps 
identify  critical  parameters  in  the  microrecom- 
bination process.  These  studies  will  contribute  to 
our  understanding  of  the  genetic  processes  that 
control  the  evolution  and  ultimately  the  function 
of  the  mammalian  immune  system. 

Although  sequence  diversity  and  polymor- 
phism are  the  hallmark  of  H-2  genes,  Qa  and  Tla 
region  genes  are  characterized  by  sequence  con- 
servation among  alleles  and  limited  polymor- 
phism. The  lack  of  polymorphism  has  been  sug- 
gested to  preclude  an  immunological  function 
for  the  products  of  these  genes.  We  have  identi- 
fied a  Qa  region  gene  whose  sequence  differs 
greatly  between  alleles  of  the  C57BL/6  and  C3H 
mice.  The  sequence  differences  between  the  two 
alleles  is  manifested  in  both  scattered  and  clus- 


tered nucleotide  substitutions.  The  clustered 
substitutions,  which  are  similar  to  those  observed 
in  the  microrecombinations  that  diversify  H-2 
genes,  may  reflect  past  microrecombination 
events  with  /f-2  and  other  Qa  region  genes.  These 
data  may  provide  the  first  evidence  that  Qa  genes 
can  be  recipients  in  the  microrecombination 
process. 

PGR  analysis  has  indicated  that  this  Qa  gene  is 
transcribed  in  some  strains  of  mice.  We  are  en- 
gaged in  an  in-depth  analysis  of  the  transcription, 
translation,  and  cell  surface  expression  of  this 
gene  and  its  product  in  order  to  ascertain  its  func- 
tion. This  gene  is  polymorphic  in  at  least  three 
strains  of  mice,  and  other  strains  are  being  ana- 
lyzed. The  diversity  and  polymorphism  of  this 
gene  suggest  an  immunological  function  for  its 
product,  perhaps  the  first  Qa  gene  to  be  ascribed 
such  a  function. 
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CELLULAR  growth  is  an  intricately  regulated 
process  that  is  tightly  controlled  by  the  coor- 
dinated interactions  of  many  genes.  Mutations 
that  alter  the  structure  or  expression  of  a  broadly 
based  class  of  genes  can  result  in  the  perturbation 
of  normal  cell  proliferation;  such  uncontrolled 
cellular  division  is  a  hallmark  of  cancer.  To  un- 
derstand better  the  molecular  events  that  dictate 
the  complex  course  of  tumor  development  and 
progression,  we  have  focused  on  the  identifica- 
tion of  these  critical  growth  control  genes  and 
characterization  of  the  mechanisms  by  which 
they  are  regulated. 

Oncogenic  Potential  of  a  Novel 
Amplified  Gene 

Studies  performed  in  this  and  other  laborato- 
ries have  provided  substantial  evidence  that  the 
physical  amplification  and  consequent  overex- 
pression  of  some  growth  control  genes  are  asso- 
ciated with  the  initiation  and/or  progression  of  a 
variety  of  mammalian  tumor  cells.  A  functional 
analysis  of  amplified  DNA  in  tumor  cells  should 
allow  the  identification  of  other  genes  that  have  a 
central  role  in  the  regulation  of  normal  cellular 
proliferation  and  that  may  contribute  to  cellular 
transformation.  We  have  identified  a  candidate 
for  such  a  gene.  Amplified  DNA  sequences  pres- 
ent in  the  tumorigenic  mouse  cell  line  3T3-DM 
have  been  analyzed  to  determine  whether  the 
presence  of  cellular  transforming  activity  is 
correlated  with  the  elevated  expression  of  any  of 
the  amplified  genes.  Our  results  have  provided 
evidence  that  the  mdm-2  gene,  which  is  ampli- 
fied more  than  50-fold  in  the  3T3-DM  cell  line, 
induces  tumorigenicity  when  overexpressed  in 
established,  nontumorigenic  cell  lines.  Analysis 
of  the  predicted  amino  acid  composition  of  the 
mdm-2  product  reveals  features  that  are  similar 
to  those  that  are  functionally  significant  in  cer- 
tain DNA-binding  proteins.  Thus  mdm-2  may  par- 
ticipate in  growth  control  pathways  by  affecting 
the  expression  of  other  genes.  We  are  raising  anti- 
sera  directed  against  the  product  of  this  evolu- 
tionarily  conserved  gene  to  facilitate  analysis  of 
its  normal  cellular  function. 


Expression  of  the  c-Ki-ras  Proto-oncogene 

Despite  its  potentially  pivotal  role  in  growth 
control  pathways,  relatively  little  is  known  about 
the  genetic  elements  that  mediate  the  expression 
of  the  c-Ki-ras  proto-oncogene.  Because  this 
gene  is  normally  expressed  constitutively  at  low 
levels  in  all  cell  types,  it  is  generally  considered  a 
member  of  a  class  of  "housekeeping"  or  growth 
control  genes.  Using  the  c-Ki-ras  gene  as  a  proto- 
type, we  have  carried  out  experiments  to  identify 
mediators  of  expression  that  might  be  common  to 
these  genes.  Functionally  important  domains 
within  the  transcriptional  control  (or  promoter) 
region  have  been  identified,  and  we  have  begun 
to  examine  specific  DNA  sequence  motifs  that 
may  represent  binding  sites  for  particular  nuclear 
regulatory  factors. 

One  promoter  element  exhibits  a  particular 
transcriptional  influence.  This  element  has  an  un- 
usual sequence  composition  (rich  in  C  +  T  nu- 
cleotides on  one  strand,  referred  to  as  a  pur/pyr 
domain)  and  binds  one  or  more  nuclear  factors. 
Several  lines  of  evidence  indicate  that  the  same  or 
related  DNA-binding  proteins  recognize  similar 
pur/pyr  domains  that  are  located  in  the  promoter 
regions  of  other  growth  control  genes.  Thus  the 
sequence  similarities  of  these  regions  may  reflect 
a  functional  relatedness.  The  potential  of  the 
pur/pyr  regions  to  adopt  unusual  structural  con- 
formations may  also  have  implications  for  the  ex- 
pression of  genes  in  which  they  are  contained; 
such  elements  may  allow  for  the  maintenance  of 
these  promoters  in  a  structure  poised  for 
transcription. 

We  have  isolated  recombinant  DNA  clones  rep- 
resenting genes  encoding  DNA-binding  proteins 
that  recognize  the  pur/pyr  motif.  The  availability 
of  these  clones  will  allow  the  in  vitro  and  in  vivo 
studies  needed  to  define  the  role  that  these  un- 
usual sequence  elements  may  play  in  mediating 
gene  expression  control. 

The  Genesis  of  Meningiomas 

A  new  avenue  of  investigation  is  aimed  at  eluci- 
dating the  molecular  changes  involved  in  the  de- 
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velopment  of  meningiomas,  a  specific  class  of 
brain  tumors  that  originate  in  the  thin  membrane 
covering  the  brain  and  spinal  cord.  Meningiomas 
are  one  of  the  most  common  tumors  of  the  central 
nervous  system,  accounting  for  up  to  20  percent 
of  all  intracranial  tumors  and  25  percent  of  intra- 
spinal neoplastic  lesions.  Currently  the  mecha- 
nisms responsible  for  the  initiation  and  progres- 
sion of  these  tumors  are  largely  unknown. 

Extensive  cytogenetic  analyses,  coupled  with 
recent  DNA  studies,  suggest  that  meningiomas  re- 
sult from  the  loss  of  function  of  a  tumor  suppres- 
sor gene,  the  meningioma  susceptibility  locus 
(msl),  located  on  human  chromosome  22.  It  is 
proposed  that  both  alleles  of  this  gene  must  be 
lost  or  functionally  inactivated  for  tumorigenesis 
to  occur.  Although  only  a  few  such  genes  have 
been  isolated,  it  has  become  clear  that  the  pro- 
tein products  of  tumor  suppressor  genes  play  crit- 
ical roles  in  negatively  regulating  cellular  prolif- 
eration. We  have  initiated  studies  to  identify  the 
msl  gene  and  other  functionally  important  genes 
whose  expression  is  missing  or  altered  in  menin- 
giomas relative  to  their  normal  precursor  cells. 

To  do  this,  we  have  employed  subtractive  hy- 
bridization and  cloning.  These  protocols  remove 


sequences  that  are  common  to  two  cell  popula- 
tions and  enrich  for  sequences  that  are  present  in 
only  one  of  these  populations.  Utilizing  cultures 
of  primary  leptomeningeal  cells  established  in 
this  laboratory  and  two  meningioma  cell  lines, 
we  have  constructed  subtraction  cDNA  libraries 
whose  cDNA  contents  are  enriched  for  sequences 
of  genes  that  are  expressed  in  normal  human  lep- 
tomeningeal cells  but  are  either  not  expressed  or 
aberrantly  expressed  in  meningioma  tumors. 
Screening  of  these  libraries  has  identified,  and 
will  continue  to  provide,  candidate  clones  for  the 
msl  tumor  suppressor  gene  and  other  genes  of 
interest.  More  detailed  characterization  of  these 
clones  is  in  progress. 

The  isolation  of  the  msl  gene  and  downstream 
affected  genes  has  diagnostic  and  therapeutic  im- 
plications for  many  tumors  of  neural  crest  origin. 
The  availability  of  the  leptomeningeal  cell  lines 
and  genetic  markers  will  provide  a  unique  set  of 
tools  to  begin  dissection  of  the  critical  genetic 
events  associated  with  the  etiology  and  pathology 
of  meningiomas  and  the  elucidation  of  what  is 
expected  to  be  a  fundamental  pathway  in  the  de- 
velopment and  differentiation  of  multiple  cell 
types  in  the  central  nervous  system. 
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THE  elegant  and  now  widely  used  methods 
for  sequencing  DNA  are  precise  and  quite 
simple,  but  labor-intensive,  expensive,  and 
frustrating  for  large  projects.  The  current  auto- 
mated instruments  based  on  fluorescence  de- 
tection methods  are  clearly  a  step  forward,  but 
another  enhancement  of  efficiency  is  needed  to 
bring  large  projects  like  sequencing  of  the  hu- 
man genome  within  range.  The  heart  of  se- 
quencing is  the  remarkably  powerful  size-frac- 
tionation  of  DNA  chains  into  ladders  with 
one-base  resolution.  In  trying  to  evolve  more 
efficient,  ladder-based  methods,  we  have  pur- 
sued two  approaches. 

Many  of  the  labor-intensive  steps  of  sequenc- 
ing, from  growth  of  bacterial  cultures  and  DNA 
preparation  to  fractionation  of  samples  on  the 
crucial  gels,  can  be  made  parallel  processes  by 
multiplexing,  as  pioneered  by  George  Church 
(HHMI,  Harvard  Medical  School)  using  chemical 
sequencing  and  by  us  using  chain  termination  se- 
quencing. Here,  many  vectors,  each  with  its  in- 
sert of  DNA  to  be  sequenced,  are  combined  at  an 
early  stage  and  sequenced  and  fractionated  as  a 
mixture.  Each  vector  in  the  mix  has  the  same  se- 
quence to  prime  DNA  synthesis  into  its  insert,  so 
that  all  are  done  in  one  simple  reaction,  but  each 
has  unique  sequences  as  tags  just  flanking  the 
insert. 

The  ladders  of  mixed  sequences  fractionated 
on  a  gel  are  transferred  to  membranes,  which  are 
then  probed  sequentially  with  radioactive  or  che- 
miluminescent  probes  that  recognize  each  of  the 
tag  sequences,  revealing  one  sequence  pattern 
after  another.  The  labor-intensive  steps  are  sim- 
plified by  the  level  of  multiplexing  (10-  to  40- 
fold),  and  the  readout  is  reduced  to  repetitive 
rounds  of  probing  that  can  be  readily  automated. 
Various  architectures  for  automation  and  signal 
detection  are  under  development. 

The  bottleneck  for  multiplex  sequencing  now 
shifts  to  other  fronts.  The  "back  end"  problem  of 
reading  the  DNA  sequence  off  of  the  autoradio- 
grams  that  result  from  each  daily  cycle  of  probing 
quickly  becomes  rate-limiting.  Automation  of 


this  crucial  sequence-calling  is  difficult  and  has 
been  getting  much  attention.  We  are  trying  a  new 
approach  based  on  communication  and  signal- 
processing  theory. 

The  "front  end"  problem  of  feeding  appro- 
priate sequences  into  the  system  is  especially 
crucial.  In  the  past,  multiplex  sequencing  has 
been  done  with  random  fragments  of  DNA  in  the 
hope  of  reassembling  the  large  sequence  by  over- 
lapping information  from  5-  to  10-fold  redundant 
sequencing.  This  becomes  increasingly  difficult 
with  increasing  project  size,  as  the  gaps  are  diffi- 
cult to  fill.  An  organized  front  end  seems  neces- 
sary. Thus  we  are  developing  in  vivo  approaches 
to  generate  and  map  the  input  inserts. 

Clones  to  be  sequenced  are  generated  in  bacte- 
rial cells  by  turning  loose  a  transposon  that  in- 
serts itself  quite  randomly  into  the  large  DNA  of 
interest.  Quick-mapping  methods  allow  assem- 
bly of  the  minimally  overlapping  set  of  clones  to 
ensure  coverage.  The  transposons  are  designed  to 
have  all  the  crucial  elements  for  multiplex  se- 
quencing in  both  directions  out  from  the  point  of 
insert,  thus  generating  twice  the  reading  length. 
Small-scale  tests  are  in  progress  and  should  tell  us 
if  this  front  end  approach  will  have  the  expected 
simplification  for  multiplex  sequencing. 

A  second  approach  to  sequencing  that  might 
provide  increases  in  efficiency  and  speed  is  capil- 
lary gel  electrophoresis.  Recent  developments 
here  and  in  other  laboratories  have  shown  that 
microbore  capillaries  with  an  internal  diameter 
of  50-70  microns  can  be  used  to  fractionate  DNA 
samples  by  size,  just  like  the  conventional  slab 
gels.  The  virtue  of  capillaries  is  that  very  high 
voltages  (10-30  kilovolts)  can  be  used,  since 
heat  dissipation  is  so  efficient.  Molecules  300- 
400  in  length  can  be  analyzed  in  minutes,  with  a 
resulting  increase  in  resolution. 

The  difficulty  is  that  the  small  gels  can  accept 
only  small  amounts  of  DNA,  so  that  detection 
must  be  very  sensitive.  Working  with  Norm  Dovi- 
chi  (University  of  Alberta,  Edmonton),  we  have 
adapted  a  sheath  flow  cuvette  for  detection  of 
four  fluorescent  tags  so  that  detection  in  the 


157 


DNA-seqtiencing  Technologies 


range  of  a  few  thousand  molecules  is  achieved 
and  all  four  bases  can  be  detected  in  the  output 
from  one  column.  Charge-coupled  devices 
(CCDs)  are  ideal  for  detection  of  these  low-fluor- 
escent signals  and  permit  spatial  resolution  that 
will  be  helpful  for  continuous  spectral  analysis 
and  for  parallel  analysis  of  multiple  capillaries. 

Commercial  instruments  for  conventional 
zone  capillary  electrophoresis  have  worked  out 
automated  sample-handling  that  can  be  adapted 


to  the  sequencing  problem.  We  see  these  auto- 
mated and  multiplexed  capillary  sequencing  in- 
struments as  potential  alternatives  to  current  fluo- 
rescent machines  for  primer-based  sequencing. 

We  have  been  struck  by  the  interest  that  col- 
leagues in  more  engineering-related  sciences 
have  shown  in  these  problems.  The  project 
has  resulted  in  a  refreshing  blend  of  interests 
that  will  bring  new  solutions  to  molecular 
investigations. 
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INVESTIGATIONS  in  this  laboratory  focus  on 
the  molecular  genetics  of  membrane  and  se- 
cretory proteins.  Experiments  involve  three  pro- 
teins: the  hemagglutinin  (HA)  of  influenza  virus, 
which  serves  as  a  marker  molecule  for  specific 
cell  populations  in  transgenic  mouse  experi- 
ments; human  tissue-type  plasminogen  activator 
(t-PA) ,  a  serine  protease  produced  in  vascular  en- 
dothelial cells  and  involved  in  thrombolysis  (dis- 
solving blood  clots);  and  BiP/GRP78,  a  protein 
of  the  endoplasmic  reticulum  (ER)  that  is  in- 
volved in  the  initial  mobilization  of  proteins  that 
traverse  the  secretory  pathway. 

Development  of  a  Model  for  Autoimmune 
Type  I  Diabetes 

In  type  I  diabetes  the  destruction  of  insulin- 
producing  pancreatic  /3  cells  is  thought  to  occur 
via  an  autoimmune  mechanism — an  immune  re- 
sponse directed  against  the  host.  Surface  mole- 
cules of  the  |S  cells  have  been  implicated  as  target 
antigens.  We  have  used  HA  as  a  marker  molecule 
to  investigate  the  mechanism  of  immunological 
tolerance  and  to  develop  an  animal  model  for  the 
study  of  diabetes. 

In  collaboration  with  Robert  Hammer  (HHMI, 
University  of  Texas  Southwestern  Medical  Center 
at  Dallas) ,  we  have  microinjected  the  HA  gene — 
placed  under  the  control  of  sequences  from  the 
rat  insulin  II  gene — into  fertilized  mouse  em- 
bryos to  generate  three  lines  of  transgenic  RIPHA 
mice  in  which  the  transgene  is  expressed  only  in 
the  (8  cells  of  the  pancreas.  We  have  now  bred 
large  numbers  of  transgenic  mice  that  are  homo- 
zygous for  the  HA  gene  and  for  the  major 
histocompatibility  gene. 

From  birth  these  transgenic  animals  have 
slightly  raised  blood  glucose  levels  compared 
with  those  in  control  animals.  Although  histologi- 
cal analysis  of  the  pancreata  of  young  RIPHA  mice 
reveals  some  disorganization  of  the  normally  or- 
dered architecture  of  the  islets,  these  animals  dis- 
play no  physiological  problems  until  approxi- 
mately 4-5  months  of  age.  However,  after  this 
time  up  to  27  percent  of  the  RIPHA  mice  develop 
frank  diabetes  mellitus  as  a  consequence  of  lym- 


phocyte-mediated autoimmune  destruction  of 
the  pancreatic  ^  cells.  Analyses  of  the  sera  of  dia- 
betic animals  often  reveal  the  presence  of  anti- 
bodies directed  against  HA,  as  well  as  antibodies 
that  recognize  proteins  of  the  /3  cells.  The  high 
blood  glucose  levels  in  the  diabetic  mice  can  be 
decreased  to  normal  levels  by  administration  of 
insulin.  Therefore  the  development  of  hypergly- 
cemia in  these  transgenic  mice  has  all  the  features 
that  typify  the  pathogenesis  of  human  type  I  dia- 
betes mellitus.  These  mice  should  provide  a  use- 
ful animal  model  for  further  studies  of  this 
disease. 

The  development  in  RIPHA  mice  of  immune 
responses  against  HA  raises  the  question  of  why 
the  protein,  which  is  expressed  on  the  /3  cells  of 
the  developing  pancreas  before  thymic  matura- 
tion, is  not  recognized  as  a  self  antigen.  Addi- 
tional lines  of  transgenic  mice  have  now  been 
developed  that  express  HA  under  the  control  of 
the  mouse  metallothionein  promoter.  These 
MTHA  mice,  which  express  high  levels  of  HA  in 
many  tissues  (including  liver  and  kidney),  will 
enable  investigation  of  the  immunological  re- 
sponses to  HA  when  the  protein  is  expressed  in 
major  cell  populations.  Thus  far,  MTHA  mice  up 
to  10  months  of  age  have  displayed  no  signs  of 
diabetes  or  any  other  autoimmune  disease. 

Human  Tissue-Type  Plasminogen  Activator 

Many  normal  and  abnormal  processes  requir- 
ing extracellular  proteolysis  are  thought  to  be 
mediated  by  plasminogen  activators  that  cleave 
plasminogen  to  the  active  protease  plasmin.  Evi- 
dence points  to  t-PA  as  the  physiological  thrombo- 
lytic agent  in  the  vascular  system. 

The  level  of  t-PA  activity  in  the  circulation  is 
controlled  by  the  interaction  of  the  molecule 
with  three  other  proteins.  First,  the  affinity  of  the 
enzyme  for  its  substrate  plasminogen  is  increased 
several  hundredfold  by  binding  to  fibrin.  Second, 
t-PA  is  rapidly  inactivated  by  the  serpin,  plasmin- 
ogen activator  inhibitor- 1  (PAI- 1 ) .  PAJ- 1  acts  as  a 
suicide  substrate  and  forms  a  covalent  bond  with 
a  serine  residue  in  the  active  site  of  t-PA.  Finally, 
the  enzyme  is  efficiently  cleared  from  the  blood 
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circulation  by  specific  t-PA  receptors  on  liver 
cells. 

The  t-PA  polypeptide  is  composed  of  a  number 
of  independent  functional  domains,  which  in- 
clude a  hydrophobic  signal  sequence,  a  short  pro 
segment,  a  finger  domain  having  homology  to  the 
fibrin-binding  finger  domains  of  fibronectin,  an 
epidermal  growth  factor  (EGF)-like  domain,  two 
kringle  structures,  and  finally  the  catalytic  do- 
main that  is  homologous  to  other  members  of  the 
serine  protease  family.  We  have  shown  that  the 
finger  and/or  EGF-like  domains  are  involved  in 
the  initial  high-affinity  binding  of  t-PA  to  fibrin, 
while  stimulation  of  t-PA  activity  involves  second- 
ary, lower  affinity  interactions  of  fibrin  with  ei- 
ther of  the  two  kringle  domains.  The  binding  of 
t-PA  to  specific  receptors  on  hepatic  cells  also 
involves  sequences  within  the  finger  and/or  EGF- 
like  domains. 

Although  the  three-dimensional  structure  of 
t-PA  has  not  been  elucidated,  we  have  been  able 
to  model  the  finger  domain,  the  EGF-like  domain, 
the  kringle  domains,  and  the  light-chain/inhibi- 
tor complex  using  the  known  structures  of  homol- 
ogous proteins.  Site-directed  mutants  designed 
using  these  proposed  structures  have  provided 
information  about  the  individual  amino  acid  se- 
quences that  interact  with  the  effector  mole- 
cules. For  example,  we  have  generated  variant 
enzymes  that  are  efficient,  fibrin-stimulated  plas- 
minogen activators  but  are  resistant  to  inhibition 
by  a  variety  of  serpins  (including  PAI- 1 )  or  do  not 
bind  to  the  t-PA  receptor(s)  involved  in  clearance 
of  the  enzyme  in  the  liver.  Because  these  mutant 
enzymes  should  have  an  extended  effective  life  in 
the  circulation,  they  may  have  significant  poten- 
tial for  use  in  thrombolytic  therapy  of  patients 
with  myocardial  infarction. 

Protein  Folding  Within  Cells 

Until  recently  it  was  widely  assumed  that  the 


folding  and  assembly  of  newly  translocated  poly- 
peptides into  their  tertiary  and  quaternary  struc- 
tures is  a  spontaneous  process  that  does  not  in- 
volve the  intervention  of  other  cellular  proteins. 
It  is  now  apparent,  however,  that  members  of 
heat-shock  protein  families,  including  the  ER- 
resident  protein  BiP,  are  intimately  involved  in 
facilitating  the  folding  and  assembly  of  nascent 
polypeptides. 

BiP  binds  transiently  to  a  variety  of  wild-type 
membrane  and  secretory  proteins  and  more  per- 
manently to  malfolded  proteins  that  are  trapped 
in  the  ER.  These  observations  led  to  the  hypothe- 
sis that  BiP  plays  a  role  in  folding  in  the  ER  lumen. 
Analysis  of  the  interaction  of  BiP  with  HA  mutants 
that  lack  various  structural  domains  has  demon- 
strated that  BiP  binds  to  amino  acids  that  form  the 
trimeric  stalk  domain  in  the  folded  HA  molecule. 
BiP  also  appears  to  bind  to  and  stabilize  partially 
folded  polypeptides  in  a  state  competent  for  fur- 
ther folding  and  oligomeric  assembly. 

Cloning  and  sequencing  of  cDNAs  encoding 
BiP  from  mammalian  and  yeast  cells  have  re- 
vealed an  extremely  high  degree  of  evolutionary 
conservation.  The  yeast  BiP  gene  is  essential  for 
cell  viability  and,  like  its  mammalian  counter- 
part, can  be  induced  by  the  accumulation  of  un- 
folded proteins  in  the  ER.  The  yeast  gene  differs 
from  the  mammalian  gene,  however,  in  also  re- 
sponding to  heat  shock.  All  the  information  re- 
quired for  accurate  transcriptional  regulation  of 
yeast  BiP  is  contained  within  a  230-base  pair  re- 
gion directly  upstream  of  the  initiation  codon. 
This  sequence  contains  a  functional  consensus 
heat-shock  element  and  an  element  that  is  respon- 
sible for  induction  of  yeast  BiP  mRNA  following 
accumulation  of  unfolded  proteins  in  the  ER. 
This  unfolded  protein  response  (UPR)  element 
displays  homology  to  conserved  sequences  pres- 
ent in  the  promoters  of  mammalian  glucose-regu- 
lated proteins. 
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SUBENDOTHELIUM 

Schematic  illustration  of  the  biosynthesis  and  function  of  von  Willebrand  factor.  Red  ovals  repre- 
sent the  factor,  which  is  synthesized  in  the  bone  marrow  megakaryocyte  (precursor  of  blood 
platelets)  and  in  the  vascular  endothelial  cell.  It  is  assembled  through  several  steps  into  a  large 
complex  containing  up  to  100  subunits.  Bound  to  von  Willebrand  factor,  in  the  circulating 
blood,  is  the  antihemophilic  factor  VIII  (blue  triangle).  In  addition  to  carrying  factor  VIII,  von 
Willebrand  factor  serves  as  a  major  adhesive  link  between  platelets  and  the  blood  vessel  wall  at 
sites  of  injury. 

Research  of  David  Ginsburg. 
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THE  major  research  activities  of  my  laboratory 
are  centered  around  the  study  of  two  impor- 
tant blood  clotting  proteins,  von  Willebrand  fac- 
tor and  plasminogen  activator  inhibitor- 1,  and 
their  associated  human  diseases.  In  addition,  we 
are  applying  molecular  tools  to  the  study  of  bone 
marrow  transplantation. 

von  Willebrand  Factor 

One  major  function  of  von  Willebrand  factor, 
which  is  an  important  part  of  the  body's  blood 
clotting  system,  is  to  serve  as  a  bridge  connecting 
blood  platelets  to  the  wall  of  injured  blood  ves- 
sels, thereby  helping  to  control  bleeding,  von 
Willebrand  factor  also  serves  as  the  carrier  for 
factor  VIII,  the  substance  missing  in  patients  with 
hemophilia.  Abnormalities  in  von  Willebrand 
factor  result  in  von  Willebrand  disease,  the  most 
common  human  inherited  bleeding  disorder,  oc- 
curring in  1-3  percent  of  the  general  population. 
Although  more  than  20  different  subtypes  of  this 
disease  have  been  described,  the  molecular  basis 
for  this  variation  is  unclear. 

In  previous  work,  we  cloned  and  characterized 
a  major  portion  of  the  gene  for  human  von  Wille- 
brand factor.  Presently  we  are  applying  these  mo- 
lecular tools  to  increase  our  understanding  of  the 
function  of  von  Willebrand  factor  and  the  molec- 
ular basis  for  von  Willebrand  disease,  von  Wille- 
brand factor  is  a  large  molecule  with  multiple 
discrete  functions.  We  are  conducting  experi- 
ments to  localize  the  various  functions  of  von 
Willebrand  factor  to  precise  regions  within  the 
molecule.  The  specific  segment  involved  in  bind- 
ing to  factor  VIII  has  been  localized,  and  we  are 
characterizing  the  regions  required  for  von  Wil- 
lebrand factor  interaction  with  the  platelet  sur- 
face and  the  blood  vessel  wall. 

We  are  also  focusing  on  the  molecular  basis  of 
human  von  Willebrand  disease.  The  defect  in 
some  patients  with  type  III  von  Willebrand  dis- 
ease (the  most  severe  form)  is  an  inability  to  copy 
the  von  Willebrand  factor  gene  into  normal  mes- 
senger RNA.  Most  of  the  patients  with  type  IIB 
von  Willebrand  disease  appeared  to  have  one  of 
four  defects,  all  clustered  within  a  small  region  of 


the  von  Willebrand  factor  gene  thought  to  be  crit- 
ical for  its  interaction  with  blood  platelets.  By 
introducing  one  of  these  defects  into  the  DNA  of 
tissue  culture  cells,  we  have  shown  that  this  sin- 
gle change  is  responsible  for  the  type  IIB  variant. 
In  similar  studies  of  type  IIA  von  Willebrand  dis- 
ease, we  identified  a  separate  set  of  defects  clus- 
tered in  a  different  region  of  the  von  Willebrand 
factor  gene.  By  introducing  these  defects  into  tis- 
sue culture  cells,  we  have  shown  that  type  IIA 
may  be  due  to  abnormalities  in  the  process 
whereby  von  Willebrand  factor  is  manufactured 
inside  the  cell.  Finally,  in  studies  of  a  patient 
whose  von  Willebrand  factor  is  unable  to  bind 
factor  VIII,  we  identified  a  single  change  in  the 
gene  that  has  helped  to  pinpoint  the  region  of 
von  Willebrand  factor  responsible  for  carrying 
factor  VIII. 

Through  these  studies,  we  hope  to  expand  our 
understanding  of  von  Willebrand  factor,  to  ad- 
vance our  ability  to  diagnose  and  classify  von 
Willebrand  disease  accurately,  and  to  improve 
the  medical  treatment  for  this  common  human 
disorder. 

Plasminogen  Activator  Inhibitor- 1 

The  fibrinolytic  system  is  the  body's  mecha- 
nism for  breaking  down  blood  clots.  This  system 
is  delicately  balanced  with  the  system  that  forms 
blood  clots.  A  precise  balance  between  these  two 
systems  is  vital.  Overactivity  of  either  system 
could  result  in  uncontrolled  bleeding  or  uncon- 
trolled blood  clotting.  Plasminogen  activator  is 
the  protein  that  turns  on  the  fibrinolytic  system. 
Its  activity  is  controlled  by  the  regulator  protein 
— plasminogen  activator  inhibitor- 1 . 

Synthetic  plasminogen  activators  are  now  used 
in  patients  to  dissolve  blood  clots,  particularly  in 
the  early  stages  of  a  heart  attack  when  a  major 
blood  vessel  to  the  heart  has  suddenly  become 
blocked.  There  is  also  increasing  evidence  that 
patients  with  abnormally  high  blood  levels  of 
plasminogen  activator  inhibitor- 1  (blocking  the 
normal  blood  clot-dissolving  activity  of  natural 
plasminogen  activator)  are  at  particularly  high 
risk  for  heart  attacks  and  other  diseases  due  to 
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increased  blood  clot  formation.  Thus  an  under- 
standing of  the  structure  and  function  of  plasmin- 
ogen activator  inhibitor- 1  and  of  its  interaction 
with  plasminogen  activators  is  of  great  signifi- 
cance in  the  study  of  the  fibrinolytic  system  and 
its  role  in  many  important  human  diseases. 

For  these  reasons  we  have  cloned  the  gene  for 
human  plasminogen  activator  inhibitor- 1  and 
have  characterized  its  structure.  We  have  suc- 
cessfully produced  recombinant  human  plasmin- 
ogen activator  inhibitor- 1  in  bacterial  cells  and 
are  using  this  system  to  explore  its  interactions 
with  other  components  of  the  clotting  system  and 
to  determine  which  portions  are  most  important 
for  its  functions.  This  work  should  lead  to  an  im- 
proved understanding  of  the  role  of  the  fibrino- 
lytic system  in  human  diseases  and  to  the  develop- 
ment of  new  treatment  approaches  for  these 
important  disorders. 

Bone  Marrow  Transplantation 

Bone  marrow  transplantation  is  being  used 


with  increasing  frequency  to  treat  a  variety  of  dis- 
eases, including  several  types  of  leukemia  and  a 
number  of  other  cancers.  In  this  procedure,  doses 
of  radiation  and  chemotherapy  are  given  that  are 
designed  to  destroy  the  patient's  bone  marrow 
and  would  ordinarily  be  fatal.  The  patient  is  then 
"rescued"  by  the  transplantation  of  normal  bone 
marrow  from  a  healthy  donor. 

We  have  used  DNA  probes  to  study  the  patient's 
blood  cells  after  the  transplant,  to  determine 
which  cells  have  come  from  the  donor  bone 
marrow  and  which  have  come  from  any  remain- 
ing patient  bone  marrow  that  survived  the  proce- 
dure. With  recent  advances  we  have  been  able  to 
extend  this  approach  to  study  the  origin  of  even  a 
single  blood  cell.  In  addition  to  increasing  our 
understanding  of  the  biology  of  bone  marrow 
transplantation,  this  information  may  prove  use- 
ful in  directing  treatment  after  the  transplant  and 
may  help  predict  which  patients  are  at  risk  for 
specific  complications. 
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THE  genetic  basis  of  hemophilia  has  been  un- 
derstood since  biblical  times,  when  mothers 
of  boys  who  had  bled  to  death  at  circumcision 
were  advised  not  to  have  future  sons  circum- 
cised. The  most  common  form  of  this  disease,  he- 
mophilia A,  affects  1  male  in  5,000  throughout 
the  world.  It  results  from  mutations  in  the  gene 
coding  for  a  blood  coagulation  protein  called  fac- 
tor VIII.  The  factor  VIII  gene  is  located  on  the  X 
chromosome,  one  of  the  two  sex  chromosomes. 
Consequently,  hemophilia  A  is  inherited  as  a 
"sex-linked"  trait.  Males,  having  a  single  X  chro- 
mosome, are  afflicted  with  hemophilia  if  their 
factor  VIII  gene  is  mutated,  but  females  are  pro- 
tected from  manifesting  the  disease  by  a  second, 
normal  X  chromosome.  Thus  females  may  be  si- 
lent carriers  of  the  disease  but  transmit  it  geneti- 
cally to  their  sons. 

Our  laboratory  is  interested  in  understanding 
what  types  of  mutations  lead  to  hemophilia  A  and 
how  these  mutations  are  generated.  The  disease  is 
well  suited  to  studies  on  mutagenesis  because  it 
is  clinically  heterogeneous,  implicating  a  wide 
variety  of  mutations  in  the  factor  VIII  gene. 
Correlating  the  types  of  mutations  with  particular 
clinical  findings  may  be  very  helpful  in  under- 
standing the  role  of  factor  VIII  in  coagulation.  In 
addition,  many  cases  of  hemophilia  occur  sporad- 
ically in  families  without  any  history  of  the  dis- 
ease. This  observation  suggests  that  many  muta- 
tions arise  anew  in  affected  individuals  or  their 
mothers.  Knowledge  of  mutations  allows  accu- 
rate prenatal  diagnosis  and  detection  of  female 
carriers  by  analysis  of  the  mutation  itself. 

Because  the  factor  VIII  gene  is  unusually  large, 
one  of  the  major  obstacles  in  this  project  lies  in 
finding  the  mutations.  We  have  experimented 
with  several  different  mutation  screens  and  have 
had  considerable  success  with  a  technique  called 
denaturing  gradient  gel  electrophoresis.  This 
technique  is  based  on  the  principal  that  DNA 
fragments  differing  by  a  single  base  pair  (e.g., 
normal  vs.  mutated)  also  differ  in  their  thermal 
stability.  In  practice,  the  thermal  stability  of  DNA 
fragments  from  normal  and  hemophilic  individ- 
uals is  assayed  on  gels  containing  a  chemical  de- 


naturant,  which  has  the  same  destabilizing  effect 
as  temperature. 

We  have  focused  on  finding  mutations  in  DNA 
samples  from  patients  with  mild  and  moderately 
severe  hemophilia.  Our  experience  suggested 
that  these  patients  would  be  likely  to  have  amino 
acid  coding  changes.  We  developed  a  strategy  of 
screening  for  mutations  in  factor  VIII  coding  re- 
gions by  denaturing  gradient  gel  electrophoresis. 
Mutations  were  found  in  21  of  the  35  patients 
screened  to  date.  These  mutations  are  scattered 
throughout  the  factor  VIII  gene  and  appear  to  be 
quite  diverse. 

Having  identified  the  mutations  responsible 
for  hemophilia  in  some  patients  whose  disease  is 
sporadic,  we  have  had  the  opportunity  to  investi- 
gate the  origin  of  the  mutations.  In  the  past,  mu- 
tations were  assumed  to  occur  as  isolated  events 
in  either  eggs  or  sperm.  However,  we  have  evi- 
dence that  some  mutations  occur  much  earlier,  at 
some  point  during  parental  development,  ren- 
dering the  parent  "mosaic"  for  mutant  and  nor- 
mal cells  in  different  parts  of  the  body.  Conse- 
quently, many  cells  in  the  parent's  germline  may 
carry  the  mutation,  but  his  or  her  blood  cells,  in 
which  the  presence  of  the  mutation  is  tested,  may 
not.  Mosaicism  has  also  been  found  associated 
with  other  X-linked  and  dominantly  inherited 
disorders.  These  findings  demonstrate  that  the 
unaffected  parent  of  a  child  with  sporadic  disease 
may  be  at  substantial  risk  of  having  a  second  child 
with  the  disease. 

In  a  second  line  of  research,  our  laboratory  is 
exploring  DNA  sequences  near  the  factor  VIII 
gene.  These  are  of  great  interest  because  many 
genetic  diseases  map  near  hemophilia  A.  For  ex- 
ample, two  of  the  most  common  X-linked  dis- 
eases, red  and  green  color  blindness  and  glucose- 
6-phosphate  dehydrogenase  (G6PD)  deficiency, 
are  tightly  linked  to  the  hemophilia  locus.  By 
physical  mapping  methods,  we  found  that  the 
genes  for  factor  VIII,  visual  pigment,  and  G6PD 
are  located  within  a  million  base  pairs  of  each 
other.  In  collaboration  with  Barbara  Trask  at 
Lawrence  Livermore  National  Laboratory,  we 
were  able  to  establish  the  relative  "orientation" 
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of  the  three  genes,  or  the  directions  in  which 
their  messenger  RNAs  are  transcribed. 

A  number  of  much  rarer  genetic  disorders  are 
also  tightly  linked  to  the  factor  VIII  gene.  These 
rare  diseases  are  clinically  diverse  and  of 
unknown  biochemical  basis.  They  include 
Emery-Dreifuss  muscular  dystrophy,  nephro- 
genic diabetes  insipidus,  adrenoleukodystrophy, 
incontinentia  pigmenti,  and  a  manic-depressive 
disorder.  By  isolating  genes  near  the  factor  VIII 
gene,  we  may  be  able  to  uncover  the  genetic  basis 
for  one  of  these  disorders. 

The  physical  map  has  formed  the  framework 
for  identifying  candidate  disease  genes  in  the  fac- 
tor VIII  neighborhood.  These  are  found  by  virtue 
of  their  association  with  "CpG  islands,"  regions 
of  human  DNA  that  are  rich  in  the  bases  cytosine 
and  guanine.  Since  islands  are  often  found  near 
the  origin  of  messenger  RNA  transcription,  it  is 
sometimes  possible  to  identify  and  isolate  a  tran- 
scribed gene  by  isolating  a  CpG  island.  We  have 
cloned  five  islands  near  the  human  factor  VIII 
gene.  Messenger  RNAs  have  been  found  for  three 
of  the  islands,  and  we  have  isolated  cDNAs  corre- 


sponding to  two.  These  are  being  sequenced. 
Eventually  the  corresponding  genes  from  patients 
will  be  examined  for  DNA  sequence  aberrations 
to  determine  whether  the  genes  are  responsible 
for  disease. 

A  surprising  result  of  the  CpG  island  investiga- 
tion was  the  discovery  of  an  island  inside  a  non- 
coding  portion  of  the  factor  VIII  gene.  We  found 
associated  with  the  island  two  messenger  RNAs, 
transcribed  in  opposite  directions.  The  function 
of  these  "nested"  genes  is  unknown,  but  their 
relationship  to  factor  VIII  is  interesting  to  con- 
sider. Gene  A  is  contained  entirely  within  the 
noncoding  region  and  is  transcribed  in  the  direc- 
tion opposite  to  the  factor  VIII  gene.  Two  other 
transcribed  copies  of  gene  A  are  located  near  the 
factor  VIII  gene.  Their  existence  may  explain 
why  hemophiliacs  deleted  for  the  factor  VIII 
gene  have  no  disorders  other  than  hemophilia. 
Gene  B  includes  some  of  the  factor  VIII  coding 
regions,  suggesting  that  it  may  participate  in  the 
coagulation  pathway.  Currently  we  are  trying  to 
determine  whether  either  of  these  genes  pro- 
duces a  protein. 
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SEVERAL  years  ago,  while  studying  the  replica- 
tion of  cells  in  culture,  we  discovered  that 
compactin,  a  drug  that  can  prevent  the  biosynthe- 
sis of  a  key  metabolic  intermediate  called  meva- 
lonic acid,  can  also  block  the  synthesis  of  DNA 
and  prevent  cells  from  attaching  normally  to  a 
culture  dish.  Furthermore,  we  showed  that  com- 
pactin's  effects  on  DNA  synthesis  and  cell  attach- 
ment depended  on  an  induced  deficiency  of 
mevalonic  acid.  Both  effects  could  be  prevented 
by  adding  mevalonic  acid  to  the  cell  culture  me- 
dium, though  known  products  of  mevalonic  acid 
metabolism,  including  cholesterol,  were  inac- 
tive. Thus  it  appeared  that  either  mevalonic  acid 
itself  or  an  unknown  product  was  involved. 

In  an  attempt  to  identify  the  compound  that 
was  required  for  DNA  synthesis  and  cell  attach- 
ment, we  treated  cells  with  compactin  in  the  pres- 
ence of  mevalonic  acid  with  a  radioactive  tracer. 
Just  enough  exogenous  mevalonic  acid  was  used 
to  prevent  the  drug's  effects.  Then  we  analyzed 
the  radioactive  products  of  mevalonic  acid  that 
the  cells  formed.  This  led  to  the  discovery  of  a 
new  class  of  modified  proteins  in  animal  cells. 

As  much  as  40  percent  of  the  cell-associated 
radioactivity  was  attached  to  a  special  group  of 
cell  proteins  that  could  be  distinguished  by  gel 
electrophoresis.  The  radioactivity  seemed  to  be 
present  in  protein-attached  products  of  meva- 
lonic acid,  called  isoprenoid  compounds.  The 
discovery  of  this  new  type  of  modified  proteins 
opened  up  a  field  of  protein  research  that  is  now 
expanding  rapidly. 

In  follow-up  studies,  we  sought  to  identify  the 
proteins  and  characterize  the  attached  isoprenoid 
groups.  We  were  able  to  show  that  one  of  the 
modified  proteins  is  lamin  B,  a  structural  protein 
that  is  attached  to  the  inner  nuclear  membrane. 
In  collaboration  with  Michael  Gelb  (University  of 
Washington),  we  demonstrated  that  lamin  B  is 
modified  by  a  cysteine  thioether-linked  farnesyl 
group  and  that  the  farnesylated  cysteine  residue 
is  located  at  the  protein's  carboxyl-terminal  end. 
(A  farnesyl  group  is  a  1 5 -carbon  isoprenoid  com- 


pound that  is  formed  from  three  molecules  of 
mevalonic  acid.) 

These  results  are  of  considerable  interest,  as 
other  investigators  have  identified  additional 
proteins,  including  the  oncogene  product  p21 
ras,  that  are  similarly  modified.  Moreover,  there 
is  reason  to  believe  that  the  modified  carboxyl- 
terminal  cysteine  residues  in  lamin  B  and  p21  ras 
promote  binding  of  the  respective  proteins  to 
cell  membranes. 

Some  animal  cell  proteins  are  modified  by  far- 
nesyl groups;  other  animal  cell  proteins  are  modi- 
fied by  another  isoprenoid  group,  the  20-carbon 
geranylgeranyl  moiety.  In  collaboration  with  Ber- 
nard Fung  and  Stephen  Clarke  (University  of  Cali- 
fornia, Los  Angeles)  and  Masahito  Kawata  and 
Yoshimi  Takai  (Kobe  University,  Japan),  we  re- 
cently identified  several  geranylgeranylated  pro- 
teins, including  the  7-subunits  of  a  brain  hetero- 
trimeric  guanine  nucleotide-binding  protein 
and  three  low-molecular-weight  guanine  nucleo- 
tide-binding proteins — smg  p21B,  smg  p25A, 
and  G25K. 

These  results  and  those  of  other  investigators 
have  raised  a  number  of  important  questions.  At- 
tempts to  identify  and  characterize  the  enzymes 
that  catalyze  the  various  modification  reactions 
are  in  progress  in  several  laboratories.  Work  in 
our  laboratory,  done  in  collaboration  with  Fuyu- 
hiko  Tamanoi  (the  University  of  Chicago),  has 
provided  evidence  that  the  farnesyl  transferase  ac- 
tivity of  the  yeast  Saccharomyces  cerevisiae  de- 
pends on  two  different  genes,  DPR  1 /Rami  and 
Ram2,  whereas  the  geranylgeranyl  transferase  ac- 
tivity depends  on  a  third  gene,  CDC43-  The  pre- 
cise role  of  these  genes  remains  to  be 
determined. 

Another  important  question  relates  to  the  func- 
tional significance  of  the  modification  reactions. 
Many  of  the  modified  proteins  appear  to  be  at- 
tached to  membranes,  and  the  modifying  iso- 
prenoid groups  appear  to  promote  this  attach- 
ment. But  it  is  not  clear  how  the  modifying 
groups  interact  with  the  membranes  at  a  molecu- 
lar level.  Experiments  that  address  this  question 
are  under  way. 
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OUR  laboratory  is  interested  in  the  general 
problem  of  differentiation  and  morphogen- 
esis. We  are  trying  to  understand  at  a  molecular 
level  some  of  the  factors  that  determine  the  cell 
type  into  which  an  embryonic  cell  differentiates 
and  how  the  ratios  of  the  different  cell  types  are 
then  maintained  in  an  organism.  As  a  model  sys- 
tem, we  are  using  the  slime  mold  Dictyostelium 
discoideum. 

Dictyostelium  normally  exists  as  undifferen- 
tiated single  cells  called  amoebae  that  eat  bacte- 
ria in  soil  and  decaying  leaves  and  proliferate  by 
cell  division.  When  the  amoebae  eventually  over- 
grow their  food  supply  and  starve,  they  aggregate 
together  in  groups  of  about  100,000.  Roughly  80 
percent  of  these  cells  become  spores.  (A  spore  is 
a  cell  with  a  tough  outer  coat  that  forms  an 
"escape  capsule.")  The  remaining  20  percent  of 
the  cells  form  a  stalk  about  2  mm  high  that  holds 
the  spore  cells  off  the  ground.  A  spore,  dispersed 
by  the  wind,  will  crack  open  to  release  an 
amoeba,  which  may  luckily  find  itself  in  the 
midst  of  a  new  supply  of  bacteria.  The  advantage 
of  this  organism  is  its  simplicity:  cells  differen- 
tiate into  just  two  main  types. 

Determining  Cell  Fate 

In  the  presence  of  a  food  source,  Dictyoste- 
lium cells  grow  to  a  certain  size  and  then  separate 
their  chromosomes  to  opposite  sides  of  the  cell 
and  divide  in  two.  The  cycle  of  grov^h  and  divi- 
sion then  repeats.  In  a  field  of  cells,  there  will  be 
cells  at  all  different  phases  of  this  cycle. 

Dictyostelium  uses  a  simple  and  elegant  mech- 
anism based  on  this  cycle  to  determine  whether  a 
cell  will  become  stalk  or  spore.  When  the  cells 
starve,  those  cells  that  have  recently  separated 
their  chromosomes  and  divided  will  become 
stalk  cells,  and  the  remaining  cells  become 
spores.  As  long  as  the  cells  are  randomly  distrib- 
uted with  respect  to  the  phase  of  their  cell  cycles, 
there  will  always  be  the  proper  percentage  of 
cells  in  the  "stalk"  quadrant.  We  refer  to  this  as  a 
musical  chairs  mechanism,  since  the  decision  of 
any  given  cell  to  become  either  stalk  or  spore  is 
made  by  the  phase  of  the  cycle  that  the  cell  hap- 


pens to  be  in  at  the  time  of  the  differentiation 
signal  (starvation). 

Cell-type  choice  determination  mechanisms  of 
this  sort  may  operate  in  humans,  and  aberrations 
could  thus  lead  to  birth  defects.  We  are  presently 
mutating  Dictyostelium  cells  and  isolating  mu- 
tants with  altered  cell-type  ratios.  We  will  then 
use  DNA  transformation  to  identify  the  genes  that 
were  mutated.  We  are  also  using  drugs  that 
disrupt  the  cell  cycle  to  examine  the  extent  of  the 
linkage  between  the  cell  cycle  and  cell-type 
choice. 


Sensing  Cell  Density 

During  Dictyostelium  development,  a  cell 
waits  until  it  is  near  a  large  mass  of  other  Dictyo- 
stelium cells  before  it  turns  on  the  stalk-  or 
spore-specific  genes.  Being  able  to  sense  whether 
other  cells  are  nearby  represents  a  paradigm 
for  possible  mechanisms  that  would  allow,  for 
instance,  liver  cells  to  sense  how  much  of  the 
body  is  composed  of  liver  cells.  At  present,  little 
is  known  about  the  molecular  mechanisms 
whereby  the  size  and  density  of  a  tissue  are  sensed 
by  its  individual  cells.  Such  mechanisms  would 
be  centrally  involved  in  the  regulation  of  growth 
during  development,  wound  healing,  and  regen- 
eration. In  addition,  an  important  and  relevant 
aspect  of  tumor  cells  is  that  they  have  lost  their 
ability  to  regulate  the  size  and/or  density  of  the 
tissue  and,  as  a  result,  proliferate.  One  way  this 
could  happen  would  be  if  the  tumor  cells  could 
no  longer  properly  sense  the  total  mass  of  the 
tissue. 

We  have  found  that  Dictyostelium  cells  sense 
whether  they  are  near  other  cells  by  secreting 
small  quantities  of  a  protein,  which  we  have 
named  density-sensing  factor  (DSF).  Cells  are 
sensitive  to  DSF:  above  a  threshold  concentration 
they  will  express  cell-type-specific  genes.  We 
have  done  theoretical  diffusion  calculations  and 
have  found,  in  agreement  with  our  observations, 
that  DSF  secreted  from  cells  that  are  quite  far 
from  other  cells  diffuses  away  so  quickly  that  it 
never  reaches  the  threshold  concentration.  How- 
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ever,  in  or  near  an  aggregate  of  around  1,000  or 
more  cells,  the  DSF  concentration  will  be  above 
the  threshold  value.  We  have  purified  DSF  and 
have  found  it  modified  by  attachment  of  sugars. 
We  are  investigating  whether  the  sugars  are  im- 
portant for  its  function. 

We  have  determined  the  amino  acid  sequence 
offour  small  fragments  of  DSF  and  have  used  this 


information  to  obtain  part  of  the  DSF  gene.  This 
has  been  used  in  turn  to  obtain  the  sequence  of 
amino  acids  for  part  of  the  DSF  protein.  Com- 
puter comparison  with  data  banks  of  other  pro- 
tein sequences  indicates  that  DSF  is  not  related  to 
any  known  protein.  We  are  now  starting  experi- 
ments to  see  if  mammals  have  genes  similar  to  the 
Dictyostelium  DSF  gene. 
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WE  are  interested  in  understanding  the  mo- 
lecular mechanisms  by  which  neuronal 
growth  cones  find  their  way  toward,  and  ulti- 
mately recognize,  their  correct  targets  during  de- 
velopment. Growth  cones  navigate  over  long  dis- 
tances and  often  through  a  series  of  complex 
choice  points,  appearing  to  follow  signals  on  the 
surfaces  of  cells  and  in  the  extracellular  environ- 
ment. We  would  like  to  uncover  the  molecules 
that  impart  specificity  to  the  developing  nervous 
system  and  thus  allow  growth  cones  to  recognize 
their  correct  pathways  and  targets.  To  address 
these  issues,  we  use  molecular  genetic  ap- 
proaches in  the  fruit  fly  Drosophila. 

Our  ongoing  cellular  analysis  of  the  develop- 
ing central  nervous  system  in  the  insect  embryo 
has  given  rise  to  five  major  conclusions: 

First,  certain  glial  and  mesectodermal  cells  ap- 
pear to  interact  with  one  another,  several  under- 
going specific  cell  migrations,  and  in  so  doing  lay 
down  a  pattern  for  the  major  fiber  pathways  in  the 
developing  nervous  system. 

Second,  these  glial  and  mesectodermal  cells 
provide  instructive  information  for  differential 
guidance  of  the  initial,  "pioneering"  growth 
cones  as  they  choose  which  cells  to  extend  to- 
ward or  along.  For  example,  a  specific  subset  of 
cells  at  the  midline  appear  to  provide  an  attrac- 
tive cue  for  the  growth  cones  that  extend  toward 
the  midline  and  pioneer  the  commissural  axon 
pathways. 

Third,  once  the  pathways  are  established,  the 
predominant  guide  for  "follower"  growth  cones 
is  the  surface  of  the  earlier  axons  in  these  path- 
ways. Growth  cones  are  able  to  distinguish  one 
particular  axon  bundle,  or  fascicle,  out  of  an 
array  of  many.  The  experimental  analysis  of  these 
phenomena  led  to  the  labeled  pathways  hypothe- 
sis, which  holds  that  the  nerve  pathways  are  dif- 
ferentially labeled  by  recognition  molecules  that 
enable  growth  cones  to  navigate  through  com- 
plex choice  points. 

Fourth,  our  observations  suggest  that  the  ex- 
pression of  surface  recognition  molecules  is  dy- 
namic and  regional  on  the  surface  of  individual 


neurons — that  parts  of  the  cell  are  differentially 
labeled  in  accordance  with  the  processes  around 
it  for  which  it  has  a  selective  affinity.  For  exam- 
ple, a  neuron  might  express  different  molecules 
on  the  surface  of  its  commissural  process  as  com- 
pared with  its  longitudinal  process,  reflecting  its 
changing  behavior  as  it  navigates  across  the  mid- 
line in  a  commissural  pathway  and  then  turns  into 
a  different  longitudinal  pathway. 

Finally,  having  navigated  along  a  series  of  path- 
ways, growth  cones  are  capable  of  recognizing 
their  correct  target  cells.  In  the  Drosophila  em- 
bryo, the  specificity  of  neuronal  growth  cones  for 
target  cells  is  most  clearly  studied  in  the  ability  of 
motoneuron  growth  cones  to  recognize  specific 
muscle  fibers. 

Our  molecular  genetic  approach  to  these  is- 
sues has  been  fourfold.  First,  we  have  been  study- 
ing in  Drosophila  the  expression  and  function  of 
cell  and  substrate  adhesion  molecules  that  are 
likely  to  play  a  significant  role  in  these  events. 
For  example,  we  cloned  the  three  genes  that  en- 
code the  three  subunits  of  Drosophila  laminin,  a 
substrate  adhesion  molecule  previously  shown  to 
be  a  potent  promoter  of  neurite  outgrowth  of  de- 
veloping vertebrate  neurons.  We  have  isolated  a 
lethal  mutation  in  the  gene  encoding  the  A  sub- 
unit  of  laminin  (lama)  and  are  looking  for  muta- 
tions in  the  other  two  genes. 

We  have  also  cloned  two  Drosophila  genes — 
fat  and  dachsous — that  encode  cadherin-like 
molecules  (calcium-dependent  cell  adhesion 
molecules).  One  class  of  mutations  in  the  fat 
gene  cause  a  tumor-like  overgrowth  of  epidermal 
tissues.  Another  class  of mutations,  as  well  as  a 
class  of  dachsous  mutations,  alter  the  morpho- 
genesis of  epidermal  tissues.  Thus  these  two 
members  of  the  cadherin  gene  superfamily  are 
involved  in  the  control  of  morphogenesis;  at  least 
one  of  the  genes  also  functions  as  a  tumor  sup- 
pressor gene. 

Second,  beginning  with  an  immunological  ap- 
proach, we  identified  and  cloned  the  genes  en- 
coding four  different  surface  glycoproteins.  We 
call  these  proteins  fasciclin  I  (fas  I),  fasciclin  II 
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(fas  II),  fasciclin  III  (fas  III),  and  neuroglian. 
These  glycoproteins  are  dynamically  expressed 
on  overlapping  subsets  of  axon  fascicles  and  glia 
during  embryonic  development. 

Three  of  these  molecules,  fas  II,  fas  III,  and 
neuroglian,  are  part  of  the  immunoglobulin  su- 
perfamily  and  are  highly  related  to  a  family  of 
adhesion  molecules  found  on  the  surface  of  ver- 
tebrate neurons.  The  other  protein,  fas  I,  appears 
unrelated  to  any  currently  listed  in  the  data  bank. 
Our  molecular  genetic  analysis  has  shown  that, 
by  a  process  of  alternative  RNA  splicing,  all  four 
proteins  are  generated  in  a  variety  of  different 
forms  by  different  cells  during  development.  To 
begin  to  test  the  function  of  these  proteins,  we 
used  transformation  techniques  to  induce  their 
expression  on  the  surface  of  the  S2  Drosophila 
cell  line  in  culture.  With  standard  aggregation  as- 
says and  biochemical  methods,  we  have  shown 
that  these  four  proteins  are  homophilic  adhesion 
molecules  that  can  mediate  selective  cell  sorting 
as  well  as  cell  aggregation;  apparently  fas  I  de- 
fines a  new  class  of  adhesion  molecules. 

We  are  using  genetic  analysis  and  transforma- 
tion methods  to  study  the  function  of  these  mole- 
cules in  the  developing  organism.  We  generated 
and  identified  insertional  mutations  and  small 
deletions  in  the  fas  /,  fas  II,  fas  III,  and  neuro- 
glian genes.  Complete  loss-of-function  muta- 
tions in  both  fas  I  and  fas  III  are  viable  under 
laboratory  conditions;  mutations  in  fas  II  and 
neuroglian  lead  to  lethality.  None  of  the  four 
mutations  lead  to  gross  abnormalities  in  the  devel- 
oping nervous  system.  Rather,  the  mutations  ap- 
pear to  lead  to  more  subtle  defects  in  specific 
subsets  of  neurons  and  specific  axon  pathways. 


For  example,  the  fas  II  protein  is  normally  ex- 
pressed on  a  small  subset  of  growth  cones  and 
axons  that  pioneer  and  selectively  fasciculate  in 
the  MPl  axon  fascicle.  In  the  fas  //mutant,  these 
specific  growth  cones  do  not  properly  recognize 
one  another  and  the  MPl  pathway  does  not  form; 
in  contrast,  other  axon  pathways  appear  to  de- 
velop normally.  Thus  this  member  of  the  immu- 
noglobulin superfamily  functions  in  part  as  a  neu- 
ronal recognition  molecule  for  the  MPl  axon 
fascicle. 

Third,  we  have  been  identifying  and  character- 
izing genes  that  function  in  the  early  interactions 
of  the  glia  and  mesectoderm  in  establishing  the 
initial  patterns  of  axon  pathways  or  that  function 
in  the  production  or  reception  of  the  signals  that 
attract  the  commissural  growth  cones  toward  the 
midline.  For  example,  several  new  genes  have 
been  identified  that  appear  to  function  in  the 
guidance  of  growth  cones  toward  the  midline; 
mutations  in  these  genes  specifically  disrupt  the 
ability  of  commissural  growth  cones  to  extend 
toward  and  across  the  midline. 

Fourth,  we  are  using  a  variety  of  genetic  and 
molecular  genetic  methods  to  screen  for  new 
genes  that  are  involved  in  target  recognition.  As  a 
model  system  for  the  study  of  target  recognition 
in  Drosophila,  we  focus  on  the  ability  of  moto- 
neuron growth  cones  to  find  and  recognize  their 
correct  muscles  during  embryonic  development. 
We  have  identified  several  genes  that  are  ex- 
pressed by  different  subsets  of  undifferentiated 
muscle  fibers  prior  to  innervation.  Two  of  these 
genes  encode  membrane  proteins  that  are 
members  of  the  same  gene  family  of  cell  adhesion 
and  signaling  molecules.  We  are  presently  char- 
acterizing mutations  in  these  two  genes. 


Opposite:  Visualization  of  axons  (brown  stain )  and  different  subsets  of  glia  (blue  stain,  reveal- 
ing only  nuclei)  in  or  just  outside  the  developing  central  nervous  system  in  the  embryo  of  the  fruit 
fly  Drosophila  melanogaster.  The  four  panels  show  four  different  subsets  of  embryonic  glia  ( as 
revealed  by  the  "enhancer  trap"  method)  in  relation  to  the  pattern  of  axon  pathways  that  develop 
around  them.  Panel  A  shows  the  midline  glia;  panel  B,  the  longitudinal  glia  and  several  other 
classes  of  nerve  root  and  exit  glia;  and  panel  C,  a  small  subset  of  these  glia — all  in  relation  to  the 
pattern  of  axon  pathways  in  four  or  five  segments  of  the  central  nervous  system.  Panel  D  shows 
the  four  exit  glia  just  outside  the  central  nervous  system  in  three  segments. 
From  Kldmbt,  C,  and  Goodman,  C.S.  1991.  GLIA  4:205-213. 
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Immunohistochemical  staining  of  B  lymphocytes  expressing  transgene- encoded  antigen  recep- 
tors (brown ),  located  within  the  follicular  and  marginal  zone  areas  in  the  spleen  of  an  immuno- 
globulin-gene  transgenic  mouse. 

Research  of  David  Y.  Mason,  Margaret  fones,  and  Christopher  C.  Goodnow. 
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EACH  individual  B  and  T  lymphocyte  in  the 
immune  system  expresses  antigen  receptors 
of  one  type  on  its  surface,  which  confer  on  the 
cell  an  ability  to  recognize  one  of  the  millions  of 
different  antigens;  and  there  are  millions  of  dif- 
ferent lymphocytes  in  the  immune  system.  Given 
the  system's  annihilative  powers,  it  is  remarkable 
that  tissue  components  of  our  own  bodies  are 
spared  during  immunological  attacks  on  invading 
foreign  organisms.  Normally  the  immune  system 
can  recognize  one's  tissue  components  as  "self" 
and  tolerate  them.  Self-tolerance  is  lost,  however, 
in  a  variety  of  "autoimmune"  diseases,  such  as 
systemic  lupus  erythematosus,  type  I  diabetes 
mellitus,  and  rheumatoid  arthritis,  resulting  in 
inexorable  destruction  of  particular  organs  and 
tissues.  The  mechanisms  that  maintain  self-toler- 
ance in  healthy  individuals,  and  the  factors  that 
lead  to  its  breakdown  in  autoimmune  disease,  are 
the  main  focus  of  our  laboratory. 

It  has  been  theorized  for  many  decades  that 
self-tolerance  might  somehow  result  from  the  si- 
lencing or  elimination  of  lymphocytes  bearing 
antigen  receptors  that  happen  to  recognize  self 
antigens.  To  determine  whether  this  idea  was 
correct,  however,  has  been  difficult,  since  it  is 
almost  impossible  to  track  the  life  of  any  one  cell 
among  millions. 

Advances  in  biotechnology,  in  particular  the 
advent  of  transgenic  mice,  have  opened  doors  to 
the  development  of  ways  to  follow  the  life  of  par- 
ticular immune  cells  in  vivo.  Transgenic  mice 
are  genetically  altered  at  the  outset  of  embryonic 
development  by  microinjecting  carefully  de- 
signed gene  cassettes  (transgenes)  into  fertilized 
oocytes.  With  colleagues  in  the  laboratory  of  An- 
tony Basten  at  the  University  of  Sydney,  we  pro- 
duced transgenic  mice  in  which  most  of  the  B 
lymphocytes  expressed  identical,  rather  than 
widely  differing,  antigen  receptors. 

This  was  done  by  introducing  transgenes  that 
coded  for  a  single  antibody  molecule  (since  anti- 
body molecules  serve  as  antigen  receptors  on  B 
lymphocytes) .  The  particular  molecule  was  one 
that  recognized  and  bound  a  foreign  protein,  hen 
egg-white  lysozyme,  and  because  the  transgene 


was  expressed  in  essentially  all  the  B  lympho- 
cytes, they  all  now  recognized  lysozyme  in  an 
identical  fashion. 

The  extraordinary  abundance  of  lysozyme- 
binding  B  cells  in  the  transgenic  mice  has  made  it 
possible  to  track  the  development  and  fate  of 
these  cells  in  the  body.  To  determine  the  fate  of  B 
cells  that  might  happen  to  recognize  a  self  anti- 
gen rather  than  a  foreign  one,  we  prepared  addi- 
tional transgenic  mice  in  which  transgene  cas- 
settes led  to  the  synthesis  and  production  of 
lysozyme  by  the  mouse  itself.  When  the  two  types 
of  mice  are  mated,  a  fraction  of  their  offspring 
inherit  both  types  of  transgene,  and  thus  contain 
large  numbers  of  lysozyme-binding  B  cells  that 
encounter  lysozyme  expressed  as  if  it  were  a  nor- 
mal self  constituent.  In  this  particular  case, 
where  lysozyme  is  free  in  solution  in  the  blood- 
stream, the  B  cells  are  indeed  profoundly  si- 
lenced, much  as  was  predicted  decades  before. 

Precise  alterations  in  the  design  of  the  intro- 
duced cassettes  are  allowing  us  to  build  upon  this 
model  and  explore  the  details  of  the  normal  pro- 
cess of  self-tolerance.  In  one  striking  example, 
where  lysozyme  has  been  restructured  so  that  it 
remains  bound  to  cell  membranes  of  transgenic 
animals  rather  than  free  in  the  blood,  we  find  that 
lysozyme-binding  B  cells  are  not  only  function- 
ally silenced  but  are  in  fact  completely  elimi- 
nated from  the  sites  in  lymphoid  organs  where 
immune  responses  are  normally  mounted.  This 
finding  points  to  important  differences  in  the  way 
the  immune  system  tolerates  different  classes  of 
self  antigens  in  the  body.  We  are  planning  to  ex- 
tend this  approach  to  other  important  classes  of 
self  antigen  by  constructing  additional  strains  of 
transgenic  mice. 

Another  approach  to  understanding  the  process 
of  self-tolerance  in  B  lymphocytes  makes  use  of 
the  different  forms  of  antibody,  such  as  the  immu- 
noglobulins IgM,  IgD,  and  IgG,  which  are  differ- 
entially used  as  antigen  receptors  by  B  lympho- 
cytes at  distinct  stages  in  their  life  span.  By 
redesigning  the  antibody  transgenes,  we  are 
currently  comparing  the  behavior  of  B  cells  ex- 
pressing these  different  receptor  isotypes  either 
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singly  or  in  combination.  These  studies  have  be- 
gun to  reveal  a  complex  pattern  of  crosstalk  be- 
tween IgM  and  IgD  receptors,  which  may  help  in 
dissecting  the  immediate  events  within  the  B  cell 
that  determine  whether  it  is  activated  or  rendered 
tolerant. 

Once  antigen  binds  to  receptors  on  the  B  cell 
surface,  the  choice  between  being  provoked  into 
attack  or  becoming  tolerant  is  ultimately  deter- 
mined by  biochemical  events  inside  each  recep- 
tor-bearing cell.  By  studying  the  responses  of  tol- 
erant and  nontolerant  B  lymphocytes  isolated 
from  transgenic  mice  and  maintained  in  tissue 


culture,  we  are  beginning  to  identify  some  of  the 
key  changes  involved  in  silencing  the  B  cells  that 
are  reactive  to  self.  These  include  both  modula- 
tion of  the  antigen  receptors  themselves  and 
blocking  of  the  cells'  ability  to  diflf^erentiate  into 
antibody  secretors.  Transgenic  mouse  technol- 
ogy, allowing  genetic  dissection  of  signaling  cir- 
cuits within  the  B  cells,  together  with  standard 
techniques  of  biochemical  analysis,  should  en- 
able us  first  to  gain  an  understanding  of  the  nor- 
mal controls  exerted  on  B  lymphocytes  and  even- 
tually to  illuminate  the  breakdown  of  these 
controls  in  autoimmunity. 
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THE  normal  T  cell  immune  response  defends 
the  host  from  many  infectious  organisms  (vi- 
ruses, parasites,  fungi,  and  bacteria)  and  elimi- 
nates cancer  cells  that  may  arise  in  the  body.  The 
catastrophic  consequences  of  a  defective  T  cell 
immune  response  are  well  illustrated  by  the  ac- 
quired immune  deficiency  syndrome  (AIDS),  a 
fatal  disease  caused  by  infection  of  the  helper 
subset  of  T  lymphocytes  with  human  immunode- 
ficiency virus  1  (HIV-1). 

Human  T  cell  leukemia  virus  I  (HTLV-I)  also 
infects  these  helper  T  cells  but,  instead  of  pro- 
ducing T  cell  death,  induces  neoplastic  transfor- 
mation leading  to  adult  T  cell  leukemia  (ATL). 
HTLV-I  has  also  been  recently  linked  with  a  pro- 
gressive demyelinating  syndrome  termed  tropical 
spastic  paraparesis,  which  resembles  multiple 
sclerosis.  This  laboratory  has  focused  its  research 
efforts  on  further  defining  the  biochemical  pro- 
cesses that  underlie  both  normal  T  cell  growth 
and  the  abnormal  patterns  of  proliferation  that 
follow  HIV-1  and  HTLV-I  retroviral  infection. 

The  cascade  of  biological  events  constituting 
the  normal  T  cell  immune  response  is  normally 
triggered  by  the  interaction  of  foreign  antigens 
with  antigen-specific  receptor  molecules  arrayed 
on  the  surface  of  the  resting  T  cell.  Antigen  bind- 
ing to  these  receptors  in  turn  leads  to  T  cell  acti- 
vation and  the  expression  of  various  growth- 
related  genes,  including  those  for  interleukin-2 
(IL-2,  T  cell  growth  factor)  and  its  high-affinity 
membrane  receptor.  The  subsequent  interaction 
of  IL-2  with  its  receptor  then  promotes  T  cell  pro- 
liferation, thereby  increasing  both  the  number 
and  activity  of  the  antigen-activated  T  cell  clones. 

Structure  and  Function  of  the 
Interleukin-2  Receptor 

The  functional  IL-2  receptor  corresponds  to  a 
membrane  complex  composed  of  two  known  li- 
gand-binding  proteins,  IL-2Ra  and  IL-2Rj8.  In  the 
absence  of  its  counterpart,  the  a-  and  (8-subunits 
alone  give  rise  only  to  low  or  intermediate  recep- 
tor forms,  respectively.  The  |8-subunit  plays  a 
dominant  role  in  growth  signal  transduction, 
while  the  inducible  expression  of  the  a-subunit 


gene  is  centrally  involved  in  the  tightly  regulated 
display  of  high-affinity  forms  of  the  IL-2  receptor. 

The  IL-2R/3  gene  is  constitutively  expressed  in 
many  resting  T  cells  and  natural  killer  cells.  How- 
ever, immune  activation  of  these  cells  promotes 
augmented  IL-2R(8  expression.  Recent  studies 
have  revealed  the  noncovalent  association  of  two 
different  kinds  of  protein  kinases  with  the  IL-2R|8 
subunit.  Specifically,  both  a  tyrosine  kinase  and  a 
serine-threonine  kinase  interact  with  different 
segments  of  the  intracytoplasmic  domain  of  the 
IL-2R|8  receptor  molecule.  Studies  are  now  under 
way  to  define  the  role  played  by  kinases  in  growth 
signal  transduction. 

Transcriptional  Activation  of  HIV-1 

HFV- 1  may  establish  a  latent  or  persistent  form 
of  infection  within  host  CD4^  T  cells  and  mono- 
cytes. Activation  of  these  "latent"  HIV-1  provi- 
ruses  appears  critically  dependent  upon  select 
host  transcription  factors,  including  the  NF-/cB 
family  of  proteins.  These  inducible  factors  bind 
to  and  activate  the  duplicated  /cB  enhancer  ele- 
ment present  in  the  HFV-l  long  terminal  repeat 
(LTR).  Recent  studies  indicate  that  as  many  as 
four  different  /cB-specific  proteins  are  present  in 
activated  human  T  cells.  Intriguingly,  these  four 
proteins  are  expressed  in  the  nucleus  with  dis- 
tinctly biphasic  kinetics  (two  early  and  two 
late),  a  process  that  involves  both  differences 
in  the  mobilization  of  preformed  pools  of  these 
proteins  and  changes  in  the  levels  of  gene 
expression. 

All  four  of  these  (cB-specific  binding  proteins  are 
structurally  related  to  the  \-rel  oncogene  from  the 
avian  reticuloendotheliosis  virus,  REV-T.  Remark- 
ably, wild-type  but  not  nontransforming  mutants 
of  Y-rel  specifically  bind  to  the  /cB  enhancer  and 
inhibit  NF-kB  function.  These  findings  raise  the 
possibility  that  v-re/-mediated  cellular  transforma- 
tion may  be  inextricably  linked  to  its  suppressive 
interaction  with  the  kB  enhancer. 

Trans-Regulation  of  HTLV-I  Gene  Expression 

Our  recent  studies  of  HTLV-I  have  focused  on 
the  critical  function  of  the  Tax  and  Rex  trans- 
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regulatory  proteins  encoded  by  this  virus.  The 
40-kDa  Tax  polypeptide  is  a  potent  transcrip- 
tional activator  that  induces  both  the  HTLV-I  LTR 
and  several  cellular  genes,  including  those  for  IL- 
2Ra  and  IL-2.  The  induction  of  these  various  tran- 
scription units  does  not  involve  the  direct  bind- 
ing of  Tax  to  target  DNA  sites,  but  rather  is 
mediated  through  the  indirect  interplay  of  Tax 
with  different  sets  of  endogenous  host  transcrip- 
tion factors.  Mutational  studies  of  Tax  have  per- 
mitted the  identification  of  discrete  functional 
domains  within  this  transactivator  that  are  selec- 
tively responsible  for  the  interactions  with  these 
different  host  factors.  Furthermore,  the  amino- 
terminal  47  amino  acids  comprise  two  physically 
overlapping  but  functionally  distinct  domains 
that  correspond  to  a  novel  zinc-binding  motif  and 
nuclear  targeting  signal.  These  domains  both  ap- 
pear required  for  the  biological  function  of  Tax. 

The  HTLV-I  Rex  protein  serves  as  an  essential 
post-transcriptional  regulator  of  HTLV-I  struc- 
tural gene  expression,  inducing  gag,  pol,  and 
env.  The  structural  and  enzymatic  products  are 
uniquely  translated  from  unspliced  {gag,  pot)  or 
singly  spliced  {env)  viral  mRNAs,  and  are  re- 
quired for  the  assembly  of  fully  infectious  viri- 
ons. Recent  studies  have  shown  that  Rex  action  is 
mediated  through  a  Rex  response  element,  a 
complex  RNA  stem-loop  structure  located  within 
the  3'  retroviral  LTR.  Mutational  analysis  of  the 
rex  gene  has  revealed  the  presence  of  at  least  two 
functional  domains,  one  that  mediates  direct 


binding  to  its  RNA  response  element  and  nu- 
clear/nucleolar  localization  and  a  second  that  is 
required  for  Rex  activation.  Rex  appears  to  func- 
tion either  by  promoting  the  disassembly  of  spli- 
ceosomes  engaging  these  long  gag,  pol,  and  env 
mRNAs  or  by  activating  the  nuclear  export  of 
these  incompletely  spliced  viral  mRNAs. 

Unexpectedly,  the  HTLV-I  Rex  protein  is  capa- 
ble of  functionally  replacing  the  HIV-1  Rev  pro- 
tein, acting  through  the  distinct  HIV-1  Rev  RNA 
response  element.  However,  consistent  with 
their  lack  of  amino  acid  homology.  Rex  binds  at  a 
site  in  the  HLV-l  RNA  response  element  clearly 
distinct  from  that  utilized  by  Rev.  These  findings 
highlight  a  pattern  of  convergent  evolution  in 
these  two  human  retroviruses,  whereby  each  em- 
ploys a  strikingly  similar  post-transcriptional 
strategy  for  regulating  the  expression  of  its  struc- 
tural gene  products. 

Additionally,  the  HTLV-I  Rex  response  element 
plays  an  important  role  in  the  polyadenylation  of 
all  HTLV-I  transcripts,  independent  of  the  pres- 
ence of  Rex.  Specifically,  folding  of  the  RNA  Rex 
response  element  serves  to  juxtapose  upstream 
and  downstream  binding  sites  for  essential  nuclei 
polyadenylation  factors,  which  must  interact  in  a 
cooperative  fashion  to  commit  the  HTLV-I  site  to 
effective  3'  processing.  Alterations  of  the  intrinsic 
secondary  structure  of  the  interposed  Rex  re- 
sponse element  lead  to  the  complete  loss  of  3' 
end  formation  and  correspondingly  the  abroga- 
tion of  viral  replication. 
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THE  process  of  terminal  differentiation  turns  a 
multipotential  cell  into  a  cell  that  carries  out 
a  particular  function  or  synthesizes  a  specific 
product.  The  lymphoid  B  lineage  ultimately 
generates  a  cell  that  secretes  antibody.  During 
cell  differentiation,  genes  that  encode  the  anti- 
body or  associated  proteins  are  expressed  in  a  de- 
fined cell  type-specific  and  temporally  ordered 
pattern. 

Transcription  of  the  jx  immunoglobulin  (Ig) 
gene  encoding  the  heavy  chain  of  the  antibody 
can  be  detected  in  virtually  all  lymphocytes.  By 
contrast,  the  k  Ig  light-chain  gene  is  transcribed 
only  in  late-stage  B  cells,  and  the  mb-1  gene  en- 
coding an  antibody-associated  protein  is  ex- 
pressed only  in  early-stage  B  cells.  The  goal  of  our 
research  is  to  gain  some  insight  into  the  molecu- 
lar mechanisms  that  mediate  the  developmental 
control  of  lymphoid-specific  gene  expression. 

Regulation  of  Ig  Gene  Expression  in  a 
Transgenic  Model 

Our  general  experimental  approach  is  to  intro- 
duce intact  rearranged  Ig  genes  into  cultured  lym- 
phoid and  nonlymphoid  cells  and  into  the  germ- 
line  of  mice.  Comparison  of  transcription  of  the 
exogenous  wild-type  and  mutant  genes  in  differ- 
ent cell  types  and  at  various  stages  of  develop- 
ment allow^s  the  identification  of  DNA  sequences 
that  are  instrumental  for  tissue-specific  and  tem- 
poral gene  regulation. 

Transfer  of  a  rearranged  wild-type  ^  gene  into 
the  mouse  germline  indicated  that  the  tissue- 
specific  expression  pattern  is  more  complex  than 
anticipated  from  previous  observations.  Instead 
of  a  simple  "on-state"  in  lymphoid  tissues  and 
"off-state"  in  all  nonlymphoid  tissues,  expres- 
sion of  both  the  endogenous  n  locus  and  the  ^l 
transgene  was  also  detected  in  skeletal  muscle. 
Moreover,  the  ^l  transgene  was  reproducibly  ex- 
pressed at  a  low  level  in  several  other  nonlym- 
phoid tissues.  Analysis  of  the  expression  pattern 
of  mutant  )u  transgenes  containing  intragenic  de- 
letions or  point  mutations  in  nuclear  factor-bind- 
ing sites  indicated  a  control  by  multiple  positive 
and  negative  regulatory  mechanisms. 


Consistent  with  previous  transfection  data,  /it 
transgene  expression  in  lymphoid  tissues  was  de- 
pendent upon  the  intragenic  enhancer  and  the 
binding  site  for  the  Oct-transcription  factors  in 
the  promoter.  Neither  of  these  regulatory  se- 
quences, however,  was  required  for  expression 
in  skeletal  muscle,  which  is  governed  by  a  mus- 
cle-specific control  region  located  3'  of  the 
enhancer. 

Low-level  expression  in  other  nonlymphoid 
tissues  was  independent  of  the  Oct-binding  site 
but  requires  function  of  the  intragenic  enhancer. 
Mutation  of  Es  and  E5  factor-binding  sites  in  the 
enhancer  increased  this  low-level  expression  to  a 
high  level  similar  to  that  found  in  lymphoid  tis- 
sues. The  off-state  of  the  transgene  in  liver  and 
fibroblasts,  however,  was  not  affected  by  this  mu- 
tation, suggesting  negative  regulation  by  at  least 
two  mechanisms.  Taken  together,  a  plethora  of 
regulatory  information  appears  to  be  required  for 
directing  the  complex  tissue-specific  expression 
pattern  of  the  ^l  gene  in  the  whole  animal. 

To  determine  the  molecular  basis  for  the  se- 
quential expression  of  Ig  heavy-  and  light-chain 
genes  during  B  cell  differentiation,  we  generated 
chimeric  Ig  ;u  gene  constructs,  in  which  individ- 
ual regulatory  sequences  have  been  replaced  by 
the  corresponding  regulatory  sequences  from  the 
K  light-chain  gene.  Gene  transfer  of  these  chi- 
meric transgenes  into  the  mouse  germline  and 
analysis  of  their  temporal  expression  pattern  in- 
dicated that  replacement  of  the  ^  enhancer  with 
the  K  enhancer  decreased  the  level  of  expression 
specifically  in  early-stage  B  cells  and  delayed,  at 
least  in  part,  the  appearance  of  transgene 
transcripts. 

Likewise,  replacement  of  the  ^l  promoter  with 
the  K  promoter  decreased  expression  specifically 
in  early-stage  B  cells.  Neither  the  k  enhancer  nor 
the  K  promoter,  however,  was  sufficient  for  con- 
ferring upon  the  jix  gene  the  precise  temporal  ex- 
pression pattern  of  the  endogenous  k  gene  or  an 
intact  K  transgene,  suggesting  that  the  combina- 
tion of  the  K  enhancer  and  k  promoter  may  be 
required  for  the  proper  temporal  regulation  of  Ig 
genes. 
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Lineage-  and  Stage-Specific  Expression  of 
the  mb-1  Gene 

To  gain  insight  into  the  molecular  basis  for  the 
coordinate  lineage-specificity  and  distinct  stage- 
specificities  of  genes  that  are  expressed  only  in 
lymphocytes,  we  extended  our  studies  to  the  reg- 
ulation of  the  mb-1  gene.  This  gene  is  expressed 
specifically  in  early-stage  B  cells.  Isolation  and 
characterization  of  the  mb  l  promoter  indicated 
that  a  relatively  short  DNA  fragment  contains  the 
information  for  both  lineage-  and  stage-specific- 
ity. The  mb-1  promoter  was  found  to  contain  a 
binding  site  for  a  novel  B  cell-specific  factor  that 
is  present  only  in  nuclear  extracts  from  early- 
stage  B  cells,  thus  paralleling  the  expression  pat- 
tern of  the  mb  l  gene. 

Isolation  and  Characterization  of 
Lymphoid-Specific  cDNA  Clones 

To  identify  a  set  of  new  markers  for  individual 
stages  of  the  lymphocyte  lineage  and  to  find  gene 
products  that  are  important  for  the  lymphoid  cell 
differentiation  pathway,  our  laboratory  isolated 
novel  lymphocyte-specific  cDNA  clones  by  dif- 
ferential screening  of  a  pre-B  cell  cDNA  library 
with  cDNA  from  an  erythroid  cell  line.  To  iden- 
tify candidate  transcriptional  regulators  among 
the  isolated  lymphoid-specific  cDNAs,  we  as- 


sayed proteins  expressed  from  the  cDNAs  in  vitro 
for  binding  to  DNA  cellulose. 

One  of  the  cDNA  clones,  which  was  expressed 
specifically  in  pre-B  and  T  cell  lines,  was  found 
to  encode  a  44-kDa  protein  that  binds  to  DNA 
cellulose.  Analysis  of  the  nucleotide  sequence  of 
this  cDNA  revealed  a  homology  to  the  nonhistone 
high-mobility  group  1  (HMGl)  gene  and  to  ge- 
netically defined  regulators  of  cell  specifica- 
tions that  include  fungal  mating-type  genes  and 
the  putative  mammalian  testis-determining  factor 
gene  SRY. 

This  lymphoid-specific  cDNA  clone  represents 
a  gene,  termed  LEF- 1  {lymphoid  enhancer-bind- 
ing factor  1) ,  that  is  developmentally  regulated 
and  expressed  in  T  and  pre-B  lymphocytes  but  not 
in  later  stage  B  cells  or  nonlymphoid  tissues. 
LEF-1  encodes  a  nuclear  protein  with  a  single 
HMG  box  that  was  shown  to  bind  to  a  functionally 
important  site  in  the  T  cell  antigen  receptor 
(TCR)  a  enhancer.  Maximal  TCRa  enhancer  activ- 
ity was  found  to  parallel  the  cell  type-specific 
expression  pattern  of  LEF-1.  Moreover,  forced 
expression  of  recombinant  LEF- 1  in  late-stage  B 
cells  increases  TCRa  enhancer  function.  Taken 
together,  these  data  suggest  that  LEF-  7  is  a  regula- 
tory participant  in  lymphocyte  gene  expression 
and  differentiation. 
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Cyclic  AMP- responsive  enhancer- binding  protein  (CREB)  expressed  in  germ  cells  of  the  testis. 
Shown  is  a  photomicrograph  of  a  cross-section  of  two  seminiferous  tubules  at  different  develop- 
mental stages,  VII  (left)  and  V  ( right).  The  intense  orange-brown  structures  are  the  nuclei  of  the 
round  haploid  spermatids  stained  with  an  antiserum  to  CREB.  The  relative  acellular  region  is  the 
interstitial  space  between  the  two  tubules  lined  with  the  blue  nuclei  of  the  spermatogonia  and 
spermatocytes,  the  cellular  precursors  of  the  spermatids.  The  round  spermatids  will  differentiate 
into  maturing  spermatozoa,  which  are  the  linear  blue  structures  adjacent  to  the  spermatids  (left ) 
or  juxtaposed  between  columns  of  spermatids  ( right).  CREB  is  believed  to  activate  the  transcrip- 
tion of  genes  involved  in  spermatogenesis,  under  the  regulation  of  cAMP-dependent  protein  ki- 
nase. Magnification,  X  1,500. 
Data  of  Gerard  Waeber,  Heather  Hermann,  and  foel  F.  Habener. 
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OUR  laboratory  seeks  an  understanding  of  the 
molecular  processes  involved  in  the  regula- 
tion of  gene  expression.  The  general  hypothesis 
being  tested  is  that  peptide  hormones  are  impor- 
tant regulatory  molecules  in  conveying  informa- 
tion among  cells  via  ligand-receptor  interactions 
and  corresponding  signal  transduction,  resulting 
in  the  expression  of  specific  genes.  These  pro- 
cesses are  important  in  determining  cellular  meta- 
bolic responses  such  as  secretory  activity,  cellu- 
lar differentiation,  and  growth. 

Peptides  in  Cellular  Metabolism 

Peptides  activate  metabolic  responses  in  cells 
by  way  of  interactions  with  specific  receptors  on 
distant  (endocrine) ,  adjacent  (paracrine) ,  or  the 
same  (autocrine)  cells.  These  ligand-receptor  in- 
teractions lead  to  the  activation  of  signal  trans- 
duction pathways  involving  postulated  cascades 
of  protein  phosphorylation  enzymatically  cata- 
lyzed by  protein  kinases,  eventuating  in  the  as- 
sembly of  active  transcriptional  complexes. 
Under  intensive  investigation  are  two  such  signal- 
ing pathways  mediated  by  cAMP-dependent  pro- 
tein kinase  A  and  by  phospholipid/diacylglycerol- 
stimulated  protein  kinase  C.  A  major  focus  of  our 
laboratory  is  to  understand  how  specific  phos- 
phorylated  DNA-binding  proteins  interact  with 
cognate  DNA  sequences  and  thereby  induce  gene 
expression. 

Genes  Encoding  Polypeptide  Hormones 

We  have  determined  the  structures,  organiza- 
tion, and  regulation  of  the  expression  of  genes 
encoding  several  of  the  polypeptide  hormones. 
Our  work  has  centered  on  the  genes  encoding  the 
glucagon  and  glucagon-related  peptides,  somato- 
statin, insulin,  angiotensinogen,  and  the  gonado- 
tropins. These  studies  have  progressed  through 
several  stages:  1)  cloning  of  the  genes  and  eluci- 
dation of  their  structures,  2)  determinations  of 
the  DNA  enhancer  and  suppressor  sequences  re- 
sponsible for  the  regulation  of  the  transcriptional 
expression  of  the  genes,  and  3)  isolation  and  char- 


acterization of  DNA-binding  proteins  involved  in 
the  regulation  of  expression.  These  studies  have 
led  to  the  identification  of  cell-specific  enhancer 
sequences  within  the  genes — sequences  that  de- 
termine in  which  cellular  phenotype  the  genes 
are  expressed  and  how  gene  transcription  re- 
sponds to  activator  substances  such  as  cAMP  and 
phorbol  esters.  They  have  also  led  to  the  identifi- 
cation of  complex,  cell-specific  post-transla- 
tional  processing  of  protein  precursors  (prohor- 
mones) encoding  the  peptide  hormones. 

Recently  we  determined  that  specific  nuclear 
proteins  bind  to  these  important  DNA  enhancer 
sequences  and  that  the  binding  specificities,  as 
well  as  the  transactivation  activities,  of  these  pro- 
teins are  regulated  by  their  phosphorylation. 

Regulation  of  Glucagon  and  Somatostatin 
Gene  Expression 

Our  analyses  of  the  regulation  of  the  expres- 
sion of  the  glucagon  in  pancreatic  islet  cell  lines 
reveal  that  islet  cell-specific  expression  resides 
in  at  least  two  enhancer-like  sequences  and  that  A 
cell  expression  in  the  islets  is  determined  by  an 
additional  enhancer/promoter  combination.  The 
expression  of  the  somatostatin  gene  is  restricted 
to  the  D  cells  because  a  suppressor  element  pre- 
vents expression  in  the  A  and  B  cells.  Transcrip- 
tional activation  of  the  glucagon  gene  is  mediated 
through  both  protein  kinase  C  and  protein  kinase 
A  pathways,  whereas  activation  of  the  somato- 
statin gene  is  regulated  by  protein  kinase  A  and  a 
calcium-regulated  pathway. 

Cloning  and  Structure  of  a  cAMP-Dependent 
DNA-binding  Protein 

In  studies  of  the  somatostatin  and  gonadotro- 
pin genes,  we  have  determined  that  their  expres- 
sion is  stimulated  via  a  cAMP-dependent  signal 
transduction  pathway.  We  have  discovered  that 
DNA-binding  proteins  interact  with  specific, 
short  DNA  sequences  to  generate  cAMP-respon- 
sive  complexes.  These  DNA-protein  complexes 
that  mediate  either  cAMP  or  phorbol  ester  control 
of  gene  transcription  share  certain  related  struc- 
tures. Cooperative  interactions  among  several 
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DNA-binding  proteins  and  with  adjacent  target 
DNA  sequences  appear  to  determine  cellular 
specificity  of  gene  expression  as  well  as  metaboli- 
cally  regulated  expression.  Recently  we  have 
cloned  several  members  of  a  family  of  cAMP- 
responsive  enhancer-binding  proteins  (CREBs) 
and  have  discovered  a  domain  on  one  of  the 
CREBs  that  is  phosphorylated  by  the  cAMP- 
dependent  protein  kinase  A,  as  well  as  by  addi- 
tional protein  kinases.  The  phosphorylation  of 
CREB  has  the  profound  effect  of  increasing  the 
transactivation  of  gene  transcription. 

Discovery  of  an  Insulinotropic  Peptide 

We  have  discovered  two  new  glucagon-like 
peptides  related  in  structure  to  pancreatic  gluca- 
gon and  co-encoded  for  glucagon  in  the  prohor- 
mone (proglucagon) .  These  peptides  are  differ- 
entially cleaved  from  the  proglucagon,  the  initial 
translation  product,  in  the  pancreas  and  the  intes- 
tines. One  of  the  peptides,  glucagon-like  pep- 
tide-I  (7-37),  produced  in  the  intestine  and  re- 
leased in  response  to  oral  nutrients  has  potent 


insulinotropic  activities.  Concentrations  as  low 
as  10""- 10"'^  M  stimulate  insulin  secretion  in 
the  perfused  rat  pancreas  and  stimulate  both 
cAMP  formation  and  proinsulin  mRNA  levels  in 
insulinoma  cell  lines.  We  have  determined  that 
the  glucagon-like  peptide  regulates  insulin  se- 
cretion in  humans  and  may  be  involved  in  the 
pathogenesis  of  certain  types  of  diabetes  mellitus 
due  to  faulty  regulation.  Particular  emphasis  is  on 
analyses  of  the  i8-cell  receptor  for  the  glucagon- 
like  peptide  and  the  mechanisms  operative  in  the 
stimulation  of  insulin  gene  transcription  in  re- 
sponse to  the  actions  of  the  peptide. 

Our  goals  are  1)  to  characterize  further  the 
genes  encoding  the  regulatory  (DNA-binding) 
proteins  that  interact  with  tissue-specific  en- 
hancers and  to  determine  how  cAMP-mediated 
expression  of  the  peptide  hormone-encoding 
genes  is  regulated  in  the  specific  cellular  pheno- 
types  and  2)  to  test  further  the  new  glucagon-like 
peptides  for  their  potential  regulatory  actions 
within  the  pancreatic  islets  and  the  intestinal 
tract  and  to  explore  the  possible  role  of  the  pep- 
tides in  diabetes  mellitus. 
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HOW  do  viruses  enter  and  leave  cells?  How  do 
receptors  and  their  ligands  cycle  from  cell 
surface  to  cell  interior  and  back?  How  do  regula- 
tory proteins  activate  or  inhibit  transcription  of 
particular  genes?  These  questions  deal  with  mo- 
lecular recognition  in  the  determination  of  cell 
organization.  They  represent  groups  of  projects 
in  our  laboratory,  all  of  which  involve  elucida- 
tion of  detailed  atomic  structures  as  a  prerequi- 
site for  tackling  functional  problems. 

Viruses 

The  small  double-stranded  DNA  viruses  SV40 
and  polyoma  have  given  us  our  first  glimpse  of 
virus  particles  that  package  a  minichromosome  in 
one  cell  and  deliver  it  to  the  nucleus  in  another. 
The  shells  of  these  viruses  are  composed  of  72 
pentamers  of  the  major  structural  protein  VPl 
and  30-60  copies  each  of  two  internal  proteins, 
VP2  and  VP3.  These  components  package  the 
viral  DNA.  The  VPl  polypeptide  chain  folds  in 
such  a  way  that  two  large  |8-sheets  with  radially 
directed  strands  form  a  framework,  with  tight  in- 
teractions between  adjacent  subunits  in  a  pen- 
tamer.  The  carboxyl  terminus  of  VPl  forms  an 
extended  arm  that  interacts  with  subunits  of  an- 
other pentamer,  generating  three  kinds  of  inter- 
pentamer  contact  in  the  virus  particle.  Flexibly 
extended  arms,  which  form  ordered  structures 
only  when  the  units  assemble  into  a  particle,  ap- 
pear to  be  an  important  feature  of  complex 
assemblies. 

Comparison  of  a  low-resolution  structure  of 
polyoma  with  the  SV40  model  shows  that  two 
surface  loops  are  larger  in  the  polyoma  subunit. 
Mutational  evidence  suggests  that  these  loops 
create  a  shallow  pocket  for  binding  sialic  acid, 
required  for  cell  entry  by  polyoma  but  not  by 
SV40.  A  number  of  viruses  of  various  structural 
types  use  cell-surface  sialic  acid  for  attachment, 
and  it  is  of  broad  interest  to  understand  how  spe- 
cific viruses  accomplish  the  interaction.  Compar- 
ison of  SV40  and  polyoma  shows  that  this  func- 
tion can  readily  be  added  or  lost  by  small  changes 
at  the  surface  of  a  viral  coat  protein.  We  have 
prepared  cocrystals  of  polyoma  with  bound  sialyl 


lactose  in  order  to  confirm  assignment  of  its  site 
and  to  study  the  molecular  details. 

We  have  recently  begun  to  study  the  double- 
stranded  RNA  viruses.  Crystals  of  rotavirus  single- 
shelled  particles  and  reovirus  cores  diffract  to  at 
least  7  A  resolution.  These  particles  are  elabo- 
rately organized  molecular  machines,  containing 
the  complete  transcription  and  RNA-modifica- 
tion  activities. 

Receptors 

The  receptor  for  human  immunodeficiency 
virus  (HIV)  is  the  lymphocyte  surface  antigen 
CD4.  Its  extracellular  portion  is  composed  of 
four  immunoglobulin-like  domains.  We  have  de- 
termined the  atomic  structure  of  a  two-domain 
amino-terminal  fragment,  which  binds  HIV  as 
tightly  as  does  the  intact  receptor.  The  first  two 
domains  are  formed  by  a  continuous  |S-strand 
connector,  and  they  have  an  extensive  hydropho- 
bic interface.  Thus  they  form  a  rigid  rod-like  seg- 
ment. The  HlV-binding  site  appears  to  be  a  ridge 
along  one  edge  of  the  first  domain. 

We  have  also  crystallized  a  rather  different  sort 
of  cell-surface  protein — the  extracellular  do- 
main of  the  human  transferrin  receptor.  This  mol- 
ecule undergoes  a  well-characterized  cycle  of 
uptake  and  return  to  the  cell  surface.  The  exter- 
nal domain,  which  makes  up  about  three-fourths 
of  the  molecule,  exhibits  reversible  conforma- 
tional changes  at  low  pH  that  we  believe  to  be 
signals  for  intracellular  sorting  steps. 

Transcriptional  Regulatory  Complexes 

A  common  characteristic  of  eukaryotic  tran- 
scriptional regulatory  elements  is  the  presence  of 
sites  in  multiple  copies  that  vary  slightly  in  se- 
quence, often  with  two  or  more  related  proteins 
that  can  bind  to  them.  The  best  understood  pro- 
karyotic  paradigm  is  in  the  immunity  region  of 
temperate  bacteriophages,  where  two  proteins, 
repressor  and  Cro,  bind  two  sets  of  three  sites, 
with  appropriately  graded  affinities.  We  have 
made  an  effort  to  understand  the  mechanism  of 
this  regulatory  switch,  by  determining  the  struc- 
tures of  a  series  of  specific  protein/DNA  com- 
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plexes  containing  the  Cro  protein  of  phage  434 
or  the  DNA-binding  domain  of  its  repressor.  We 
are  beginning  to  use  computational  approaches 
to  link  observed  structural  differences  among 
these  various  complexes  with  the  corresponding 
free  energies  of  binding. 

We  have  also  been  studying  several  eukaryotic 
regulatory  proteins,  initially  by  preparing  crys- 
tals of  their  DNA-binding  domains  in  complex 
with  synthetic  binding  sites.  TflllA,  which  con- 
trols 5S  RNA  transcription  in  Xenopus,  represents 
the  so-called  zinc  finger  class.  The  finger  is  a 
small,  30-residue  domain,  stabilized  by  a  tightly 
bound  zinc  ion.  A  recombinant  fragment  com- 
prising seven  of  the  nine  fingers  from  TflllA  binds 
to  a  30-base  pair  (bp)  DNA  containing  an  appro- 
priate part  of  the  total  binding  site.  We  have  crys- 
tallized this  complex. 

GAL4,  a  regulator  of  galactose  metabolism  in 
yeast,  has  a  rather  different  zinc-containing  struc- 
ture. The  amino-terminal  65  residues  form  a  do- 
main that  binds  with  dyad  symmetry  to  a  17-bp 
consensus  DNA  sequence.  A  first  structure  of  such 


a  complex  is  nearly  complete.  Each  domain  con- 
tains two  zinc  ions  in  close  proximity,  liganded 
by  six  cysteines. 

GCN4,  also  a  yeast  regulatory  protein,  repre- 
sents yet  another  class  of  DNA-binding  structures. 
It  contains  a  dimerization  element,  generally 
called  a  leucine  zipper,  which  forms  an  a-helical 
coiled  coil  about  30  residues  in  length.  This  seg- 
ment is  preceded  in  the  protein  sequence  by  a 
positively  charged  region,  which  has  little  or- 
dered structure  in  the  free  protein  but  which  also 
acquires  a-helical  structure  when  it  binds  to 
DNA.  We  have  prepared  crystals  of  the  basic  re- 
gion/leucine  zipper  fragment  of  GCN4,  in  com- 
plex with  a  synthetic  binding  site,  and  a  structure 
determination  is  in  progress. 

Understanding  how  these  various  structures 
recognize  their  DNA  sites  is  only  a  beginning.  The 
specificities  of  interactions  between  other  do- 
mains of  these  proteins  and  additional  compo- 
nents of  a  transcriptional  initiation  complex  pre- 
sent even  more  challenging  puzzles  for  the 
future. 
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WE  are  studying  two  problems  in  the  genetic 
regulation  of  pattern  formation  in  Dro- 
sophila. One  is  the  question  of  how  determinants 
come  to  be  localized  within  a  cell  (for  instance, 
the  egg)  where  they  can  initiate  correct  develop- 
mental fates.  The  second  is  the  action  of  DNA  reg- 
ulatory sequences  that  respond  to  positional  cues 
within  a  tissue  and  confine  the  expression  of 
genes  to  particular  cells. 

During  early  development  of  the  Drosophila 
embryo,  maternal-effect  genes  express  products 
involved  in  establishing  the  basic  body  plan. 
Among  this  class  of  genes  is  exuperantia  {exu), 
which  is  needed  for  the  correct  determination  of 
anterior  embryonic  structures.  The  product  of 
the  exu  gene  is  needed  during  the  development 
of  the  oocyte  for  anterior  localization  of  the  RNA 
of  another  gene,  bicoid  (bed).  This  localization 
leads,  during  early  embryogenesis,  to  a  steep  an- 
teroposterior concentration  gradient  of  bed  pro- 
tein, which  acts  as  an  embryonic  anterior  determi- 
nant. We  are  interested  in  how  the  exu  gene 
functions  to  bring  about  this  subcellular  localiza- 
tion of  the  bed  RNA. 

We  have  analyzed  the  effects  of  exu  mutations 
on  spermatogenesis.  During  a  long  period  of 
growth  and  development,  primary  spermato- 
cytes, which  have  not  yet  undergone  meiosis, 
appear  normal  in  exu  mutants;  defects  can  be  ob- 
served during  meiosis  and  subsequent  differen- 
tiation of  spermatids.  Fully  differentiated,  motile 
sperm  are  never  produced.  One  exu  allele  is  fe- 
male-specific, showing  only  the  maternal -effect 
phenotype,  and  the  other  allele  is  male-specific. 
Both  alleles  have  been  informative  about  the 
functioning  of  exu  in  gametogenesis  in  the  two 
sexes. 

We  have  cloned  the  exu  gene  and  found  that  it 
encodes  overlapping  male  and  female  tran- 
scripts, which  differ  in  size  in  the  two  sexes.  The 
two  sex-specific  transcripts  differ  also  in  splicing 
and  polyadenylation  patterns,  but  sequence  anal- 
ysis has  shown  that  they  both  encode  the  same 
predicted  58-kDa  polypeptide. 

We  have  determined  the  sequence  of  the  fe- 
male-specific exu  mutation.  It  changes  a  single 


amino  acid  in  the  protein,  possibly  identifying  a 
region  more  critical  for  its  female  than  its  male 
function. 

The  male  transcript  is  longer  than  the  female  in 
its  3'-untranslated  tail.  For  part  of  the  tail,  the 
male-specific  exu  mutation  is  deleted.  We  are 
currently  attempting  to  understand  the  function 
of  the  male-specific  tail.  We  have  also  found  that 
the  tra-2  (transformer- 2)  gene,  which  functions 
in  Drosophila  sex  determination,  is  needed  for 
efficient  processing  of  the  exu  transcript  in  the 
male  mode. 

Together  these  results  suggest  that  sex-specific 
processing  of  the  exu  transcript  in  the  germline  is 
a  biologically  important  event.  Why  is  the  exu 
gene  needed  maternally  for  early  development  of 
the  embryo  and  also  during  spermatogenesis? 
Since  exu  mutants  disturb  the  anterior  localiza- 
tion of  bed  RNA  in  eggs,  one  model  is  that  the 
gene  functions  in  both  cases  to  localize  RNAs  in 
the  germ  cells.  Since  there  is  no  known  effect  of 
bed  mutations  on  spermatogenesis,  different  do- 
mains in  the  exu  protein  could  act  to  recognize 
different  RNAs. 

Alternatively,  the  exu  product  could  serve  a 
more  general  role  in  germ  cell  cytoarchitecture, 
and  the  disruptive  effect  of  exu  mutations  on  bed 
RNA  localization  could  be  a  pleiotropic  effect  of 
disturbing  this  structure.  We  do  not  believe,  how- 
ever, that  the  latter  scenario  is  correct.  We  have 
determined  the  distribution  of  the  exu  protein 
during  oogenesis  and  early  development  with  the 
use  of  antibodies  raised  against  the  protein.  We 
find  it  to  be  present  in  the  oocyte  in  a  sharp  con- 
centration gradient  at  times  in  oogenesis  when 
the  bed^NA  is  localized,  with  highest  concentra- 
tions at  the  anterior  ends  of  developing  oocytes. 
But  the  protein  appears  to  be  degraded  after  it 
enters  oocytes  (it  is  produced  by  the  nurse  cells, 
which  are  attached  to  the  oocyte  and  nourish  it 
during  its  development)  and  is  not  present  in  the 
mature  oocyte  or  early  embryo.  These  results  sug- 
gest that  the  exu  protein's  role  in  bed  RNA  local- 
ization may  be  transient. 

Perhaps  the  exu  protein  modifies  another  pro- 
tein that  the  bed  RNA  interacts  with,  or  performs 
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an  escort  role  to  position  bed  RNA  in  the  right 
place  in  the  oocyte.  Another  attractive  hypothesis 
is  that  the  exu  protein  may  aid  in  folding  the  bed 
RNA  into  a  secondary  structure  that  is  recognized 
by  other  factors  that  localize  it. 

During  development,  positional  information 
instructs  genes  to  be  expressed  in  different  cells 
in  given  tissues.  The  Drosophila  white  gene  is 
required  for  deposition  of  pigments  in  the  eye  of 
the  fly.  Normally  the  gene  is  expressed  in  the  en- 
tire eye,  leading  to  uniformly  pigmented  eyes  in 
the  adult.  We  are  studying  a  case  in  which  the 
white  gene  was  inserted  in  a  genomic  location 
where  its  expression,  rather  than  being  uniform 
in  the  eye,  is  limited  primarily  to  the  ventral  half. 
This  patterned  expression  is  due  to  the  influence 
of  DNA  flanking  the  insertion  site,  since  moving 
the  gene  to  other  locations  in  the  genome  does 
not  have  the  same  effect. 

Mutations  were  induced  that  eliminate  the  pat- 
terned expression,  returning  the  pigmentation  to 


wild  type.  The  molecular  nature  of  these  muta- 
tions has  been  studied  by  cloning  and  analyzing 
the  flanking  DNA.  These  studies  show  that  se- 
quences flanking  both  sides  of  the  white  gene  are 
necessary  for  the  regulation,  but  that  a  region  up- 
stream of  the  white  promoter  is  particularly  im- 
portant for  the  effect.  This  regulatory  region  is 
sensitive  to  distance  from  the  gene,  since  moving 
it  closer  causes  more  severe  repression  of  white 
expression,  leading  to  even  less  pigmented  tissue 
in  the  eye.  An  intriguing  aspect  of  the  regulation 
is  that  it  functions  not  only  in  cis  to  the  white 
gene  but  also  in  trans  on  paired  copies  of  white. 

A  number  of  Drosophila  genes  show  pairing- 
dependent  effects  on  their  expression,  suggesting 
that  promoters  or  labile  regulatory  RNAs  pro- 
duced by  these  genes  can  act  on  a  second  homolo- 
gous copy  of  the  gene.  We  are  analyzing  the 
flanking  DNA  and  unraveling  its  elements  in 
order  to  understand  better  this  proximity-depen- 
dent type  of  gene  regulation. 
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MOST  interesting  biological  transitions, 
whether  during  the  life  of  a  single  cell  or 
the  development  of  a  complex  tissue,  are  usually 
accompanied  by  underlying  changes  in  the  ex- 
pression of  genes.  Knowledge  of  molecular 
events  that  result  in  activation  of  these  genes  can 
lead  to  a  detailed  understanding  of  such  transi- 
tions. We  are  using  a  molecular  and  biochemical 
approach  to  examine  specific  transitions  that  oc- 
cur in  very  different  contexts:  the  mammalian 
cell  division  cycle  and  the  developing  mouse 
cerebellum. 

Control  of  Gene  Expression  During 
the  Cell  Cycle 

We  have  established  over  the  past  several  years 
that  coordinate  induction  of  histone  H4,  H2b, 
and  H 1  gene  expression  during  the  S  phase  of  the 
cell  cycle  is  achieved  through  the  agency  of  dis- 
tinct transcription  factors  that  interact  with 
highly  conserved  subtype-specific  consensus  ele- 
ments within  each  of  the  promoters.  That  the 
transcription  of  these  genes  is  accomplished  by 
different  proteins  suggests  that  their  coordinate 
induction  is  due  to  a  pleiotropic  regulatory  mech- 
anism that  directly  participates  in  this  activation 
step  during  the  transition  from  Gl  to  S  phase.  Our 
efforts  are  focused  on  discovering  the  nature 
of  this  regulatory  step  and  determining  its 
specificity. 

During  the  past  year  progress  in  elucidating 
this  regulatory  step  has  been  achieved  through 
in-depth  analysis  of  the  regulatory  protein  OTFl 
(Octl,  NFIII)  for  the  H2b  cell  cycle.  Thus  high- 
titer  polyclonal  and  monoclonal  antibodies  spe- 
cific for  Oct  1  have  been  raised  and  employed  to 
examine  the  chemical  nature  of  this  protein  as 
the  division  cycle  progresses.  Our  results  demon- 
strate that  multiple  forms  of  Octl  exist  in  mam- 
malian cells  and  that  their  distribution  is  dramati- 
cally regulated  during  the  cell  cycle. 

Further  analysis  has  established  that  differen- 
tial phosphorylation  of  Octl  during  the  cycle  is 
at  least  partially  responsible  for  the  observed 
changes  in  this  protein  as  cells  proceed  toward 
division.  In  particular,  a  peptide-mapping  exper- 


iment using  Octl  pulse-labeled  with  P  ortho- 
phosphate  in  vivo  has  resulted  in  identification 
of  at  least  six  phosphopeptides  that  are  specifi- 
cally labeled  in  metaphase  cells.  Preliminary  re- 
sults indicate  that  purified  Octl  is  a  substrate  for 
CDC2  kinase  in  vitro,  but  that  these  phosphory- 
lations may  be  due  to  several  protein  kinases. 

Our  present  efforts  are  focused  on  discovery  of 
the  enzymes  that  post-translationally  modify 
Octl  during  the  cell  cycle,  and  demonstration 
that  these  modifications  are  functionally  relevant 
to  histone  gene  transcription.  Similar  analyses  of 
a  second  histone-specific  cell  cycle  regulatory 
factor,  H1TF2,  have  begun  to  determine  whether 
its  properties  are  similarly  modulated  during  the 
cycle.  Demonstration  that  the  timing  and  nature 
of  the  post-translational  modifications  on  Octl 
and  H1TF2  are  similar  in  vivo  would  establish 
the  existence  of  the  proposed  pleiotropic  regula- 
tory mechanism  for  regulation  of  transcription 
during  S  phase. 

One  fundamental  question  that  has  arisen  from 
this  work  is  whether  the  initiation  of  S  phase- 
specific  transcription  and  chromosomal  DNA  syn- 
thesis are  mechanistically  coupled.  To  address 
this  issue,  we  have  focused  on  two  specific  ques- 
tions: Are  the  regulatory  proteins  for  S  phase  his- 
tone gene  transcription  directly  involved  in  chro- 
mosomal DNA  synthesis?  Might  proteins  that 
regulate  initiation  of  DNA  synthesis  at  specific 
chromosomal  origins  of  replication  be  activated 
by  the  same  mechanisms  that  modulate  those 
transcription  factors? 

In  collaboration  with  Nicholas  Heintz  (Univer- 
sity of  Vermont  Medical  School),  we  have  re- 
cently identified  a  cellular  protein  complex 
(RIP60/RIP100)  with  several  properties  ex- 
pected of  replication-initiating  proteins.  Thus 
the  purified  complex  binds  specifically  to  two 
well-characterized  origins  of  DNA  replication  at 
sites  thought  to  be  important  for  origin  function, 
and  shows  ATP-dependent  DNA  helicase  activity. 
Our  present  efforts  are  directed  toward  a  defini- 
tive demonstration  that  RIP60/RIP 1 00  can  partic- 
ipate in  DNA  replication,  and  toward  generation 
of  appropriate  tools  to  determine  whether  the  ac- 
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tivity  of  these  factors  is  regulated  during  the  cell 
cycle. 

Molecular  Approaches  Toward  an 
Understanding  of  the  Mammalian 
Cerebellum 

The  mammalian  cerebellum  is  a  complex  and 
highly  stereotyped  structure  in  which  major  pat- 
tern formation  and  functional  organization  occur 
postnatally.  The  precise  description  of  the  cellu- 
lar events  occurring  during  cerebellar  develop- 
ment, and  the  existence  of  many  mutant  mouse 
strains  in  which  normal  development  of  the  cere- 
bellum is  perturbed,  recommend  it  as  an  amena- 
ble system  for  molecular  analysis  of  central  ner- 
vous system  development.  Our  initial  interests  in 
this  area  have  been  to  identify  genes  that  are  ei- 
ther essential  for  normal  development  of  the  cere- 
bellum or  that  serve  as  molecular  markers  for  spe- 
cific developmental  events  that  occur  during  its 
formation.  Our  ultimate  goal  is  to  utilize  these 
genes  to  identify  novel  proteins  that  are  crucial  to 
proper  development  of  the  cerebellum  and  to 
identify  molecular  mechanisms  that  participate 
in  specific  developmental  events  by  analysis  of 
the  pathways  that  result  in  their  correct  spatial 
and  temporal  expression. 

To  identify  genes  that  are  required  for  normal 
development  or  maintenance  of  cerebellar  struc- 
ture and  function,  we  have  initiated  efforts  to 
clone  the  genes  responsible  for  several  neurologi- 
cal mutants  of  mice.  Our  most  significant  pro- 
gress toward  identification  of  these  genes  has 
been  in  studies  concerning  the  Lurcher  (Lc)  and 
meander  tail  {med)  loci.  Lc  is  a  semidominant 
mutation  that  results  in  death  of  essentially  all 
cerebellar  Purkinje  cells,  beginning  at  about  two 
weeks  of  age.  Secondary  loss  of  cerebellar  gran- 
ule cells  and  olivary  neurons  is  also  observed.  We 
have  constructed  a  detailed  genetic  map 
surrounding  the  Lc  locus  on  chromosome  6  and 
have  identified  an  RFLP  (restriction  fragment 
length  polymorphism)  marker  approximately 


0.5  cM  from  the  gene.  Genomic  sequences  from 
this  closely  linked  marker  were  used  to  screen  a 
yeast  artificial  chromosome  (YAC)  library  from 
Shirley  Tilghman  (HHMI,  Princeton  University), 
resulting  in  isolation  of  a  280-kilobase  YAC  that 
maps  to  chromosome  6.  Using  sequences  isolated 
from  this  YAC  and  informative  recombinants  gen- 
erated during  genetic  mapping  of  the  Lc  locus, 
we  have  begun  a  chromosomal  walk  toward  the 
Lc  gene. 

The  gene  mea  is  a  recessive  mutation  resulting 
in  gross  perturbations  of  cerebellar  cytoarchitec- 
ture  that  are  confined  to  the  anterior  lobes  of  the 
cerebellum.  The  sharp  boundary  between  the 
normal  and  affected  area  of  the  mea /mea  cere- 
bellum is  reminiscent  of  the  discrete  boundaries 
evident  in  many  Drosophila  developmental  mu- 
tants, suggesting  that  the  mea  gene  may  influ- 
ence compartmental  cellular  organization  in 
mammalian  brain.  In  this  case  we  have  also  con- 
structed a  detailed  genetic  map  surrounding  the 
mea  gene  on  chromosome  4  and  have  begun  ef- 
forts to  identify  appropriate  genomic  sequences 
to  begin  isolation  of  YAC  clones  containing  the 
mea  locus.  The  identification  of  genes  responsi- 
ble for  these  and  other  mouse  neurological  muta- 
tions should  provide  fundamentally  important 
insights  into  cerebellar  structure  and  function. 

During  the  past  year  we  have  continued  to  pur- 
sue several  different  strategies  to  identify  cDNA 
clones  that  are  cell  specific  and  developmentally 
regulated  in  the  cerebellum.  Using  both  subtrac- 
tive  hybridization  and  differential  screening 
methods,  a  large  number  of  novel  cDNA  clones 
have  been  isolated  and  are  presently  being  ana- 
lyzed. Initial  results  indicate  that  our  cloning 
strategies  have  been  successful,  since  many  devel- 
opmentally regulated  cDNAs  have  been  identi- 
fied. For  example,  several  candidate  cDNA  clones 
for  the  developmentally  regulated  neuron:glial 
ligand  "astrotactin"  have  been  obtained  in  a  col- 
laborative effort  with  Mary  Beth  Hatten  (Colum- 
bia University) . 
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RNA-DNA  hybrids  have  a  vital  role  in  biology 
as  intermediates  of  genetic  replication.  DNA 
synthesis  in  both  eukaryotes  and  bacteria  is  initi- 
ated by  short  RNA  primers  that  are  hybridized 
with  the  template  DNA.  This  primer  RNA  must  be 
removed  for  DNA  replication  to  be  completed. 
Various  nucleases  can  perform  this  digestion,  but 
notably  specialized  to  the  task  are  the  enzymes 
know^n  as  ribonuclease  H  (RNase  H,  where  H 
stands  for  hybrid),  which  hydrolyze  only  RNA 
chains  and  then  only  when  they  occur  in 
heteroduplexes. 

RNase  H  also  participates  in  retroviral  replica- 
tion. It  is  a  constituent  activity  of  the  multifunc- 
tional enzyme  reverse  transcriptase  and,  as  such, 
is  indispensable  for  retroviral  infectivity.  During 
reverse  transcription,  the  polymerase  moiety  of 
the  transcriptase  molecule  first  produces  an  RNA- 
DNA  hybrid  on  the  template  viral  RNA.  RNase  H 
then  removes  the  genomic  RNA  to  free  the  com- 
plementary DNA,  which  serves  as  the  template  for 
plus-strand  synthesis.  Finally,  the  resulting  DNA 
duplex  is  integrated  into  the  host  genome. 

The  RNase  H  domain  constitutes  the  carboxyl- 
terminal  third  in  reverse  transcriptases  of  such 
retroviruses  as  human  immunodeficiency  virus 
(HIV)  and  Moloney  murine  leukemia  virus 
(MMLV) .  These  portions  of  sequence  are  clearly 
homologous  with  the  RNases  H  of  Escherichia 
colt  and  yeast.  In  light  of  our  ultimate  interest  in 
reverse  transcriptase  structure,  we  have  under- 
taken a  crystallographic  study  of  E.  coli  RNase  H. 
This  work,  performed  in  collaboration  with  Rob- 
ert Crouch,  who  discovered  and  extensively 
characterized  this  enzyme  at  the  National  Insti- 
tutes of  Health,  proves  to  be  fascinating  in  its  own 
right. 

MAD  Structure  of 

the  Selenomethionyl  Protein 

We  have  used  a  novel  method  to  solve  the  crys- 
tal structure  of  E.  coli  RNase  H.  This  involved 
producing  the  recombinant  protein  with  seleno- 
methionine systematically  replacing  the  four  me- 
thionine residues  in  this  155-residue  protein. 
The  selenomethionyl  protein  is  fully  active,  and 


it  crystallizes  isomorphously  with  the  natural  en- 
zyme. We  took  these  crystals,  grown  by  Wei  Yang, 
to  the  Photon  Factory  synchrotron  in  Japan. 
There,  in  collaboration  with  Yoshinori  Satow,  we 
made  measurements  of  multiwavelength  anoma- 
lous diffraction  (MAD)  for  use  in  the  structure 
determination.  This  MAD  phasing  analysis  pro- 
ceeded, in  Wei  Yang's  hands,  in  a  straightforward 
manner  to  an  initial  image  at  2.2  A  resolution. 
Presently  the  structure  is  refined  at  1 .7  A  resolu- 
tion with  an  R- value  residual  of  17  percent. 

The  RNase  H  from  E.  coli  is  an  a/fi  protein  with 
a  tertiary  folding  that  bears  no  significant  similar- 
ity to  other  known  structures.  It  is  organized 
around  a  mixed  central  /3-sheet  of  five  strands 
with  four  a-helices  on  one  side  and  one  on  the 
other.  Our  model  is  virtually  identical  with  one 
reported  independently  by  another  group  work- 
ing with  a  different  crystal  form. 

Implications  for  Substrate  Binding 
and  Catalysis 

RNase  H  is  an  endonuclease  that  releases  5'- 
phosphate  products  upon  hydrolysis  of  the  RNA 
from  RNA-DNA  hybrid  duplexes.  The  reaction  re- 
quires divalent  cations,  preferentially  Mg^"*"  in  the 
case  of  E.  coli.  These  and  other  features  distin- 
guish the  catalytic  mechanism  of  RNase  H  from 
that  of  the  RNase  A  family.  We  can  rather  confi- 
dently identify  the  catalytic  site  in  RNase  H  as 
being  at  the  conjunction  of  seven  evolutionarily 
conserved  residues.  At  the  center  of  this  con- 
served group  are  a  glutamate  and  two  aspartate 
residues  that  site-directed  mutations  have  shown 
to  be  absolutely  required  for  activity.  They  form  a 
carboxyl  triad  that  appears  to  be  a  likely  site  for 
the  binding  of  divalent  cations. 

Several  structural  features  appear  to  mark  the 
substrate  binding  zone.  One  is  a  sulfate-binding 
site,  which  we  propose  to  be  occupied  by  a  nu- 
cleotide phosphate  in  the  enzyme-substrate  com- 
plex. This  site  lies  in  a  groove  separated  by  a  ridge 
from  the  carboxyl  triad,  which  is  14  A  away  in 
another  groove.  The  electrostatic  potential  sur- 
face of  these  grooves  is  complementary  to  that  of 
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the  nucleic  acid,  assuming  that  divalent  cations 
are  present  at  the  carboxyl  triad. 

Based  on  these  observations,  we  have  built  a 
model  of  the  complex  of  RNase  H  with  a  hybrid 
duplex  in  the  A  conformation.  We  are  attempting 
to  cocrystallize  the  enzyme  with  oligonucleotide 
hybrids.  We  have  also  grown  crystals  of  a  com- 
plex with  Mg^^  and  are  in  the  process  of  solving 
that  structure.  It  appears  that  the  catalytic  mecha- 
nism is  most  likely  to  be  similar  to  that  of  ribo- 
zymes  and  of  the  3'^  5'  exonuclease  of  DNA  poly- 
merase I,  which  Thomas  Steitz  (HHMI,  Yale 
University)  and  his  co-workers  have  studied  in 
detail. 

Relationships  to  Retroviral  Reverse 
Transcriptase 

The  sequences  of  E.  coli  and  yeast  RNases  H  are 
clearly  related  to  the  RNase  H  portions  of  retro- 
viral reverse  transcriptases.  Seven  of  the  nine  resi- 
dues that  are  absolutely  conserved  among  1 3  re- 
troviral RNases  H  occur  in  common  with  the 
microbial  enzymes.  Moreover,  in  a  topologically 


restricted  alignment,  E.  co/?' RNase  H  has  28  per- 
cent overall  amino  acid  sequence  identity  with 
MMLV  RNase  H  and  24  percent  identity  with  HIV 
RNase  H. 

Our  sequence  alignments  also  suggest  impor- 
tant structural  differences  among  the  various 
RNase  H  proteins.  This  has  led  us  to  an  exercise  in 
homology  modeling  in  collaboration  with  Ken- 
neth Smith  and  Barry  Honig  at  Columbia.  It  ap- 
pears that  HIV  RNase  H  has  a  1 5-residue  deletion 
relative  to  E.  colt  RNase  H  and  that  this  can 
readily  be  accommodated.  Our  comparisons  have 
also  suggested  mutations,  which  are  now  being 
studied  in  Steve  Goff's  laboratory  at  Columbia 
with  the  MMLV  reverse  transcriptase. 

Finally  we  have  returned  to  our  structural  stud- 
ies on  the  MMLV  reverse  transcriptase.  Millie 
Georgiadis  is  trying  to  produce  usable  crystals 
with  new  constructs  prepared  by  Alice  Teles- 
nitsky  in  the  Goff  laboratory.  We  hope  that  what 
we  have  learned  from  E.  co/?  RNase  H,  including 
MAD  phasing  based  on  selenomethionine,  will 
help  in  our  analysis  of  the  retroviral  enzyme. 
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EACH  individual  gene  occupies  a  fixed  posi- 
tion on  a  chromosome.  By  and  large,  moving 
a  gene  has  only  a  minor  effect  on  its  expression. 
Thus  most  studies  of  gene  expression  are  able  to 
focus  on  the  gene  as  an  independent  unit,  with- 
out taking  into  account  larger  organizational  fea- 
tures. There  are  exceptional  cases,  however,  in 
which  the  relationship  between  a  gene  and  its 
environment  plays  a  role  in  expression  of  the 
gene.  Our  work  has  concentrated  on  two  of  these 
exceptions  in  the  fruit  fly. 

Several  years  ago,  we  found  a  surprising  associa- 
tion between  two  apparently  unrelated  genes.  A 
gene  encoding  a  secreted  component  of  the  Dro- 
sophila cuticle  lay  entirely  within  an  intron  of 
another  gene,  called  Gart,  which  encodes  enzy- 
matic activities  necessary  for  biosynthesis  of  pur- 
ine bases.  These  oppositely  oriented  nested  genes 
are  regulated  quite  differently  from  each  other 
during  development.  The  cuticle  gene  belongs  to 
a  family  of  genes  that  is  expressed  at  high  levels 
in  certain  epidermal  cells  during  two  phases  of 
the  life  cycle  of  the  fly,  late  larval  and  prepupal. 
The  Gart  purine  gene  is  apparently  expressed  at 
low  levels  in  all  cells  throughout  development, 
consistent  with  its  housekeeping  role.  We  won- 
dered whether  the  expression  of  either  gene  was 
influenced  by  its  neighbor — the  cuticle  gene  by 
its  surrounding  neighbor,  and  the  purine  gene  by 
its  neighbor  within. 

Examination  of  the  structure  of  the  Gart  locus 
in  other  insects  revealed  that  the  nested  gene  ar- 
rangement is  nearly  identical  in  a  distantly  re- 
lated species  of  fruit  fly  (although  the  cuticle 
gene  is  absent  from  the  intron  of  an  even  more 
distant  now- Drosophila  relative).  The  nested 
genes  derived  from  one  species  are  able  to  func- 
tion correctly  when  introduced  into  the  other 
species.  By  introducing  segments  of  the  locus 
from  one  species  into  the  other,  we  are  attempt- 
ing to  evaluate  what  components  are  necessary 
for  expression  of  each  of  the  nested  genes,  using 
the  fully  functional  resident  locus  as  a  control. 

Somewhat  to  our  surprise,  we  found  that  regu- 
lation of  either  gene  depends  in  part  upon  compo- 
nents closely  associated  with  the  other.  These  re- 


lationships between  seemingly  unrelated  genes 
that  occupy  the  same  segments  of  DNA  may  have 
relevance  to  recently  discovered  examples  of 
nested  genes  involved  in  human  disease,  includ- 
ing those  for  factor  VIII  and  neurofibromatosis. 

The  relationship  between  a  gene  and  its  chro- 
mosomal environment  is  especially  apparent  in 
examples  of  "position  effects"  associated  with 
chromosomal  rearrangements.  In  flies  a  well- 
known  class  of  position  effects  involves  inactiva- 
tion  of  genes  in  the  vicinity  of  rearrangement 
breakpoints.  Gene  inactivation  is  extremely  vari- 
able from  cell  to  cell,  such  that  the  affected  tissue 
shows  a  variegated  pattern  of  expression.  In  each 
case,  it  is  found  that  the  gene  has  been  juxtaposed 
to  heterochromatin,  the  deeply  staining  regions 
of  chromosomes  that  flank  the  centromere.  Al- 
though heterochromatin  contains  a  substantial 
fraction  of  DNA  in  all  higher  eukaryotes,  the  re- 
petitive sequence  structure  characteristic  of  het- 
erochromatin and  the  near  absence  of  genes  have 
hampered  attempts  to  understand  its  role  in 
the  genome.  Genes  that  show  variegated  expres- 
sion when  placed  next  to  heterochromatin  pro- 
vide a  reporter  function,  allowing  us  to  inves- 
tigate these  poorly  understood  regions  of 
chromosomes. 

Variegated  position  effects  caused  by  juxtapo- 
sition to  heterochromatin  are  seen  for  a  large 
number  of  genes  in  Drosophila.  One  well-stud- 
ied example  is  the  brown  gene,  required  for  full 
pigmentation  of  the  eye.  Unlike  nearly  all  other 
genes,  however,  such  position  effects  on  the 
brown  gene  are  dominant  over  wild  type — that 
is,  placing  one  copy  of  brown  next  to  heterochro- 
matin can  lead  to  inactivation  of  the  other  copy. 

We  have  investigated  the  genetic  basis  for  this 
gene  inactivation  in  trans  and  have  found  that  a 
necessary  component  is  the  pairing  of  homo- 
logues  in  the  immediate  vicinity  of  the  brown 
gene.  These  findings  have  led  to  an  explanation 
for  "trans-inactivation"  whereby  protein  compo- 
nents of  heterochromatin  make  direct  contact 
with  the  trans  copy  of  the  brown  gene  across 
paired  homologues.  In  suppon  of  this  hypothe- 
sis, we  have  been  able  to  reproduce  trans-inacti- 
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vation  at  sites  of  transposons  carrying  the  brown 
gene,  but  only  for  paired  copies  of  the  gene.  In 
addition,  we  have  found  that  even  very  small  le- 
sions that  disrupt  pairing  in  the  immediate  vicin- 
ity of  the  gene  also  reduce  trans-inactivation.  Our 
current  efforts  are  aimed  at  identification  of  the 
cis-  and  trans-acting  components  of  trans- 
inactivation. 

Position  effect  in  general  and  trans-inactivation 
in  particular  are  phenomena  that  are  easily  ob- 


served in  Drosophila,  where  powerful  tools  are 
available  for  genetic  dissection.  Related  phenom- 
ena are  known  to  occur  in  mammals,  such  as  X 
chromosome  inactivation,  in  which  one  of  the 
female's  X  chromosomes  becomes  heterochro- 
matic.  The  many  similarities  between  chro- 
mosomes in  organisms  as  diverse  as  flies  and 
mammals  lead  to  the  expectation  that  an  under- 
standing of  position  effects  in  Drosophila  will 
have  general  implications. 


The  classical  phenomenon  of  dominant  position-effect  variegation  reproduced  at  a  site  of  inser- 
tion of  the  Drosophila  brown  gene  present  on  a  transposon.  The  brown  gene  is  necessary  for 
transport  of  a  precursor  needed  for  red  eye  pigment  production.  The  fly  with  a  uniform  red  eye  has 
a  single  copy  of  the  transposed  gene  on  one  chromosome,  while  the  other  fly  has  both  a  normal 
copy  and  a  position-affected  copy.  The  mutant  phenotype  results  from  cis-inactivation  of  the 
position- affected  copy  and  trans-inactivation  of  the  paired  copy. 
Research  of  Steven  Henikoff. 
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THE  complement  system,  which  was  discov- 
ered nearly  100  years  ago,  was  initially  de- 
scribed as  an  activity  found  in  serum  that  medi- 
ates the  lysis  of  erythrocytes  or  the  killing  of 
foreign  organisms  also  treated  with  specific  im- 
mune antibodies.  Later  it  was  realized  that  com- 
plement also  facilitates  the  interaction  of  anti- 
gen-antibody complexes  with  cells  of  the 
immune  system.  It  is  now  known  that  comple- 
ment can  bind  some  foreign  organisms  or  cells 
without  the  need  for  antibody. 

The  complement  system  consists  of  at  least  20 
serum  proteins  that  are  activated  in  a  cascade  fash- 
ion: initial  activation  of  a  small  number  of  early 
components  leads  to  the  enzymatic  generation  of 
a  large  number  of  biologically  active  later  compo- 
nents. As  part  of  this  process,  protein  fragments 
are  released  that  attract  inflammatory  cells,  and 
antigen-antibody  complexes  are  coated  with 
specific  complement  fragments  that  covalently 
attach  to  this  target.  One  of  these  fragments,  com- 
plement component  C3,  is  able  to  be  proteolyti- 
cally  cleaved  after  attachment  to  targets.  These 
cleavage  reactions  result  in  a  number  of  different 
conformations;  this  allows  C3  to  interact  with  at 
least  three  unique  cell  surface  receptors,  the 
complement  receptors.  As  part  of  this  process,  C3 
fragments  may  also  bind  to  self  tissues,  rather 
than  to  the  antibody-bound  target,  thereby  attack- 
ing at  inappropriate  sites.  Other  cell  membrane 
C3-binding  proteins  are  able  to  inactivate  this  C3 
and  prevent  inappropriate  damage  to  self  tissues. 

We  are  interested  in  the  interaction  of  C3  with 
its  specific  receptors  and  regulatory  proteins,  par- 
ticularly the  biological  aspects  of  complement 
receptor  2  (CR2).  CR2  interacts  with  the  C3d,g 
fragment,  which  is  located  near  the  site  at  which 
C3  covalently  binds  to  its  target.  The  C3d,g  frag- 
ment remains  attached  to  targets  after  the  rest 
of  C3  is  trimmed  away  during  processing  of  the 
antigen-antibody-C3  complex. 

CR2  also  serves  as  the  receptor  for  the  Epstein- 
Barr  virus  (EBV) ,  which  is  responsible  for  most 
cases  of  infectious  mononucleosis  and  is  causally 
associated  with  a  number  of  human  tumors  of  B 
lymphocytes  and  epithelial  cells.  Patients  who 


have  forms  of  congenital  or  acquired  immunode- 
ficiency (such  as  AIDS  or  after  organ  transplanta- 
tion) are  particularly  susceptible  to  tumors  asso- 
ciated with  EBV. 

In  the  past  few  years  we  have  cloned  and  ana- 
lyzed the  structure  of  the  human  CR2  gene.  Ex- 
pression of  the  recombinant  receptor  in  other 
cells  is  sufficient  to  mediate  the  binding  of  EBV 
and  C3d,g.  By  using  other  recombinant  tech- 
niques and  creating  mutations  within  the  recep- 
tor, we  have  shown  that  specific  amino  acids  in  a 
small  binding  domain  at  the  amino  terminus  of 
the  receptor  are  important  for  ligand  interac- 
tions. In  addition,  we  have  synthesized  peptides 
that  have  the  ability  to  block  binding  of  EBV  to 
the  receptor.  These  studies  should  allow  us  to 
devise  strategies  to  alter  the  function  of  this  re- 
ceptor in  vivo.  For  instance,  one  type  of  reagent 
might  block  viral  binding  but  not  normal  binding 
of  C3.  This  could  be  useful  in  some  illnesses  asso- 
ciated with  EBV. 

To  increase  our  understanding  of  the  biologi- 
cal role  of  complement  receptors  and  regulatory 
proteins,  we  have  cloned  and  expressed  mouse 
homologues  for  CR2  and  complement  receptor 
1,  another  C3  receptor.  In  addition,  we  have 
cloned  and  expressed  related  genes  whose  com- 
plement-binding and  regulatory  activities  are  not 
fully  understood.  Once  we  understand  the  activi- 
ties of  these  proteins,  we  will  be  able  to  utilize 
many  murine  models  of  the  normal  immune  re- 
sponse, as  well  as  autoimmune  diseases,  to  exam- 
ine the  roles  that  these  proteins  play  in  vivo. 

Another  aspect  of  CR2  expression  is  also  under 
analysis  in  my  laboratory.  Expression  of  CR2  var- 
ies during  B  lymphocyte  development:  it  is  ex- 
pressed only  on  late  pre-B  cells  and  mature  B 
cells  and  not  on  very  early  pre-B  lymphocytes  or 
on  late  immunoglobulin-secreting  cells.  The  mo- 
lecular mechanisms  that  underlie  this  pheno- 
type,  which  is  also  found  among  other  B  cell- 
specific  markers,  are  likely  fundamental  to  the 
overall  processes  by  which  B  cells  mature  and  are 
activated.  We  are  analyzing  these  mechanisms.  As 
part  of  these  studies  we  have  defined  a  promoter 
for  CR2  and  other  sites  within  the  gene  that  are 
likely  to  be  important  in  gene  regulation.  In  addi- 


195 


The  Evolution  and  Biological  Roles  of  Complement  Receptors 


tion,  we  are  determining  whether  the  receptor 
levels  vary  because  the  gene  is  turned  on  and  off 
during  development.  These  studies  will  increase 
our  understanding  of  specific  gene  expression  in 


B  lymphocytes  and  further  our  knowledge  of  how 
to  alter  B  cell  phenotypes  along  pathways  that 
might  be  more  beneficial  during  certain  disease 
states. 
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HOW  do  genes  control  animal  development? 
Taking  a  primarily  genetic  approach  to  this 
question,  the  members  of  our  laboratory  have  iso- 
lated developmental  mutants  of  the  roundworm 
Caenorhabditis  elegans  and  have  used  both  ge- 
netic and  molecular  genetic  techniques  to  charac- 
terize these  mutants.  Because  the  complete  cellu- 
lar anatomy  (including  the  complete  wiving 
diagram  of  the  nervous  system)  and  the  complete 
cell  lineage  of  C.  elegans  are  known,  mutant  ani- 
mals can  be  studied  at  the  level  of  single  cells  and 
even  single  synapses.  Genes  that  play  specific 
roles  in  cell  lineage,  cell  signaling,  cell  death, 
and  cell  migration  have  been  identified  and 
analyzed. 

Cell  Lineage 

The  problem  of  cell  lineage — how  a  single  fer- 
tilized egg  cell  undergoes  a  complex  pattern  of 
cell  divisions  to  generate  a  multiplicity  of  dis- 
tinct cell  types — is  one  major  focus  of  the  re- 
search of  our  laboratory.  We  have  identified 
hundreds  of  genes  responsible  for  controlling 
aspects  of  the  C.  elegans  cell  lineage.  Many  of 
these  genes  function  in  generating  cell  diversity 
during  development.  For  example,  some  genes 
act  to  make  the  two  daughter  cells  generated  by  a 
single  cell  division  different  from  each  other,  and 
one  gene  acts  to  make  certain  daughter  cells  dif- 
ferent from  their  mothers.  The  action  of  many 
cell  lineage  genes  is  not  constrained  to  a  single 
tissue  or  organ.  For  example,  one  gene  acts  in 
both  the  nervous  system  and  the  musculature, 
while  another  acts  in  these  two  tissues  as  well  as 
in  the  gonad  and  the  hypodermis. 

We  have  analyzed  a  number  of  these  genes  at 
the  molecular  level.  These  studies  have  revealed 
that  genes  that  control  aspects  of  cell  lineage  in 
C.  elegans  are  strikingly  similar  to  genes  found  in 
other  organisms,  including  humans.  Thus  the 
analysis  of  developmental  control  genes  in  C.  ele- 
gans should  help  us  to  understand  aspects  of  the 
development  of  more  complex  organisms. 

Cell  Signaling 

Much  of  the  development  of  C.  elegans,  like 


that  of  other  organisms,  involves  intercellular 
communication.  We  have  studied  cell  interac- 
tions in  nematode  development  by  using  a  laser 
microbeam  to  kill  single  cells  in  living  animals:  if 
destruction  of  one  cell  alters  the  fate  of  a  second 
cell,  the  first  cell  must  normally  interact  with  the 
second.  We  have  analyzed  in  detail  the  cell  inter- 
actions involved  in  inducing  the  development  of 
the  vulva,  which  forms  the  external  genitalia, 
connects  the  uterus  with  the  outside  environ- 
ment, and  is  used  for  egg  laying  and  copulation. 
We  have  characterized  many  genes  that  function 
in  the  cell  interactions  of  vulval  development. 
One  gene  that  acts  as  a  switch  in  the  vulval  induc- 
tive signaling  pathway  is  a  member  of  the  ras 
gene  family.  Other  ras  genes  are  associated  with 
many  human  cancers;  the  same  mutations  that 
cause  extra  vulval  cell  divisions  in  C.  elegans  are 
oncogenic  in  mammals.  The  study  of  this  and 
other  genes  that  function  in  cell  signaling  in  C. 
elegans  might  provide  insights  relevant  to  cancer- 
ous growth  in  humans. 

Cell  Death 

Naturally  occurring  or  "programmed"  cell 
death  is  common  during  the  development  of  the 
nervous  system  of  many  animals,  including  C. 
elegans.  Why  organisms  generate  cells  only  to 
have  them  die  is  an  intriguing  question.  Further- 
more, the  mechanisms  responsible  for  cell  death 
may  be  of  some  medical  importance,  as  the  clini- 
cal features  of  many  human  disorders  (including 
trauma,  stroke,  and  a  variety  of  neurodegenera- 
tive diseases)  are  a  consequence  of  nerve  cell 
deaths. 

We  have  been  identifying  and  characterizing 
genes  that  function  in  programmed  cell  death  in 
C.  elegans.  Two  genes  cause  cells  to  die,  and 
seven  other  genes  are  involved  in  removing  the 
corpses  of  dead  cells.  The  two  genes  that  cause 
cells  to  die  must  be  expressed  by  the  dying  cells 
themselves,  indicating  that,  at  least  to  this  extent, 
programmed  cell  deaths  are  cell  suicides.  Al- 
though many  cells  die  during  the  course  of  C. 
e/egflns  development,  most  cells  survive;  cell  sur- 
vival requires  the  inactivation  of  the  cell  death 
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process,  as  a  tenth  gene  functions  to  prevent  the 
action  of  the  nine  cell  death  genes  in  surviving 
cells.  In  addition,  this  regulatory  gene  is  itself 
controlled  in  a  cell-specific  fashion  by  other 
genes  that  decide  which  cells  are  to  live  and 
which  are  to  die.  We  are  currently  analyzing  mo- 
lecularly  genes  involved  in  programmed  cell 
deaths  in  C.  elegans. 

We  hope  that  knowledge  of  what  makes  cells 
die  and  of  what  can  block  the  cell  death  process 
in  C.  elegans  will  lead  to  methods  that  will  pre- 
vent the  cell  deaths  responsible  for  human 
disorders. 

Cell  Migration 

During  animal  development,  cells  are  often 
generated  far  from  their  final  positions  and  must 
migrate  considerable  distances  before  being  able 
to  function.  To  understand  what  causes  cells  both 
to  migrate  and  to  stop  migrating,  we  are  analyzing 


two  C.  elegans  cell  migrations.  The  first  involves 
a  pair  of  muscle  precursor  cells  that  are  born  in 
the  posterior  body  region  and  move  to  a  central 
position  along  the  animal's  length,  near  its  gonad. 
We  have  discovered  that  these  migrations  involve 
signaling  between  the  migrating  cells  and  go- 
nadal cells  located  at  the  termination  site  of  the 
migration  and  are  now  characterizing  genes  that 
function  in  this  signaling  process. 

The  second  migration  we  have  studied  involves 
a  pair  of  neuronal  cells  that  move  from  the  tail 
region  to  the  midbody  region  of  the  animal.  Thir- 
teen genes  have  been  identified  that  must  func- 
tion for  these  neuronal  migrations  to  occur  prop- 
erly. Some  of  these  genes  probably  act  in  the 
migration  process  per  se,  but  some  do  not.  The 
actions  of  some  of  these  latter  genes  allow  these 
neurons  to  acquire  their  identities;  if  these  genes 
do  not  function,  these  neurons  fail  to  express 
their  normal  characteristics,  including  their 
long-range  cell  migration. 


Opposite:  The  microscopic  nematode  Czenorhahditis  elegans  is  used  to  identify  and  analyze  genes 
that  control  development  and  behavior.  Many  of  the  genes  that  act  in  this  simple  animal  are 
strikingly  similar  to  genes  found  in  humans.  For  example,  theC.  elegans  let-60  gene  is  very  similar 
to  human  ras  genes.  Mutations  that  cause  ras  genes  to  be  abnormally  active  are  oncogenic  and 
constitute  the  most  frequent  class  of  mutation  associated  with  human  cancers. 

The  figure  shows  a  normal  C.  elegans  adult  ( top  panel)  and  two  different  types  o/let-60  mu- 
tants. In  the  middle  animal,  the  let-60  gene  is  inactivated,  preventing  certain  cell  interactions 
that  normally  induce  formation  of  a  component  of  the  system  used  for  egg  laying.  The  animal  is 
filled  with  retained  eggs.  By  contrast,  the  lower  animal  is  a  mutant  in  which  the  let-60  gene  is 
abnormally  active,  leading  to  the  formation  of  abnormal  growths  (indicated  by  arrowheads). 
The  study  of  the  C  elegans  let-60  gene,  in  providing  insights  to  the  function  of  ras  genes,  might  help 
reveal  ways  in  which  ras  gene  activity  and  cancerous  growth  could  be  inhibited. 

Research  of  H.  Robert  Horvitz.  Photographs  by  Gregf.  Beitel. 
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WE  have  been  using  mitochondria,  the  intra- 
cellular organelles  that  carry  out  energy 
metabolism,  as  a  system  for  investigating  how 
proteins  cross  biological  membranes. 

Most  of  the  proteins  of  mitochondria  are  first 
made  outside  the  organelles,  in  the  cytosol,  and 
then  imported  through  both  an  outer  and  inner 
mitochondrial  membrane  to  reach  the  innermost 
"matrix"  compartment.  In  order  to  traverse  the 
membranes,  the  newly  made  proteins  are  first  un- 
folded on  the  cytosolic  side.  After  import,  they 
refold  on  the  inside  of  the  organelle  into  their 
biologically  active  conformations. 

It  has  always  been  assumed  that  this  process  of 
refolding  is  a  spontaneous  event,  much  like  the 
refolding  of  many  denatured  proteins  observable 
in  a  test  tube.  Yet  we  have  identified  a  mutant  cell 
in  which  mitochondrial  proteins  were  imported 
into  the  matrix  but  failed  to  fold  into  biologically 
active  forms.  The  mutation  was  found  to  affect  a 
protein  that  normally  resides  in  the  matrix,  called 
heat-shock  protein  60  (hsp60). 

This  protein  was  originally  identified  by  the 
observation  that  its  level  was  increased  about 
twofold  in  response  to  incubation  of  cells  at  high 
temperatures.  It  is  abundant,  however,  even  be- 
fore heat  shock,  and  our  genetic  analysis  demon- 
strated that,  consistent  with  a  critical  baseline 
function,  hsp60  is  required  not  only  at  high  tem- 
peratures but  at  all  temperatures.  The  increased 
level  produced  in  response  to  heat  stress  could 
represent  an  effort  of  the  cell  to  efficiently  refold 
mitochondrial  proteins  that  heat  has  denatured. 

In  the  mitochondrial  matrix,  hsp60  is  found  in 
a  higher  order  structure,  a  complex.  Fourteen 
copies  of  the  protein  are  arranged  in  two  stacked 
rings,  a  "double  donut."  Each  ring  contains  seven 
radially  arranged  copies  of  hsp60.  How  does  this 
complex  function?  Our  studies  have  demon- 
strated that  unfolded  mitochondrial  proteins  en- 
tering the  matrix  space  become  associated  with 
the  surface  of  the  hsp60  complex.  Then,  in  steps 
requiring  both  energy  and  a  second  protein  com- 
ponent, the  polypeptides  are  folded  into  their  ac- 
tive forms  and  released  from  the  complex. 
We  are  now  trying  to  dissect  the  mechanism  of 


hsp60-directed  folding.  We  believe  that  the  path- 
way of  folding  must  be  dictated  by  the  amino  acid 
sequence  of  the  "substrate"  protein  to  be  folded, 
not  by  the  hsp60  complex,  because  we  have  used 
the  complex  to  fold  proteins  that  normally  reside 
outside  mitochondria. 

It  seems  that  hsp60  acts  by  speeding  up,  or 
"catalyzing,"  the  folding  of  proteins.  How  does  it 
do  this?  One  possibility  is  that  it  simply  prevents 
domains  of  proteins  from  wrongfully  interacting, 
either  with  each  other  or  with  nearby  proteins  in 
the  mitochondrial  matrix,  a  "chaperone"  func- 
tion. Another  possibility  is  that  the  complex  ac- 
tively promotes  the  progression  of  an  unfolded 
protein  through  a  series  of  folding  steps.  Because 
we  can  now  reconstitute  hsp60-mediated  folding 
in  a  test  tube,  we  can  use  biochemical  and  bio- 
physical techniques  to  examine  the  folding  mech- 
anism more  precisely. 

How  general  is  the  utilization  of  folding  ma- 
chinery in  living  cells?  Because  mitochondria  ap- 
parently arose  from  bacteria  (one  cell  ingested 
another),  it  is  not  surprising  that  a  structurally 
related  component  has  been  found  in  bacteria. 
Here  a  function  like  that  of  hsp60  is  implied  by 
an  experiment  carried  out  many  years  ago.  When 
Escherichia  coli  cells  partially  defective  in  the 
hsp60-related  protein  were  infected  with  virus, 
the  newly  made  virus  coat  proteins  could  not  as- 
semble to  make  new  virus  particles.  We  surmise 
that  the  hsp60-related  component  is  likely  to  be 
used  not  only  to  fold  and  assemble  proteins  of 
infecting  viruses,  but  also  to  fold  most  if  not  all 
newly  synthesized  proteins  indigenous  to  the  bac- 
terial cell.  We  are  currently  testing  whether  this 
is  the  case  by  producing  a  severe  defect  in  the 
hsp60-related  component. 

Like  bacteria,  cells  of  higher  organisms  might 
utilize  a  "machine"  to  fold  newly  made  proteins 
into  their  active  forms.  We  are  currently  taking 
both  genetic  and  biochemical  approaches  toward 
identifying  such  a  component. 

How  does  a  folding  machine  like  the  hsp60 
complex  itself  get  assembled?  It  seemed  possible 
that  while  all  mitochondrial  proteins  tested  to 
date  utilize  hsp60  for  their  folding  and  assembly. 
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the  folding  machine  might  self-assemble  to  pro- 
duce a  new  complex.  We  recently  tested  this  pos- 
sibility and  found  that  hsp60  subunits  could  not 
self-assemble  inside  mitochondria.  Assembly  re- 
quired the  presence  of  preexistent  functional 
hsp60  complex. 

How  did  the  mitochondrial  hsp60  complex  get 
assembled  in  the  first  place?  This  question  is  an- 
swerable in  terms  of  the  origin  of  mitochondria. 
It  seems  likely  that  when  the  original  mitochon- 
drion was  founded  as  an  ingested  bacterial  cell,  it 
already  contained  an  hsp60-related  protein.  The 
question,  then,  really  becomes,  How  was  the 
hsp60-related  component  in  the  bacterial  pro- 
genitor assembled  for  the  first  time?  It  seems  pos- 


sible that  assembly  might  have  occurred  sponta- 
neously, inaugurating  a  perpetual  catalyzed 
assembly  reaction.  Alternatively,  initial  assembly 
might  have  occurred  with  the  help  of  some  other 
component — another  protein,  or  perhaps  a  nu- 
cleic acid. 

Thus  we  have  shown  that  imported  mitochon- 
drial proteins  do  not  fold  and  assemble  on  their 
own  but,  rather,  require  assistance  from  a  molecu- 
lar machinery.  This  may  also  prove  to  be  the  case 
for  proteins  of  other  cellular  compartments.  In 
general,  in  the  living  cell,  newly  made  and  newly 
translocated  proteins  may  achieve  their  active 
conformations  using  machines  that  function  like 
the  hsp60  complex. 
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INFORMATION  processing  in  the  brain  de- 
pends on  the  transmission  of  signals  between 
neurons  at  specialized  areas  of  contact,  called 
synapses.  At  synapses,  ion  channel  proteins  in  the 
neuronal  cell  membrane  generate  an  electrical 
current,  which  triggers  the  release  of  chemical 
signals  from  the  first  neuron,  called  the  presynap- 
tic neuron.  These  chemical  signals,  or  neurotrans- 
mitters, bind  to  specific  receptor  proteins  in  the 
membrane  of  the  second  neuron,  called  the  post- 
synaptic neuron.  The  neurotransmitter  receptors 
then  generate  electrical  currents  in  the  second 
neuron,  thereby  completing  the  process  of  syn- 
aptic transmission. 

Both  the  amount  of  neurotransmitter  released 
from  the  presynaptic  neuron  in  response  to  a 
given  electrical  signal  and  the  sensitivity  of  the 
postsynaptic  receptor  system  for  a  given  amount 
of  neurotransmitter  can  be  modulated  by  extra- 
cellular factors.  The  molecular  mechanisms  that 
underlie  this  modulation  have  only  begun  to 
be  defined.  Recent  studies  have  provided  evi- 
dence that  protein  phosphorylation  is  an  import- 
ant mechanism  in  the  regulation  of  synaptic 
transmission. 

Protein  phosphorylation  systems  consist  of 
three  primary  components,  a  protein  kinase,  a 
substrate  protein,  and  a  phosphoprotein  phos- 
phatase. Protein  kinases  are  enzymes  that  catalyze 
the  chemical  transfer  of  phosphate  molecules 
from  ATP  to  specific  substrate  proteins.  The  activ- 
ities of  many  protein  kinases  are  regulated  by  neu- 
rotransmitters and  hormones  through  the  actions 
of  substances  called  second  messengers,  such  as 
cAMP,  calcium,  and  diacylglycerol.  Substrate 
proteins  include  many  cellular  components, 
among  them  enzymes,  ion  channels,  and  neuro- 
transmitter receptors.  The  addition  of  the  nega- 
tively charged  phosphate  group  alters  the  struc- 
ture of  these  substrate  proteins,  thereby 
regulating  their  functional  properties.  Phospho- 
protein phosphatases  are  enzymes  that  reverse 
the  process  of  protein  phosphorylation,  remove 
the  phosphate  group  from  the  substrate  protein, 
and  return  it  to  its  basal  state. 

My  laboratory  is  concerned  with  the  structure 


and  function  of  neurotransmitter  receptors  and 
the  role  of  protein  phosphorylation  in  the  regula- 
tion of  the  properties  of  the  neurotransmitter  re- 
ceptors. We  have  chosen  the  best-characterized 
neurotransmitter  receptor  and  ion  channel  in  neu- 
robiology today,  the  nicotinic  acetylcholine  re- 
ceptor, as  a  model  system.  In  addition,  we  have 
been  studying  the  major  excitatory  neurotransmit- 
ter receptor  in  the  brain,  the  glutamate  receptor, 
and  the  major  inhibitory  receptor  in  the  brain, 
the  GABA^  receptor.  These  receptors  are  neuro- 
transmitter-dependent  ion  channels  that  generate 
electrical  currents  in  the  postsynaptic  mem- 
brane of  the  synapse  in  response  to  their 
neurotransmitter. 

To  study  the  molecular  mechanisms  involved 
in  neurotransmitter  receptor  and  ion  channel 
function,  it  is  essential  to  identify  chemically  the 
specific  proteins  required  for  this  activity.  We 
began  by  defining  the  molecular  components  re- 
quired for  the  functioning  of  the  nicotinic  acetyl- 
choline receptor  ion  channel.  Using  membrane 
reconstitution  techniques,  we  solubilized  the  nic- 
otinic receptor  and  its  ion  channel  from  isolated 
postsynaptic  membranes,  purified  it,  and  recon- 
stituted it  into  phospholipid  vesicles.  These  stud- 
ies demonstrated  that  the  purified  receptor,  con- 
sisting of  four  types  of  protein  subunits  (a,  /3,  7, 
6) ,  contains  the  ion  channel  and  has  all  the  biolog- 
ical properties  of  the  nicotinic  receptor  in  the 
intact  cell. 

We  next  began  to  characterize  the  protein 
phosphorylation  of  these  structural  components. 
Postsynaptic  membranes  isolated  from  synapses 
highly  enriched  in  the  nicotinic  acetylcholine  re- 
ceptor contain  at  least  three  different  types  of 
protein  kinases  that  phosphorylate  the  nicotinic 
receptor  on  six  different  phosphorylation  sites: 
cAMP-dependent  protein  kinase  phosphorylates 
the  7-  and  6-subunits  of  the  receptor;  a  calcium- 
and  diacylglycerol-dependent  protein  kinase 
phosphorylates  the  5-subunit;  a  protein  tyrosine 
kinase  phosphorylates  the  0-,  7-,  and  6-subunits. 
These  postsynaptic  membranes  also  contain 
phosphoprotein  phosphatase  activity  that  de- 
phosphorylates  the  phosphorylated  nicotinic 
acetylcholine  receptor. 
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We  are  currently  using  protein  purification  and 
molecular  cloning  techniques  to  characterize  the 
protein  tyrosine  kinases  that  phosphorylate  the 
receptor  and  the  phosphotyrosine  protein  phos- 
phatases that  dephosphorylate  the  receptor.  We 
recently  identified  several  cDNA  clones  for  dif- 
ferent types  of  protein  tyrosine  kinases  that  are 
expressed  in  cells  enriched  in  the  nicotinic  re- 
ceptor and  are  attempting  to  determine  which  of 
these  protein  tyrosine  kinases  phosphorylate  the 
receptor.  In  addition,  we  recently  purified  the 
phosphotyrosine  protein  phosphatase  that  de- 
phosphorylates  the  tyrosine-phosphorylated  ace- 
tylcholine receptor  and  are  using  molecular 
cloning  techniques  to  isolate  cDNA  clones  for 
this  phosphotyrosine  protein  phosphatase. 

What  are  the  functional  effects  of  phosphoryla- 
tion of  the  receptor  by  these  protein  kinases?  We 
have  examined  this  question  directly  by  studying 
the  properties  of  the  purified  and  reconstituted 
receptor  phosphorylated  to  different  degrees  by 
the  various  protein  kinases.  Phosphorylation  of 
the  receptor  on  the  7-  and  6-subunits  by  cAMP- 
dependent  protein  kinase  dramatically  increases 
the  rate  of  desensitization  of  the  receptor.  Desen- 
sitization  is  the  process  by  which  the  receptor  is 
reversibly  inactivated  in  the  continued  presence 
of  the  neurotransmitter  acetylcholine.  Recently 
we  have  extended  these  studies  to  examine  the 
effect  of  phosphorylation  of  the  receptor  on  the 
(8-,  7-,  and  6-subunits  by  the  protein  tyrosine  ki- 
nase. Phosphorylation  of  the  receptor  by  the  pro- 
tein tyrosine  kinase  also  regulates  the  rate  of  de- 
sensitization of  the  receptor.  These  studies 
provide  direct  evidence  that  protein  phosphory- 
lation of  the  nicotinic  acetylcholine  recep- 
tor regulates  its  physiological  properties  and 
plays  a  role  in  modulating  its  sensitivity  to 
acetylcholine. 

We  have  also  used  site-specific  mutagenesis 
techniques  to  mutate  the  phosphorylation  sites 
on  the  receptor  subunits.  Mutant  receptor  sub- 
units  lacking  phosphorylation  sites  have  been  ex- 
pressed in  Xenopus  oocytes,  in  collaboration 
with  Gary  Yellen  (HHMI,  the  Johns  Hopkins  Uni- 


versity School  of  Medicine) .  The  desensitization 
and  regulation  of  desensitization  of  these  recep- 
tors by  protein  phosphorylation  are  now  being 
analyzed  and  compared  with  normal  receptors. 

Using  muscle  cell  cultures  that  are  highly 
enriched  in  the  acetylcholine  receptor,  we  have 
investigated  the  regulation  of  the  phosphoryla- 
tion of  the  receptor  by  neurotransmitters,  hor- 
mones, and  neuropeptides.  Calcitonin  gene- 
related  peptide  (CGRP),  a  neuropeptide  that  is 
released  from  the  presynaptic  neuron  with  ace- 
tylcholine, increases  the  intracellular  levels  of 
cAMP  and  thereby  regulates  the  phosphorylation 
of  the  receptor  by  the  cAMP-dependent  protein 
kinase.  In  addition,  studies  in  our  laboratory  sug- 
gest that  acetylcholine  itself  regulates  intracellu- 
lar levels  of  calcium  and  thereby  regulates  the 
phosphorylation  of  its  own  receptor  by  the  cal- 
cium- and  diacylglycerol-dependent  protein  ki- 
nase. We  have  also  demonstrated  that  tyrosine 
phosphorylation  of  the  nicotinic  receptor  is  regu- 
lated by  the  neurons  that  synapse  on  muscle. 
More  recently,  in  collaboration  with  Bruce  Wal- 
lace (University  of  Colorado  Health  Sciences 
Center),  we  have  found  that  agrin,  an  extracellu- 
lar matrix  protein,  may  be  the  factor  from  neu- 
rons that  regulates  tyrosine  phosphorylation  of 
the  receptor.  Agrin  appears  to  be  secreted  from 
neurons  and  to  induce  receptor  clustering  under 
the  nerve  during  synapse  formation.  These  results 
suggest  that  agrin-induced  tyrosine  phosphoryla- 
tion of  the  receptor  may  be  involved  in  the  induc- 
tion of  clustering  of  the  receptor  at  the  synapse. 

Our  recent  studies  of  the  glutamate  receptor 
and  the  GABA^  receptor  have  paralleled  our  stud- 
ies of  the  nicotinic  receptor.  We  are  using  protein 
purification,  reconstitution,  and  site-specific 
mutagenesis  techniques  to  determine  the  compo- 
nents required  for  neurotransmitter  receptor 
function  and  to  study  the  role  of  protein  phos- 
phorylation in  the  regulation  of  their  function. 
These  studies,  combined  with  our  studies  of  the 
nicotinic  acetylcholine  receptor,  are  likely  to 
demonstrate  that  protein  phosphorylation  of  neu- 
rotransmitter receptors  is  a  primary  mechanism 
in  the  regulation  of  synaptic  transmission. 
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OUR  laboratory  studies  molecular  aspects  of 
signal  transduction  processes  responsible 
for  vision.  Vertebrate  photoreceptor  cells  re- 
spond to  a  light  flash  via  G  protein-mediated  ac- 
tivation of  a  cyclic  GMP  phosphodiesterase.  A 
light  flash  hyperpolarizes  a  photoreceptor  be- 
cause hydrolysis  of  cGMP  shuts  down  dependent 
plasma  membrane  cation  channels.  In  darkness 
Ca^"^  enters  the  cell  through  these  channels,  but  a 
flash  blocks  its  entry.  The  resulting  decrease  in 
cytosolic  Ca^^  activates  guanylate  cyclase,  which 
resynthesizes  cGMP  so  that  the  cell  recovers. 

Invertebrate  photoreceptors  respond  very  dif- 
ferently. In  these  cells  light  activates  phospholi- 
pase  C,  which  produces  inositol  triphosphate  and 
diacylglycerol  as  second  messengers. 

Much  of  our  work  focuses  on  the  actions  of  G 
proteins,  since  many  aspects  of  the  visual  re- 
sponse depend  on  this  class  of  heterotrimeric  sig- 
nal transduction  enzymes.  G  proteins  bind  GTP 
in  response  to  receptor  stimulation,  and  the  GTP- 
charged  a-subunit  then  dissociates  from  a  com- 
plex of  13-  and  7-subunits  to  regulate  the  activity 
of  an  effector  enzyme  or  channel.  There  are  at 
least  1 5  different  G  protein  a-subunits  and  4  dif- 
ferent types  of  (S-subunits.  This  genetic  diversity 
suggests  that  G  proteins  have  a  variety  of  func- 
tions and  that  G  protein  activities  have  been  fine- 
tuned  to  generate  specific  cellular  responses. 
The  G  proteins  that  mediate  phototransduction 
in  rod  and  cone  photoreceptors  are  called 
transducins. 

Vertebrate  Phototransduction 

Ca^^  mediates  vertebrate  rod  cell  recovery 
from  a  light  flash.  During  the  past  year  we  identi- 
fied a  photoreceptor  Ca^"^-binding  protein, 
named  recoverin,  that  activates  guanylate  cyclase 
only  at  free  Ca^"^  concentrations  below  300  nM. 
Recoverin  couples  the  light-induced  loss  of  Ca^^ 
from  the  cell  to  guanylate  cyclase  activation  and 
recovery  from  photoexcitation.  In  collaboration 
with  Alexander  Dizhoor,  a  visiting  scientist  from 
the  USSR,  we  purified  this  protein,  demonstrated 
its  Ca^^-binding  properties,   cloned  and  se- 


quenced it,  and  demonstrated  its  ability  to  acti- 
vate guanylate  cyclase. 

Once  stimulated  by  light  to  bind  GTP,  transdu- 
cin  slowly  loses  its  bound  GTP  and  its  ability  to 
activate  phosphodiesterase.  Photoreceptor  cells 
recover  from  a  light  flash  within  a  couple  of  sec- 
onds, but  the  kinetics  of  GTP  hydrolysis  and 
phosphodiesterase  deactivation  are  slower,  about 
20-30  seconds.  To  clarify  the  role  that  GTP  hy- 
drolysis plays  in  the  photoresponse,  we  have  pro- 
duced transgenic  mice  that  express  a  mutant 
transducin  expected  to  hydrolyze  GTP  more 
slowly  than  its  normal  counterpart.  Preliminary 
analyses  suggest  that  the  rod  photoresponses 
from  these  mice  are  normal  except  in  the  pres- 
ence of  a  high  background  light.  Under  those 
conditions,  the  cells  appear  to  be  abnormally 
sensitive. 

Drosophila  Vision 

Biochemical  and  physiological  evidence  sug- 
gests that  a  G  protein  mediates  phototransduc- 
tion in  the  eyes  of  invertebrates  by  activating 
phospholipase  C.  We  had  previously  character- 
ized a  Drosophila  G  protein  /3-subunit  encoded 
by  a  gene  referred  to  as  GBB,  and  we  expected  to 
find  this  protein  in  Drosophila  eyes.  To  our  sur- 
prise, in  situ  hybridization  studies  revealed  that 
GBB  mRNA  is  absent  from  Drosophila  photore- 
ceptor cells. 

We  also  found,  however,  that  a  monoclonal  an- 
tibody raised  against  the  GBB  protein  reacts  with 
the  entire  Drosophila  nervous  system,  including 
the  eyes.  In  search  of  a  protein  in  the  eyes  that 
reacts  with  this  antibody,  we  used  it  to  screen  a 
Drosophila  head  cDNA  expression  library.  The 
protein  we  identified  is  50  percent  identical  to 
the  GBB  gene  product,  and  the  gene  that  encodes 
it,  QBE,  is  expressed  only  in  the  eye.  It  is  likely 
that  GBE  encodes  the  18-subunit  of  the  Drosoph- 
ila phototransduction  G  protein,  but  we  need  to 
do  genetic  and  biochemical  experiments  to  con- 
firm this. 

A  Drosophila  G  protein  a-subunit,  DGOa,  is 
absent  from  photoreceptors  but  present  in  the 
lamina,  a  layer  within  the  Drosophila  eye  that 
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processes  photoreceptor  signals.  The  DGOa  pro- 
tein is  a  substrate  for  pertussis  toxin,  an  ADP-ribo- 
syl  transferase  that  inactivates  GOa  subunits.  We 
produced  transgenic  Drosophila  that  express  a 
heat-shock-inducible  pertussis  toxin  gene  in 
most  tissues  of  the  fly,  including  the  lamina.  Per- 
tussis toxin  expression  in  these  flies  alters  a 
transient  component  of  the  electroretinograms 
normally  generated  in  the  lamina.  The  photore- 


ceptor component  of  the  electroretinograms  is 
normal,  suggesting  that  pertussis  toxin  has  un- 
coupled a  normal  photoreceptor  response  from 
its  perception  in  the  brain. 

The  long-term  objective  addressed  in  these 
projects  is  to  identify  the  biochemical  mecha- 
nisms by  which  cells  determine  their  responses 
to  stimuli  in  order  to  perform  specific  physiologi- 
cal functions. 
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MOST  cells  in  the  body  adhere  to  their  neigh- 
bors and  to  the  extracellular  matrix,  a  com- 
plex array  of  proteins  that  comprise  a  fibrillar 
meshwork  throughout  the  body.  Cell  adhesion 
plays  important  roles  in  the  normal  functions  of 
cells,  contributing  to  cellular  organization,  struc- 
ture, and  metabolism.  During  embryological  de- 
velopment, cell  adhesion  is  important  for  the 
movements  of  cells  that  contribute  to  modeling 
of  the  embryo.  In  the  adult,  appropriate  cell  ad- 
hesion is  necessary  for  numerous  physiological 
processes. 

For  example,  in  the  blood,  cells  known  as  plate- 
lets adhere  to  the  walls  of  blood  vessels  that  are 
damaged  and  help  to  prevent  bleeding.  This 
adhesion  process  is  essential  to  protect  against 
hemorrhage.  On  the  other  hand,  it  is  equally  im- 
portant that  platelets  should  not  adhere  at  inap- 
propriate times.  If  they  do,  the  result  is  thrombo- 
sis. Thus  the  control  of  platelet  adhesion  is  a 
matter  of  life  and  death.  Other  blood  cells  in- 
volved in  defense  mechanisms  during  infection 
or  inflammation  need  to  adhere  to  the  walls  of 
blood  vessels  at  the  sites  of  infection  in  order  to 
emigrate  into  the  affected  tissues. 

Another  process  involving  cell  adhesion  and 
migration  is  wound  healing.  When  skin  is  dam- 
aged, the  skin  cells  migrate  in  over  the  wound  to 
cover  it.  The  processes  of  cell  migration  involved 
in  wound  healing  have  much  in  common  with 
those  occurring  during  development. 

A  final  example  is  that  of  cancer.  Tumor  cells 
exhibit  altered  adhesion,  both  to  one  another  and 
to  their  surroundings.  This  altered  adhesion  is 
thought  to  be  involved  in  the  invasion  and  metas- 
tasis of  tumor  cells. 

These  examples  illustrate  the  importance  of 
appropriate  adhesion  of  cells  to  their  surround- 
ings. Our  laboratory  is  involved  in  the  molecular 
analysis  of  these  processes.  We  seek  to  under- 
stand the  proteins  involved  in  cell  adhesion  and 
how  they  control  adhesion  and  migration  of  cells 
in  both  normal  and  pathological  processes. 

Two  main  classes  of  proteins  interest  us.  The 
first  comprises  the  large  proteins  that  make  up 
the  extracellular  matrix.  These  proteins  cooper- 
ate to  build  a  fibrillar  meshwork  to  which  the 


cells  attach  and  on  and  through  which  they  mi- 
grate. We  have  investigated  several  of  these  pro- 
teins, which  we  refer  to  as  "nectins"  to  denote 
their  role  in  binding  to  cells.  Fibronectins,  a 
closely  related  group  of  proteins  all  encoded  by  a 
single  gene,  are  the  best  understood  of  these  nec- 
tins. We  and  others  have  analyzed  the  functions 
and  the  structure  of  these  proteins.  This  work  is 
leading  to  a  deeper  understanding  of  their  roles 
in  cell  behavior.  For  example,  it  is  now  known 
that  fibronectins  have  several  sites  in  each  mole- 
cule that  bind  cells.  The  detailed  structure  of 
these  binding  sites  is  being  elucidated.  One  in- 
triguing observation  is  that  fibronectins  share 
with  many  other  nectins  a  common  recognition 
site  made  up  of  only  three  amino  acids.  This  site 
(designated  RGD  in  the  single-letter  amino  acid 
code)  is  recognized  by  receptor  molecules  on 
cell  surfaces.  This  interaction  can  be  blocked  by 
antibodies  to  the  nectins  or  to  the  receptors, 
which  are  known  as  integrins,  or  by  competitor 
peptides  containing  the  RGD  sequence.  Such 
blockades  interfere  with  the  cell-adhesive  inter- 
actions involved  in  the  physiological  processes 
discussed  above.  Recent  work  has  identified 
other  cell-binding  sites  within  fibronectins, 
which  are  recognized  by  different  receptors.  The 
interaction  of  these  cells  with  these  molecules  is 
complex,  which  is  as  expected,  given  the  partici- 
pation of  cell  adhesion  in  many  diverse  cellular 
functions. 

Our  second  major  focus  of  interest  is  the  family 
of  integrin  receptors.  These  comprise  a  family  of 
related  cell  surface  receptors,  each  composed  of 
two  subunits.  Each  integrin  receptor  has  a  partic- 
ular specificity  for  certain  nectins  and  mediates 
the  interactions  of  cells  with  the  extracellular 
matrix.  In  addition,  the  integrins  connect  to  the 
inside  of  the  cell,  where  they  mediate  interac- 
tions with  the  internal  structures,  or  cytoskele- 
ton,  of  the  cell  that  are  involved  in  the  shape, 
organization,  and  migration  of  cells.  This  integra- 
tion of  the  organization  of  the  extracellular  ma- 
trix with  the  cytoskeleton  is  one  of  the  origins  of 
the  name  "integrins." 

We  have  recently  made  progress  in  analyses  of 
the  role  of  the  intracellular  portions  of  integrin 
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receptors  in  interactions  with  the  cytoskeleton. 
We  have  also  obtained  detailed  structural  infor- 
mation about  the  cytoskeletal  protein  talin, 
which  is  a  primary  candidate  for  interactions  with 
integrins.  Further  progress  along  these  lines 
should  help  explain  the  effects  of  cell  adhesion 
on  cell  structure  and  behavior. 
"  Using  the  methods  of  cell  and  molecular  biol- 
ogy, we  are  studying  the  structure  and  function  of 
fibronectins  and  integrins,  their  interactions,  and 
their  roles  in  various  physiological  processes,  in- 
cluding development,  blood  clotting,  inflamma- 
tion, wound  healing,  and  cancer.  We  observe  reg- 
ulated expression  of  these  molecules  during 
these  processes,  and  it  is  clear  that  these  mole- 
cules are  crucial  for  the  appropriate  behavior  of 
cells.  For  instance,  altered  expression  of  both  fi- 
bronectins and  integrins  in  tumor  cells  contrib- 
utes to  their  wayward  behavior,  and  expression  of 
these  proteins  is  altered  during  wound  healing. 
We  have  used  recombinant  DNA  methods  to  pro- 
duce specific  and  modified  forms  of  fibronectins 
and  integrins  and  investigate  the  ways  in  which 
they  affect  the  behavior  of  individual  cell  types 
such  as  platelets. 


To  extend  our  understanding  of  the  roles  of 
fibronectins  and  integrins  in  intact  organisms,  we 
use  genetic  analyses  in  two  animal  systems.  We 
are  analyzing  the  role  of  integrins  during  the  de- 
velopment of  Drosophila  melanogaster,  a  fruit 
fly  that  is  suitable  for  genetic  analyses.  Flies  with 
mutations  in  genes  encoding  integrins  have  de- 
fects in  embryonic  development,  in  muscle  func- 
tion, and  in  the  development  of  wings  and  eyes. 
Analyses  of  these  defects  provide  insight  into  the 
functions  of  these  proteins.  We  have  also  gener- 
ated strains  of  mice  that  are  mutant  for  fibronec- 
tins, and  we  plan  to  investigate  the  effects  of 
these  mutations  on  hemostasis,  thrombosis, 
wound  healing,  and  tumor  development.  We  are 
also  working  to  generate  mice  with  mutations  in 
integrins  and  in  other  molecules  involved  in  cell 
adhesion. 

These  studies  should- provide  a  deeper  under- 
standing of  the  molecular  basis  of  cell  adhesion 
and  its  involvement  in  physiological  and  patho- 
logical processes.  This  understanding,  in  turn, 
should  provide  opportunities  for  therapeutic 
treatments  of  diseases  such  as  thrombosis  and 
cancer. 


Opposite:  Sections  of  the  eyes  o/Drosophila  melanogaster,  showing  the  regular  array  of  photore- 
ceptors attached  to  a  basement  membrane  in  normal  ( wild-type )  retina  and  the  disruption  of  this 
organization  in  eyes  of  flies  with  mutations  in  integrins. 
Research  of  Susan  Zusman  and  Richard  O.  Hynes. 
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MANY  species  of  bacteria  are  capable  of  caus- 
ing diseases  by  colonizing  and  growing 
within  human  hosts,  using  tactics  that  avoid  nor- 
mal immune  responses.  As  part  of  a  general  strat- 
egy to  establish  an  infectious  niche,  a  variety 
of  microorganisms  cause  diseases  by  entering 
and  growing  inside  human  cells  soon  after  en- 
counter. Bacteria  that  establish  infections  in  this 
manner  are  called  intracellular  microorgan- 
isms. Among  the  diseases  they  cause  are  tubercu- 
losis and  the  most  common  types  of  sexually 
transmitted  and  food-borne  diseases  found  in  the 
industrialized  world.  Despite  the  prevalence  of 
such  infections,  there  was  little  information  until 
recent  years  on  the  factors  expressed  by  these  mi- 
croorganisms that  allow  them  to  enter  host  cells 
and  thrive. 

The  objectives  of  our  research  are  to  investi- 
gate two  important  aspects  of  the  life-style  of  in- 
tracellular microorganisms.  First,  we  would  like 
to  determine  at  the  molecular  level  how  these 
organisms  can  invade  human  cells  that  do  not 
normally  internalize  bacteria.  Second,  we  want  to 
analyze  factors  they  encode  that  allow  them  to 
survive  and  grow  within  the  ordinarily  hostile  en- 
vironment of  human  cells.  Our  main  approach 
has  been  to  identify  bacterial  species  that  enter  or 
grow  particularly  well  within  host  cells  and  to 
develop  genetic  and  biochemical  techniques  for 
analyzing  their  strategies.  The  primary  rationale 
for  this  approach  is  that  it  provides  insights  into 
basic  processes  that  are  applicable  to  numerous 
intracellular  microorganisms. 

To  investigate  the  molecular  mechanism  of  bac- 
terial entry  into  host  cells,  we  have  been  analyz- 
ing the  bacterium  Yersinia  pseudotuberculosis, 
an  organism  that  causes  an  intestinal  disease  often 
accompanied  by  infection  of  multiple  organ  sys- 
tems. This  microorganism  is  perhaps  the  most  ef- 
ficient bacterium  at  entering  into  human  cells 
grown  in  culture,  and  this  has  greatly  facilitated 
analysis. 

To  investigate  intracellular  growth,  we  have 
been  analyzing  Legionella  pneumophila,  the 
causative  agent  of  Legionnaire's  disease  pneumo- 
nia. The  intracellular  growth  process  of  this  bac- 


terium is  very  similar  to  that  of  a  wide  range  of 
intracellular  microorganisms,  and  development 
of  molecular  strategies  for  analyzing  it  has  been 
relatively  straightforward. 

Yersinia  pseudotuberculosis  Entry  Into 
Cultured  Human  Cells 

Y.  pseudotuberculosis  can  enter  host  cells  via 
three  different  paths.  For  each  path  the  microor- 
ganism apparently  encodes  a  unique  set  of  pro- 
tein factors  to  be  used  at  different  tissue  sites  dur- 
ing the  infection  process.  We  have  focused  on  the 
path  that  is  promoted  by  the  protein  invasin,  the 
product  of  the  bacterial  inv  gene.  Invasin  is  a 
103-kDa  protein  on  the  surface  of  the  bacterium 
that  allows  it  to  enter  human  cells  by  binding 
receptor  molecules  on  their  surface. 

We  have  shown  that  the  host  cell's  post-bind- 
ing uptake  requires  only  a  20-kDa  region  of  inva- 
sin. Host  cells  can  internalize  a  variety  of  bacte- 
rial species  or  even  inert  latex  particles  that  are 
coated  with  this  relatively  small  region  of  the 
protein.  Evidently  invasin's  sole  function  is  to 
present  this  binding  region  to  host  cell  receptors 
in  order  to  permit  entry  into  the  cell.  After  the 
binding  occurs,  the  host  cells  do  most  of  the  work 
in  internalizing  the  bacterium. 

Invasin  binds  at  least  four  different  receptors. 
Called  integrins,  these  had  been  previously  iden- 
tified by  investigators  interested  in  a  variety  of 
mammalian  cell-adhesion  processes.  The  particu- 
lar integrin  receptors  that  bind  invasin  can  adhere 
to  a  variety  of  mammalian  proteins,  such  as  fibro- 
nectin  and  molecules  that  allow  adhesion  of  im- 
mune response  cells  to  inflamed  tissues.  It  is  also 
well  known  that  members  of  this  receptor  family 
interact  in  some  functional  fashion  with  an  im- 
portant host  cell  structure  known  as  the 
cytoskeleton. 

Although  invasin  binds  these  well-character- 
ized receptors,  there  is  no  obvious  sequence  simi- 
larity between  invasin  and  other  proteins  that 
bind  integrins,  and  mutations  that  eliminate  the 
interaction  between  invasin  and  its  receptors 
identify  amino  acid  residues  not  previously 
shown  to  be  involved  in  integrin  binding. 
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Our  investigation  of  invasin/integrin  has  led  to 
a  model  for  Yersinia  uptake  into  host  cells.  Bind- 
ing of  invasin  to  its  integrin  receptor  leads  to  rear- 
rangement of  the  cytoskeleton — rearrangement 
requisite  to  entry.  A  signal  must  be  sent  to  cause 
the  host  cell  to  internalize  the  microorganism, 
and  the  internalization  is  facilitated  by  the  ex- 
traordinary avidity  with  vi'hich  invasin  binds  its 
receptors.  Other  proteins  that  bind  the  identical 
integrins  cannot  produce  this  signal  so  effi- 
ciently, because  they  do  not  bind  the  receptors 
tightly. 

So,  invasin  appears  to  promote  entry  of  the  mi- 
croorganism because  it  binds  an  important  recep- 
tor that  communicates  with  the  cell  cytoskeleton, 
and  because  it  binds  so  tightly  to  this  receptor. 

Legionella  pneumophila  Growth  in 
Phagocytic  Cells 

L.  pneumophila  causes  a  variety  of  diseases  in 
humans,  including  Legionnaire's  disease  pneumo- 
nia. The  bacterium  grows  in  lung  tissues  after  en- 
counter with  its  human  host.  Its  favorite  habitat  is 
within  alveolar  macrophages,  cells  that  normally 
function  to  kill  invading  microorganisms.  An  im- 
portant mechanism  for  macrophages  to  kill  or  in- 
hibit the  growth  of  a  microorganism  is  to  interna- 
lize it  and  sequester  it  in  a  compartment  called  a 
phagosome,  which  in  turn  fuses  with  a  lysosomal 
compartment  filled  with  antibacterial  factors.  L. 


pneumophila  is  able  to  grow  within  the  phago- 
some, convert  it  into  an  organelle  with  a  unique 
morphology,  and  prevent  the  introduction  of  the 
antibacterial  lysosomal  components  into  this  site. 

We  have  been  interested  in  determining  how  L. 
pneumophila  is  able  to  establish  and  grow  within 
this  protective  niche.  Our  approach  has  been  to 
isolate  mutations  in  this  bacterium  that  prevent  it 
from  growing  intracellularly.  Three  easily  distin- 
guishable classes  of  mutants  have  been  isolated. 
The  first  class  causes  the  bacterium  to  be  internal- 
ized by  a  macrophage  via  a  novel  pathway,  and 
this  causes  an  extreme  defect  in  bacterial  growth. 
The  second  class,  and  most  easily  isolated,  con- 
sists of  mutants  that  are  no  longer  able  to  prevent 
the  lysosomal  contents  from  being  introduced 
into  the  phagosome.  The  third  class  appears  nor- 
mal for  uptake  as  well  as  for  shutting  out  the  lyso- 
somal components,  but  the  phagosome  contain- 
ing the  mutant  microorganism  no  longer  exhibits 
the  unique  morphology  usually  found  in  a  Le- 
gionella infection. 

These  classes  of  mutants  indicate  that  the  mi- 
croorganism performs  a  distinct  series  of  steps 
within  the  macrophage,  each  of  which  contrib- 
utes to  the  parasite's  efficient  growth.  We  are 
currently  trying  to  identify  the  factors  missing  in 
these  mutants,  using  a  combination  of  molecular 
and  genetic  techniques,  in  hopes  of  determining 
how  the  bacterium  is  able  to  perform  each  of 
these  self-serving  steps. 
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TUBERCULOSIS,  caused  by  Mycobacterium 
tuberculosis,  continues  to  be  the  major 
cause  of  death  throughout  the  world  today.  The 
World  Health  Organization  estimates  that  each 
year  there  are  approximately  10  million  new 
cases  of  tuberculosis  and  over  3  million  deaths. 
After  32  years  of  a  steadily  decreasing  incidence 
in  the  United  States,  a  surprising  and  alarming 
increase  in  the  numbers  of  new  cases  has  been 
reported  in  many  of  our  cities  in  the  last  four 
years.  This  is  thought  to  be  a  result  of  the  AIDS 
(acquired  immune  deficiency  syndrome)  epi- 
demic and  is  of  considerable  concern.  Tubercu- 
losis is  not  only  a  disease  common  to  AIDS  pa- 
tients but  is  also  one  of  the  few  diseases  that  can 
be  readily  spread  from  persons  with  AIDS  to  the 
general  population. 

Another  microorganism  of  the  same  genus,  My- 
cobacterium avium,  although  not  a  pathogen  to 
healthy  individuals,  is  a  major  opportunistic 
pathogen  in  AIDS.  Mycobacterium  leprae  is  the 
causative  agent  of  leprosy,  an  affliction  dating 
back  to  ancient  times  that  affects  over  1 3  million 
people  in  the  world  today. 

In  contrast  to  the  pathogens,  BCG  (bacille  Cal- 
mette-Guerin) ,  the  tuberculosis  vaccine,  has 
been  used  through  parts  of  the  world  since  1922 
to  prevent  tuberculosis.  We  believe  that  BCG  rep- 
resents an  ideal  candidate  as  a  recombinant  vac- 
cine vector  containing  foreign  antigen  genes,  be- 
cause it  is  safe,  has  excellent  adjuvant  properties, 
and  should  elicit  long-lasting  immunity.  Thus  the 
goals  of  my  laboratory  are  1)  to  dissect  patho- 
genic mycobacteria  genetically  in  order  to  de- 
velop effective  strategies  to  cure  and  prevent  my- 
cobacterial infections  and  2)  to  engineer  BCG  as 
a  recombinant  vaccine  vector. 

Historically  mycobacteria  have  played  a  promi- 
nent role  in  the  development  of  microbiology. 
Robert  Koch,  in  1882,  established  the  criteria 
(Koch's  postulates)  by  which  one  ascertains 
whether  an  organism  causes  an  infectious  disease 
when  he  established  that  the  tubercle  bacillus 
causes  tuberculosis.  By  analogy,  "Koch's  molecu- 
lar postulates"  is  the  method  by  which  one  ascer- 
tains that  a  characteristic  of  a  bacterium,  such  as 


its  virulence,  is  caused  by  a  particular  gene.  The 
essential  steps  involve  1 )  identification  of  a  mu- 
tant of  the  bacterium  that  lacks  some  characteris- 
tic, 2)  cloning  of  an  individual  gene(s)  from  the 
parent  bacterium,  and  3)  transfer  of  the  cloned 
normal  genes  into  the  bacterial  mutant  to  demon- 
strate restoration  of  the  original  characteristic. 
For  example,  to  identify  a  gene  required  for  viru- 
lence of  M.  tuberculosis,  we  would  1)  identify 
an  avirulent  variant  of  the  organism,  2)  clone  the 
genes  from  the  virulent  strain,  and  3)  introduce 
the  putative  virulence  gene  back  into  the  aviru- 
lent mutant  to  demonstrate  that  this  gene  confers 
virulence. 

For  the  mycobacteria,  these  sorts  of  experi- 
ments entail  considerable  difficulties.  The  organ- 
isms have  been  objects  of  study  since  Koch's  pio- 
neering work,  but  their  genetic  analyses  had  not 
been  achieved,  primarily  because  of  their  slow 
growth.  The  tubercle  bacillus,  which  multiplies 
only  once  every  24  hours,  requires  three  weeks 
to  form  a  colony  from  a  single  cell.  In  contrast, 
Escherichia  coli  yield  visible  colonies  in  eight 
hours.  The  leprosy  bacillus  has  yet  to  be  culti- 
vated in  the  laboratory  and  can  only  be  grown  in 
mouse  footpads  or  the  nine-banded  armadillo. 
Furthermore,  although  recombinant  DNA  tech- 
nology has  enabled  us  to  clone  any  mycobacterial 
gene,  the  technologies  to  transfer  recombinant 
DNA  back  into  mycobacteria  did  not  exist  five 
years  ago. 

We  took  advantage  of  bacterial  viruses  that  in- 
fect mycobacteria,  called  mycobacteriophages, 
as  the  building  blocks  for  developing  vectors  ca- 
pable of  transferring  recombinant  DNA  effi- 
ciently into  mycobacterial  cells.  A  novel  hybrid 
vector,  called  a  shuttle  phasmid,  was  con- 
structed. It  replicates  in  mycobacteria  as  a  phage 
and  in  E.  coli  as  a  plasmid,  which  permitted  us  to 
introduce  cloned  genes  into  a  wide  variety  of  my- 
cobacterial strains  for  the  first  time. 

Using  these  vectors,  we  identified  selectable 
marker  genes  that  have  allowed  us  to  develop 
phage-  and  plasmid-derived  vectors,  as  well  as 
systems  to  insert  genes  into  the  mycobacterial 
chromosomes.  We  have  isolated  mutants  of  a  fast- 
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growing  mycobacterium,  M.  smegmatis.  These 
exhibit  high  plasmid  transformation  frequencies 
and  thus  provide  a  most  useful  surrogate  host  for 
analyzing  the  genes  of  slow-growing  pathogenic 
mycobacteria. 

Genetic  Analysis  of  Mycobacterial 
Virulence  Determinants 

A  major  focus  of  this  laboratory  is  to  elucidate 
the  causal  mechanisms  of  mycobacterial  diseases 
by  identifying  the  genes  responsible  for  specific 
virulence  characteristics.  The  systems  we  have  de- 
veloped allow  us  to  introduce  libraries  of  genes 
from  virulent  mycobacteria  into  avirulent  strains 
with  great  efficiency.  A  complementary  strategy 
is  the  generation  of  specific  mutants  that  no 
longer  are  virulent  in  our  animal  models.  The 
combination  of  these  strategies  has  allowed  us  to 
identify  a  number  of  genes  necessary  for  the  bio- 
synthesis of  amino  acids,  purines,  and  complex 
polysaccharides  found  on  the  surface  of  the 
pathogenic  mycobacteria.  By  identifying  the 
genes  and  their  products  responsible  for  viru- 
lence, we  hope  to  be  able  to  design  approaches  to 
the  control  of  mycobacterial  disease.  In  addition, 
the  genetically  engineered  avirulent  mutants 
should  provide  promising  vaccine  candidates. 

Genetic  Analysis  of  Drug  Targets  of  the 
Pathogenic  Mycobacteria 

Although  there  are  various  efi'ective  antibiotics 
for  tuberculosis  and  leprosy,  antibiotic-resistant 
bacteria  are  continually  emerging  as  a  major  pub- 
lic health  problem  here  and  abroad.  Moreover, 
there  is  no  effective  chemotherapy  for  M.  avium 


infections.  We  are  employing  genetic  approaches 
to  identify  the  mechanisms  of  action  of  present 
anti-mycobacterial  antibiotics  and  the  mecha- 
nisms of  resistances  to  these  antibiotics.  The  ba- 
sic information  so  acquired  should  lead  to  the 
design  of  more  effective  drugs,  strategies  to  over- 
come resistances,  and  diagnostic  tests  to  detect 
mycobacterial  strains  containing  resistance-con- 
ferring genes.  We  hope  that  improved  strategies 
for  treating  leprosy,  tuberculosis,  and  other  myco- 
bacterial infections  can  be  developed. 

Development  of  Recombinant  BCG  Vaccines 

By  cloning  and  introducing  foreign  antigen 
genes  into  BCG,  it  may  be  possible  to  develop 
novel  vaccines  that  would  protect  not  only 
against  tuberculosis  but  also  against  the  pathogen 
from  which  the  foreign  gene  was  obtained.  In  col- 
laboration with  Barry  Bloom  (HHMI,  Albert  Ein- 
stein College  of  Medicine)  and  colleagues  at 
Medlmmune  and  the  University  of  Pittsburgh,  we 
are  developing  systems  to  express  foreign  anti- 
gens from  a  variety  of  parasitic,  viral,  and  bacte- 
rial pathogens.  We  have  devoted  considerable  ef- 
fort to  elucidating  the  mycobacterial  genetic 
signals  necessary  for  expression  and  stable  repli- 
cation of  foreign  genes.  Preliminary  results  dem- 
onstrate that  high  levels  of  foreign  proteins  can 
be  produced  in  BCG  and  that  these  recombinant 
vaccines  can  elicit  both  humoral  and  cellular  im- 
mune responses  in  infected  animals.  It  is  our 
hope  that  recombinant  BCG  will  not  only  yield 
effective  vaccines  but  also  provide  a  novel  and 
useful  tool  for  studying  immune  responses  in 
mammalian  hosts. 
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VOLTAGE-sensitive  potassium  channels  proba- 
bly constitute  the  most  diverse  and  wide- 
spread class  of  ion  channels.  More  than  30  differ- 
ent types  of  potassium  channels  have  been 
characterized.  They  dilfer  in  their  voltage  sensi- 
tivity, their  kinetic  properties,  and  their  sensitiv- 
ity to  second  messengers  within  the  cell.  Potas- 
sium channels  have  been  found  in  almost  every 
eukaryotic  cell  type  examined,  in  both  the  ani- 
mal and  the  plant  kingdoms.  They  are  important 
for  a  wide  range  of  physiological  functions,  in- 
cluding insulin  release  due  to  raised  glucose  lev- 
els, proliferation  of  lymphocytes  induced  by  mi- 
togens, and  the  movements  of  leaflets  in  plants  or 
the  opening  and  closing  of  leaf  stomatal  pores.  In 
the  mammalian  nervous  system,  potassium  chan- 
nels control  excitability  and  the  strength  of  sig- 
naling between  nerve  cells.  Indeed,  some  of  the 
potassium  channels  have  been  implicated  as 
playing  a  role  in  learning  and  memory. 

In  order  to  study  how  the  diversity  of  potas- 
sium channels  arises  and  how  they  serve  the  wide 
variety  of  cellular  functions,  one  needs  to  study 
these  channels  biochemically  as  well  as  biophysi- 
cally.  However,  they  are  difficult  to  purify  be- 
cause they  are  rather  heterogeneous  and  inacces- 
sible. For  this  reason  we  have  taken  advantage  of 
the  well-developed  genetic  technologies  applica- 
ble to  the  fruit  fly  Drosophila  melanogaster.  In 
this  organism,  if  a  gene  (say,  coding  for  a  potas- 
sium channel)  can  be  identified  by  the  abnormali- 
ties caused  by  its  mutations,  one  can  clone  it  for 
molecular  studies  of  the  gene  product. 

More  than  a  decade  ago,  Yuh  Nung  Jan  (HHMI, 
University  of  California,  San  Francisco),  Mike 
Dennis,  and  I  found  that  mutations  at  the  Shaker 
locus  cause  prolonged  transmitter  release  from 
the  motor  nerve  terminal,  probably  because  of  a 
defect  in  potassium  channel  function.  Subse- 
quent studies  by  Larry  Salkoff,  Mark  Tanouye,  Al- 
berto Ferrus,  C.  F.  Wu,  and  Leslie  Timpe  provided 
strong  evidence  suggesting  that  the  Shaker  locus 
codes  for  a  component  of  a  rapidly  inactivating 


potassium  channel,  the  A  channel.  The  locus  was 
subsequently  cloned  by  Diane  Papazian,  Tom 
Schwarz,  and  Bruce  Tempel  in  our  laboratory.  It 
codes  for  proteins  that  contain  multiple  stretches 
of  hydrophobic  amino  acids  that  can  potentially 
span  the  cell  membrane. 

These  Shaker  proteins  appear  to  be  integral 
membrane  proteins,  as  indicated  by  their  se- 
quence and  by  subsequent  immunoblot  (West- 
ern) studies.  They  show  homology  to  the  se- 
quence of  vertebrate  sodium  channels,  although 
they  are  much  smaller  in  size.  They  correspond 
roughly  to  one  of  the  four  internally  homologous 
domains  of  the  sodium  channel.  Finally,  Dr. 
Timpe  demonstrated  that  RNA  encoding  for  four 
of  the  Shaker  proteins  of  known  deduced  se- 
quence, when  injected  into  frog  oocytes,  causes 
the  functional  expression  of  potassium  channels 
that  in  several  ways  resemble  the  A  channel  in  the 
fruit  fly.  Taken  together,  these  studies  showed 
that  the  Shaker  locus  is  a  potassium  channel 
gene. 

Starting  with  the  Shaker  gene  in  the  fruit  fly. 
Dr.  Tempel  isolated  a  gene  in  the  mouse  that 
codes  for  a  potassium  channel  in  the  mouse 
brain.  This  protein  is  65  percent  identical  in  its 
sequence  to  the  Drosophila  potassium  channel 
protein.  In  frog  oocytes,  the  gene  produces  potas- 
sium channels  that  do  not  inactivate  rapidly.  Anal- 
ysis of  the  distribution  of  amino  acid  residues  that 
appear  to  be  essential  and  have  been  totally  con- 
served over  600  million  years  has  provided  some 
clues  to  the  channel's  structure.  By  now  more 
than  10  different  mammalian  potassium  channel 
genes  have  been  characterized  by  a  number  of 
laboratories.  One  of  these  (the  rat  Shall  gene), 
cloned  and  characterized  in  our  laboratory  by 
Tim  Baldwin,  Meei-Ling  Tsaur,  and  George  Lo- 
pez, produces  a  rapidly  inactivating  potassium 
channel  in  frog  oocytes  and  is  expressed  in  the 
heart  as  well  as  the  brain.  The  high  degree  of  con- 
servation between  these  mammalian  and  the  fruit 
fly  potassium  channels  reiterates  the  point  that 
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any  experimental  organism,  as  long  as  it  is  amena- 
ble to  the  specific  type  of  experimentation,  will 
reveal  information  of  medical  interest. 

An  intriguing  finding  is  that  the  Shaker  potas- 
sium channel  gene  in  the  fruit  fly  gives  rise  to  a 
number  of  protein  products  by  alternative  splic- 
ing of  the  primary  RNA  transcript.  These  diff'erent 
protein  products  probably  form  diff'erent  sub- 
types of  this  potassium  channel,  because  they 
show  differential  tissue  distribution  and,  when 
expressed  in  frog  oocytes,  they  give  rise  to  chan- 
nels of  different  kinetic  properties.  Alternative 
splicing  of  the  potassium  channel  gene,  together 
with  the  potential  mix  and  match  of  subunits 
from  one  or  more  genes,  offers  a  possible  mecha- 
nism for  generating  potassium  channel  diversity. 

Having  cloned  a  potassium  channel  gene,  we 
can  now  ask  how  this  channel  works.  How  does  it 
detect  a  voltage  change  across  the  cell  membrane 
and,  responding,  open?  How  does  it  "inactivate" 
after  it  opens?  How  does  it  discriminate  between 
sodium  and  potassium  ions  and  show  exquisite 


selectivity?  To  probe  these  questions,  we  have 
altered  specific  residues  of  the  potassium  chan- 
nel to  see  how  the  various  functions  are  affected. 

For  studies  of  the  biological  functions  of  potas- 
sium channels,  we  have  chosen  to  concentrate  on 
the  mammalian  heart  and  hippocampus.  A  variety 
of  cardiac  potassium  channels  have  been  charac- 
terized biophysically  and  are  important  in  con- 
trolling the  rhythmic  heartbeat.  Molecular  stud- 
ies of  these  channels  not  only  will  contribute  to 
our  understanding  of  channel  function  but  also 
will  be  relevant  clinically,  for  example,  in  the 
development  of  more  specific  drugs  for  arrhyth- 
mia. The  hippocampus  is  a  region  of  the  mamma- 
lian brain  that  appears  to  play  an  important  role 
in  learning  and  memory.  It  has  also  been  studied 
extensively  in  experimental  paradigms  that  in- 
duce epileptic  activity.  By  cloning  and  analyzing 
potassium  channel  genes  that  are  expressed  in 
this  tissue,  we  hope  to  learn  about  the  involve- 
ment of  these  potassium  channels  in  the  normal 
function  and  pathology  of  the  nervous  system. 
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HOW  a  nervous  system  is  organized  during 
development  is  a  major  unresolved  problem 
in  biology.  For  the  last  few  years,  we  have  been 
interested  in  the  following  questions  in  neural 
development:  How  do  neurons  arise  from  undif- 
ferentiated ectodermal  cells?  What  gives  them 
their  individual  identity  as  to  shape  and  function? 

Our  long-term  goal  is  to  understand  these  pro- 
cesses at  the  molecular  level.  Our  approach,  es- 
sentially genetic,  is  first  to  isolate  mutations  that 
affect  neurogenesis,  neuronal  type,  or  axonal 
pathway  formation,  and  then  to  identify  these 
mutations,  leading  to  the  isolation  of  important 
genes. 

During  the  last  five  years,  our  laboratory  has 
been  engaged  in  an  extensive  search  and  analysis 
of  mutants  affecting  neural  development  in  Dro- 
sophila. To  identify  and  analyze  such  mutants, 
we  are  using  the  embryonic  sensory  nervous  sys- 
tem, which  has  been  characterized  in  consider- 
able detail  at  the  single-cell  level.  Roughly  half  of 
the  Drosophila  genome  has  been  screened  for 
mutations  that  alter  the  peripheral  nervous  sys- 
tem (PNS) .  This  has  resulted  in  the  identification 
of  a  number  of  genes  that  specify  cell  fate  in  the 
embryonic  fly.  Analysis  of  those  genes  led  us  to 
propose  a  model  for  "progressive  determination 
of  the  PNS." 

Early  during  embryogenesis,  cells  in  different 
locations  within  the  ectodermal  layer  acquire  un- 
equal developmental  potential,  as  a  result  of 
genes  that  specify  positions  in  the  embryo,  the 
"prepattem  genes."  These  include,  genes  that 
specify  dorsal-ventral  and  anterior-posterior 
axes.  Some  cells  apparently  acquire  the  potential 
to  become  neuronal  precursors  from  the  action  of 
"proneural  genes,"  which  include,  for  example, 
genes  of  the  achaete-scute  complex  {AS-C)  and 
daughterless  (da).  Both  AS-C  and  da  encode 
proteins  with  the  helix-loop-helix  (HLH)  struc- 
tural motif.  It  is  likely  that  AS-C  and  da  products 
form  homo-  or  heterodimers  that  bind  to  DNA  and 
regulate  the  transcription  of  target  genes,  initiat- 
ing neuronal  precursor  development. 

As  a  neuronal  precursor  forms,  it  inhibits  neigh- 


boring cells  from  assuming  this  role.  Such  "lat- 
eral inhibition"  involves  the  action  of  "neuro- 
genic genes."  Removing  the  function  of  any  of 
the  six  known  zygotic  neurogenic  genes — Notch 
(IV),  Delta  (Dl),  the  Enhancer  of  split  complex 
[E(spl)-C\,  mastermind  (mam),  neuralized 
(neu),  and  big  brain  (bib) — leads  to  hyper- 
trophy of  both  the  central  nervous  system  (CNS) 
and  the  PNS,  presumably  as  a  result  of  losing  lat- 
eral inhibition. 

There  appear  to  be  at  least  two  independent 
cell-cell  interaction  pathways.  One  is  mediated 
by  the  gene  products  of  A'^and  Dl,  both  encoding 
for  membrane  proteins  with  epidermal  growth 
factor  (EGF)-like  repeats.  E(spl)-C,  mam,  and 
perhaps  neu  are  involved.  The  second  pathway  is 
mediated  by  bib,  which  encodes  a  membrane 
protein  with  significant  homology  to  the  bovine 
major  intrinsic  protein  (MIP),  soybean  nodulin 
26,  and  E.  co// glycerol  facilitator,  which  allows 
passive  transport  of  small  molecules  such  as  gly- 
cine. The  commitment  of  neuronal  precursors 
may  involve  the  actions  of  certain  "master  regula- 
tory genes,"  which  lock  a  cell  into  a  particular 
fate. 

The  identity  of  a  neuronal  precursor  is  further 
specified  by  "neuronal  type  selector  genes." 
These  control  the  type  of  sensory  neuron  that  a 
precursor  will  give  rise  to.  For  example,  the  cut 
locus  is  required  for  external  sensory  organs  to 
acquire  their  correct  identity.  In  the  absence  of 
cut  function,  those  organs  are  transformed  into 
chordotonal  organs.  Normally  cut  is  expressed  in 
external  sensory  organ  precursors  but  not  in 
chordotonal  organ  precursors.  We  think  that  ex- 
pression of  CM?  gene  activity  determines  which  of 
the  two  types  of  organs  the  precursor  will  be- 
come. The  CM?  product  contains  a  homeodomain 
and  probably  acts  as  a  transcription  factor  regu- 
lating the  expression  of  downstream  differentia- 
tion genes. 

Sequence  information  on  a  number  of  genes 
involved  in  neural  development  indicates  that 
the  majority  of  these  genes  contain  a  previously 
identified  functional  motif — e.g.,  the  EGF  repeat 
in  Notch,  the  tyrosine  kinase  domain  in  seven- 
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less,  the  homeodomain  in  cut,  and  the  helix- 
loop-helix  motif  in  da.  In  each  case,  the  motif 
immediately  suggests  the  likely  mode  of  gene  ac- 
tion, which  may  be  tested  experimentally. 

The  existence  of  readily  identifiable  motifs  in 
the  majority  of  genes  involved  in  neural  develop- 
ment suggests  that  cells  have  a  limited  repertoire 
of  mechanisms  for  essential  regulatory  functions. 
These  include  various  signaling  systems  such  as 
ligands,  receptors,  second  messengers,  and  regu- 
lators of  gene  expression.  Perhaps  only  a  modest 
number  of  new  tools  had  to  be  invented  for  the 
formation  of  the  nervous  system,  and  many  of  the 
cellular  mechanisms  in  neural  development  may 
turn  out  to  be  ones  already  in  use  before  the  ner- 


vous system  evolved.  Understanding  neural  devel- 
opment may  require  an  understanding  of  the 
usage  and  manipulations  of  these  basic  functional 
motifs. 

It  is  apparent  that  many  of  the  functional  motifs 
have  been  highly  conserved  during  evolution. 
Several  hundred  million  years  separate  the  fly  and 
vertebrates,  yet  the  aforementioned  motifs — EGF 
repeat,  tyrosine  kinase  domain,  etc. — are  clearly 
conserved.  Thus  studying  development  in  organ- 
isms with  well-developed  genetics,  such  as  fruit 
fly  and  nematode,  may  provide  not  only  reason- 
ably satisfactory  answers  but  also  useful  tools 
for  the  study  of  corresponding  problems  in 
vertebrates. 
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THE  key  function  of  the  immune  system  is  to 
recognize  infectious  agents  as  foreign  and 
eliminate  them  from  the  body.  The  defense  mech- 
anisms for  the  elimination  phase  can  also  damage 
the  body's  own  tissues.  This  is  normally  avoided 
through  a  process  of  self/non-self  discrimination 
mediated  by  highly  specific  receptors  for  anti- 
gen. Our  studies  have  focused  on  rwo  aspects  of 
this  process.  First,  we  have  explored  the  nature  of 
receptors  on  T  cells  and  of  ligands  on  self  cells  in 
both  normal  immune  responses  and  experimen- 
tal autoimmune  diseases.  Second,  we  have 
probed  the  mechanisms  by  which  the  immune 
system  identifies  a  foreign  material  as  infectious. 
A  main  objective  is  to  identify  evolutionarily 
stable  microbial  constituents  that  act  on  nonclon- 
ally  distributed  receptors  and  lead  to  the  induc- 
tion of  signals  alerting  the  immune  system  to  the 
presence  of  foreign  matter. 

We  have  examined  activation  of  normal  T  cells 
that  requires  both  binding  of  ligand  to  their  clon- 
ally  distributed  receptor  and  delivery  of  "co- 
stimulatory"  signals  through  a  still  uncharacter- 
ized  receptor-ligand  mechanism.  These  studies 
have  also  revealed  a  diverse  collection  of  micro- 
bial constituents  that  can  induce  the  co-stimula- 
tor required  for  clonal  expansion  of  T  cells.  By 
means  of  two  signals — a  specific  ligand  recog- 
nized by  the  clonally  distributed  receptor,  and  a 
nonspecific  signal  derived  from  a  host  cell  upon 
activation  by  a  stable  microbial  constituent — the 
immune  system  can  discriminate  infectious  non- 
self  from  noninfectious  self.  Without  the  delivery 
of  the  co-stimulatory  factor,  no  clonal  expansion 
or  differentiation  to  effector  function  occurs. 
Thus  a  major  component  of  self/non-self  discrimi- 
nation is  mediated  by  the  system  of  nonclonally 
distributed  receptors  for  stable  microbial 
constituents. 

Immune  responses  are  triggered  by  specialized 
antigen-presenting  cells  whose  distinguishing 
feature  is  the  ability  to  express  co-stimulatory 
factors.  Immune  responses,  however,  must  be 
able  to  detect  infection  in  normal  tissue  cells  that 
do  not  express  co-stimulatory  signals.  For  this  to 
occur,  the  delivery  of  effector  responses  must  not 


be  dependent  upon  co-stimulation — and  is 
known  not  to  be  in  the  case  of  cytolytic  T  cells 
and  helper  T  cells.  A  risk,  however,  is  inherent  in 
this  aspect  of  immune  effector  function:  the  oc- 
currence of  T  cell  responses  to  foreign  microbial 
agents  that  cross-react  with  host  cells.  Such  re- 
sponses might  lead  to  autodestruction  of  host 
tissue. 

Avoidance  of  such  destruction,  however,  might 
be  effected  through  a  third  mechanism  that  we 
have  recently  revealed.  Activated  cells  that  en- 
counter ligand  in  the  absence  of  co-stimulatory 
factors,  as  on  the  surface  of  host  tissue  cells,  are 
induced  to  secrete  effector  lymphokines  and  then 
abruptly  die.  The  mechanism  of  death  involves 
interferon-7  produced  by  the  effector  T  cell,  and 
perhaps  inhibited  by  the  cytokine  interleukin-4. 
This  process  of  activation-induced  cell  death  has 
been  observed  both  in  cultured  cells  and,  re- 
cently in  other  laboratories,  in  vivo. 

If  self/non-self  discrimination  is  to  be  regu- 
lated not  only  at  the  level  of  receptor-ligand  in- 
teraction but  also  by  delivery  of  co-stimulatory 
signals,  as  required  by  all  peripheral  mechanisms 
of  tolerance,  then  there  must  also  be  a  require- 
ment for  both  the  specific  ligand  and  the  co- 
stimulator  to  be  delivered  by  a  single  cell.  This 
would  be  necessary  to  avoid  the  activation  of  T 
cells  that  recognize  ligand  on  the  surface  of  a  tis- 
sue cell  but  receive  co-stimulation  from  resident 
tissue  antigen-presenting  cells.  We  have  exam- 
ined this  requirement  for  normal  CD4  T  cells  and 
find  that  it  does  indeed  apply.  That  is,  delivery  of 
the  ligand  for  the  T  cell  receptor  by  one  cell  and 
separate  delivery  of  the  co-stimulatory  stimulus 
does  not  lead  to  T  cell  activation,  clonal  expan- 
sion, or  the  acquisition  of  effector  function.  This 
requirement  allows  antigen-presenting  cells  to 
be  distributed  throughout  our  tissues,  where  they 
can  acquire  local  viruses  and  lead  to  the  induc- 
tion of  anti-viral  immune  responses  without  jeop- 
ardizing the  critical  parameter  of  self-tolerance. 

Given  all  of  these  restraints  on  self  reactivity, 
one  might  ask  how  autoimmunity  can  ever  occur? 
One  mechanism  that  we  have  uncovered  operates 
through  the  B  lymphocyte.  Activated  T  cells  rec- 
ognizing antigen  on  the  surface  of  a  B  lymphocyte 
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can  induce  the  expression  of  co-stimulatory  activ- 
ity in  the  latter  cell.  It  has  been  known  for  some 
time  that  T  cells  reactive  to  a  foreign  epitope 
linked  to  a  self  protein  can  activate  B  cells  that 
will  produce  antibody  directed  at  the  protein. 
Our  studies  have  now  shown  that  anti-self  protein 
B  cells  become  activated  in  this  way  so  that  they 
can  now  present  the  self  protein  in  an  immuno- 
genic form  to  self  T  cells,  leading  to  a  sustained 
autoimmune  T  cell  response.  We  have  generated 
autoreactive  T  lymphocytes  by  this  means  and  are 
testing  whether  they  can  generate  a  sustained  au- 
toimmune response  in  vivo. 

Now  that  the  phenomenology  of  discrimina- 
tion of  infectious  non-self  from  noninfectious  self 
has  been  established,  and  experimental  systems 
put  in  place  to  characterize  this  response,  our 
studies  will  begin  to  focus  on  the  receptor-ligand 
pairs  involved  in  the  discrimination  of  infectious 
organisms  from  the  noninfectious  host.  A  system 
already  established  to  detect  the  increase  in  co- 
stimulatory  activity  has  revealed  that  inducers  of 
co-stimulation  act  on  virtually  all  antigen-present- 


ing cells.  Assuming  that  this  process  operates 
through  a  receptor-ligand  pair,  the  result  demon- 
strates that  such  receptors  are  nonclonally  distrib- 
uted, as  we  predicted  in  1989. 

We  propose  that  the  receptors  currently  uti- 
lized by  the  immune  system  to  discriminate  in- 
fectious from  noninfectious  are  those  that  once 
allowed  primitive  organisms  to  regulate  their  im- 
mune effector  responses.  Such  receptors  may  still 
function  in  the  early  nonclonal  phases  of  host  de- 
fense, as  well  as  in  triggering  the  induction  of 
specific  clonal  defense  as  discussed  above.  If  this 
proves  to  be  so,  the  innate  and  specific  immune 
responses  will  be  shown  to  be  linked  not  only  at 
the  effector  phase  but  also  at  the  recognitive 
phase.  Such  a  synthesis  is  the  long-term  goal  of 
our  research  in  this  area. 

On  a  more  practical  level,  identification  of  re- 
ceptors that  regulate  the  induction  of  immunity 
may  lead  to  up-  and  down-regulation  of  immune 
responses  to  achieve  the  desired  level  of  effector 
function  in  such  situations  as  allergy,  autoimmu- 
nity, graft  rejection,  and  cancer. 
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Changes  in  neural  cell  pattern  induced  by  the  notochord  and 
floor  plate.  Micrograph  shows  the  change  in  pattern  of  floor 
plate  and  motor  neuron  cell  differentiation  in  response  to  a 
grafted  notochord  in  chick  embryos.  In  this  section  of  an  em- 
bryonic chick  spinal  cord,  the  left  side  shows  a  normal  ventral 
motor  neuron  pool,  labeled  with  a  monoclonal  antibody.  A 
grafted  notochord  adjacent  to  the  right  side  of  the  spinal  cord 
induces  an  additional  floor  plate  and  ectopic  motor  neurons  in 
dorsal  regions. 
Research  of  Thomas  M.  Jessell. 
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A major  aim  of  our  research  is  to  define  the 
mechanisms  that  control  the  patterning  of 
neural  tissues  in  vertebrate  embryos.  Our  studies 
continue  to  focus  on  the  events  contributing  to 
the  differentiation  of  specific  cell  types  in  the 
developing  neural  tube.  Over  the  past  year  we 
have  obtained  evidence  that  the  floor  plate,  a  spe- 
cialized group  of  neuroepithelial  cells,  is  in- 
volved in  establishing  the  pattern  of  cell  differen- 
tiation along  the  dorsoventral  (D-V)  axis  of  the 
central  nervous  system. 

The  development  of  the  vertebrate  nervous  sys- 
tem begins  v^ith  induction  of  the  neural  plate, 
followed  by  differentiation  of  distinct  cell  types 
at  different  positions  along  the  D-V  axis  of  the 
neural  tube.  For  example,  in  the  spinal  cord,  mo- 
tor neurons  are  located  ventrally,  whereas  com- 
missural neurons  and  neural  crest  cells  appear  in 
dorsal  positions.  The  distribution  of  each  of  these 
cell  types  is  bilaterally  symmetric  with  reference 
to  the  midline  of  the  neural  tube. 

Cells  at  the  midline  of  the  neural  plate  cells 
give  rise  to  a  specialized  region  called  the  floor 
plate.  In  collaboration  with  Jane  Dodd's  labora- 
tory, we  have  developed  functional  assays  show- 
ing that  the  floor  plate  differentiation  is  induced 
by  underlying  mesodermal  cells  of  the  noto- 
chord.  We  have  also  found  that  specialized  prop- 
erties of  the  floor  plate  cells  influence  the  devel- 
opment of  other  neural  cells.  A  diffusible 
chemoattractant  released  by  the  floor  plate  or- 
ients the  growth  of  a  subset  of  developing  spinal 
cord  axons.  In  addition,  the  floor  plate  and  the 
notochord  are  sources  of  a  polarizing  signal  that 
respecifies  cell  pattern  along  the  anterior-poste- 
rior (A-P)  axis  of  the  developing  chick  limb,  mim- 
icking the  effect  of  the  putative  morphogen  reti- 
noic  acid. 

These  findings  raised  the  possibility  that  the 
floor  plate  and  notochord  may  also  be  involved  in 
controlling  the  patterning  of  cell  differentiation 
along  the  D-V  axis  of  the  developing  nervous  sys- 
tem. To  test  this  possibility,  we  have  used  anti- 
bodies directed  against  cell-specific  antigens  to 
determine  whether  the  pattern  of  cells  in  the  em- 
bryonic chick  nervous  system  changes  after  a) 


induction  of  an  additional  floor  plate  at  ectopic 
positions  in  the  neural  tube  by  a  notochord  graft 
or  b)  grafting  of  a  floor  plate  next  to  the  neural 
tube  or  c)  removal  of  the  notochord  to  prevent 
floor  plate  differentiation.  Such  manipulations 
cause  marked  changes  in  the  fate  and  position  of 
neural  cell  types.  For  example,  a  and  b  result  in 
the  appearance  of  ectopic  motor  neurons,  and  c 
results  in  the  absence  of  motor  neurons. 

Our  results  suggest  that  the  pattern  of  cell  dif- 
ferentiation within  the  neural  tube  is  established 
by  signals  originating  at  the  floor  plate  and  noto- 
chord. For  example,  the  notochord  and  floor 
plate  could  act  as  local  sources  of  a  factor  that 
diffuses  through  the  adjacent  neural  epithelium, 
establishing  a  concentration  gradient  with  its 
high  point  at  the  ventral  midline.  In  this  scheme, 
the  differentiation  of  neural  epithelial  cells  into 
distinct  classes  during  normal  development 
would  be  controlled  by  the  signal  concentration 
to  which  they  are  exposed. 

A  similar  gradient  model  has  been  proposed  to 
explain  the  pattern  of  cell  differentiation  along 
the  A-P  axis  of  the  developing  chick  wing  bud. 
A-P  pattern  in  the  bud  appears  to  be  under  the 
control  of  a  specialized  region  of  posterior  mesen- 
chyme known  as  the  zone  of  polarizing  activity 
(ZPA),  which  can  respecify  A-P  polarity  when 
grafted  to  ectopic  sites.  Retinoic  acid  mimics  the 
effects  of  the  ZPA  and  appears  to  be  distributed  in 
a  graded  manner  along  the  A-P  axis  of  the  limb 
bud,  with  its  highest  concentration  in  the  poste- 
rior mesenchyme.  On  this  basis,  it  has  been  sug- 
gested that  retinoic  acid  functions  as  an  endoge- 
nous morphogen  involved  in  establishing  axial 
polarity  in  the  developing  chick  limb. 

The  notochord  and  floor  plate,  but  not  other 
regions  of  the  neural  tube,  mimic  the  action  of 
the  ZPA  and  retinoic  acid  in  respecifying  digit 
pattern  in  the  chick  limb.  Moreover,  our  bio- 
chemical studies  show  that  the  floor  plate  can 
synthesize  morphogenetically  active  retinoids  in 
vitro.  The  ability  of  the  notochord  and  floor  plate 
to  control  the  pattern  of  cell  differentiation  in 
both  the  developing  limb  bud  and  the  neural 
tube  may  therefore  have  a  common  molecular 
basis,  possibly  involving  retinoids. 
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Although  these  studies  implicate  the  floor 
plate  as  a  source  of  retinoids,  definite  evidence 
that  the  plate,  or  in  fact  any  vertebrate  tissue,  can 
release  morphogenetically  active  retinoids  is  still 
lacking.  We  therefore  devised  a  sensitive  and  spe- 
cific assay  to  detect  the  release  of  morphogenetic 
retinoids  from  neural  tissues.  This  has  provided 
further  evidence  that  the  floor  plate  does  indeed 
release  retinoids.  The  basis  of  this  assay  is  to  use  a 
fragment  of  DNA,  which  acts  as  a  retinoic  acid- 
responsive  element,  to  drive  expression  of  a  re- 
porter gene  in  tissue  culture  cells.  Using  F9  cells 
transfected  with  the  retinoid  response  element 
placed  upstream  of  the  (8-galactosidase  (lacZ) 
gene,  it  has  been  possible  to  coculture  floor 
plate,  dorsal  neural  tube,  and  other  chick  and  rat 
tissues,  and  to  monitor  retinoid  release  by  the  ap- 
pearance of  histologically  detectable  /3-galactosi- 
dase  in  cells  adjacent  to  the  test  tissue. 

Rat  or  chick  floor  plate  tissue  was  found  to  be  a 
potent  inducer  of  /«cZ activity  in  nearby  F9  cells. 
Dorsal  neural  tube  tissue  also  induced  lacZ  ex- 
pression, but  the  incidence  of  labeled  cells, 
when  compared  with  that  observed  with  the  floor 
plate,  was  markedly  reduced.  The  intensity  of 
lacZ  expression  was  also  reduced.  These  results 
show  that  the  floor  plate  is  considerably  more 
efi'ective  than  dorsal  neural  tube  in  inducing  lacZ 
expression,  and  support  the  idea  that  the  floor 
plate  may  be  a  local  source  of  retinoids  in  the 
developing  neural  tube. 

The  evidence  that  the  floor  plate  is  a  local 
source  of  morphogenetically  active  retinoids,  to- 
gether with  the  finding  that  it  has  polarizing  activ- 
ity within  the  neural  tube  and  limb,  raises  the 


question  of  whether  the  polarizing  activity  of  the 
floor  plate  within  the  neural  tube  is  mediated  by 
retinoids.  To  address  this,  we  grafted  retinoic 
acid-soaked  beads  adjacent  to  the  neural  tube  of 
chick  embryos.  When  bead  implants  were  intro- 
duced into  early  embryos,  defects  in  neural  tube 
formation  were  frequently  observed.  However, 
the  use  of  antibodies  that  reveal  the  pattern  of 
cell  diff^erentiation  within  the  neural  tube  indi- 
cates that  a  local  source  of  retinoic  acid  does  not 
mimic  the  eff^ect  of  a  floor  plate  graft. 

There  are  several  possible  reasons  for  the  fail- 
ure of  retinoic  acid-impregnated  beads  to  mimic 
the  floor  plate.  First,  retinoic  acid  may  be  synthe- 
sized and  released  by  its  floor  plate  but  may  not, 
in  fact,  be  involved  in  neural  tube  patterning.  Sec- 
ond, retinoic  acid  may  act  in  conjunction  with 
another  floor  plate-derived  signaling  molecule. 
And  third,  the  concentration  of  retinoic  acid  dif- 
fusing from  the  bead  may  be  too  low  to  cause 
changes  in  cell  pattern.  We  are  currently  investi- 
gating these  possibilities. 

Taken  together,  our  findings  indicate  that  a  cas- 
cade of  inductive  interactions  occurring  early  in 
the  development  of  the  vertebrate  nervous  sys- 
tem has  a  central  role  in  the  patterning  of  neural 
cells.  Signals  from  the  notochord  induce  the  floor 
plate  at  the  midline  of  the  neural  plate  and  neural 
tube.  The  floor  plate  and  notochord  then  appear 
to  control  the  differentiation  of  other  neural  cells 
and  to  contribute  to  the  D-V  pattern  of  the  ner- 
vous system.  Thus  the  pattern  of  cell  differentia- 
tion in  the  neural  tube,  as  in  many  other  develop- 
ing tissues,  appears  to  depend  critically  on  the 
organizing  properties  of  specialized  cell  groups. 
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A critical  basic  biological  problem  that  re- 
mains unsolved  is  the  mechanism  of  energy 
transduction  in  biological  membranes.  A  wide 
range  of  seemingly  disparate  phenomena,  such  as 
oxidative  phosphorylation,  bacterial  motility, 
and  solute  accumulation  against  a  concentration 
gradient  (secondary  active  transport) ,  are  driven 
by  bulk-phase,  transmembrane  electrochemical 
H"*^  or  Na"*"  gradients.  However,  the  molecular 
mechanism  by  which  energy  stored  in  such  gra- 
dients is  transduced  into  work  or  energy-rich 
compounds  (e.g.,  ATP)  remains  unknown.  In 
order  to  gain  insight  into  this  process,  studies  in 
our  laboratory  have  focused  on  the  lactose  (lac) 
permease  of  Escherichia  coli  as  a  paradigm. 

The  ability  of  E.  coli  to  accumulate  the  disac- 
charide  lactose  and  other  |S-galactosides  against  a 
large  concentration  gradient  depends  on  the  lac 
permease,  a  very  hydrophobic  cytoplasmic  mem- 
brane protein  that  catalyzes  the  coupled  translo- 
cation of  these  sugars  and  with  a  stoichiome- 
try  of  unity  (i.e.,  symport  or  co-transport).  Under 
physiological  conditions,  where  the  electro- 
chemical gradient  across  the  cytoplasmic  mem- 
brane is  interior  negative  and/or  alkaline,  lac 
permease  utilizes  free  energy  released  from 
downhill  translocation  of  to  drive  accumu- 
lated (8-galactosides  against  a  concentration  gra- 
dient, the  magnitude  of  which  is  directly  related 
to  that  of  the  H"^  gradient.  In  the  absence  of  an 
gradient,  the  permease  catalyzes  the  converse  re- 
action, utilizing  free  energy  released  from  down- 
hill translocation  of  /?-galactosides  to  drive  uphill 
translocation  of  and  generating  an  electro- 
chemical gradient,  the  polarity  of  which  depends 
on  the  direction  of  the  concentration  gradient  of 
the  substrate. 

Encoded  by  the  /acFgene — the  second  struc- 
tural gene  in  the  lac  operon — the  permease  has 
been  solubilized  from  the  membrane,  purified  to 
homogeneity,  and  reconstituted  into  proteolipo- 
somes  in  a  fully  functional  state.  In  addition,  we 
have  presented  evidence  that  the  permease  is 
functional  as  a  monomer.  Circular  dichroic  mea- 
surements demonstrating  that  purified  permease 


is  about  80  percent  helical,  and  hydropathy  analy- 
sis of  the  deduced  amino  acid  sequence,  suggest  a 
secondary  structure  in  which  the  protein  is  pre- 
dicted to  have  a  short  hydrophilic  amino  ter- 
minus, 12  hydrophobic  domains  in  a-helical 
conformation  that  traverse  the  membrane  in  zig- 
zag fashion  connected  by  hydrophilic  loops,  and 
a  17-residue  hydrophilic  carboxyl-terminal  tail. 
Through  other  approaches,  we  have  confirmed 
the  general  features  of  the  model  and  demon- 
strated that  the  amino  and  carboxyl  termini  are  on 
the  cytoplasmic  surface  of  the  membrane.  More- 
over, strong  exclusive  support  for  the  topological 
predictions  of  the  1 2-helix  model  is  provided  by 
Calamia  and  Manoil's  studies  on  an  extensive  se- 
ries of  lac  permease-alkaline  phosphatase  fusion 
proteins. 

The  topology  of  polytopic  membrane  proteins 
is  thought  to  result  from  either  the  orientation  of 
the  first  amino-terminal  hydrophobic  domain  in 
the  membrane  or  from  topogenic  determinants 
dispersed  throughout  the  molecules.  We  have 
now  studied  the  insertion  and  stability  of  in- 
frame  deletion  mutants  in  lac  permease.  So  long 
as  the  first  and  last  putative  a-helical  domains  are 
retained,  stable  polypeptides  are  inserted  into 
the  membrane,  even  when  an  odd  number  of  he- 
lical domains  are  deleted.  Moreover,  when  an 
odd  number  of  helices  are  deleted,  the  carboxyl 
terminus  remains  on  the  membrane's  cytoplas- 
mic surface.  Thus  relatively  short  carboxyl-termi- 
nal domains  of  the  permease  appear  to  contain 
topological  information  sufficient  for  insertion  in 
the  native  orientation.  Finally,  permease  mole- 
cules devoid  of  even  or  odd  numbers  of  putative 
transmembrane  helices  retain  a  specific  pathway 
for  downhill  lactose  translocation. 

Previous  experiments  indicate  that  amino  acid 
residues  396-40 1  at  the  carboxyl  terminus  of  the 
last  putative  transmembrane  helix  of  lac  per- 
mease are  important  for  protection  against  pro- 
teolytic degradation  and  suggest  that  this  region 
of  the  permease  may  be  necessary  for  proper  fold- 
ing. Termination  codons  have  now  been  substi- 
tuted sequentially  for  amino  acid  codons  396  to 
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401  in  the  lacY gene.  With  respect  to  transport, 
permease  molecules  truncated  at  residue  396  or 
397  are  completely  defective,  while  those  trun- 
cated at  398  through  401  exhibit  15-25,  30-40, 
40-45,  and  70-100  percent  of  wild-type  activ- 
ity, respectively.  Wild-type  permease  or  per- 
mease truncated  at  position  401  is  stable,  while 
molecules  truncated  at  residues  400  down  to  396 
are  degraded  at  increasingly  rapid  rates.  Thus  ei- 
ther the  last  turn  of  putative  helix  XII  or  the  re- 
gion immediately  distal  is  important  for  pro- 
per folding  and  protection  against  proteolytic 
degradation. 

When  the  lacY  gene  is  restricted  into  two  ap- 
proximately equal-size  fragments  and  subcloned 
individually  or  together  under  separate  lac  opera- 
tor/promoters, the  permease  is  expressed  in  two 
portions:  1)  the  amino  terminus,  the  first  six  pu- 
tative transmembrane  helices,  and  most  of  puta- 
tive loop  7;  and  2)  the  last  six  putative  transmem- 
brane helices  and  the  carboxyl  terminus. 
Remarkably,  cells  expressing  both  fragments 
transport  lactose  to  a  steady-state  level  of  accu- 
mulation at  about  30  percent  of  the  rate  of  cells 
expressing  intact  permease.  In  contrast,  cells  ex- 
pressing either  portion  of  the  permease  indepen- 
dently do  not  transport  lactose. 

Since  intact  permease  is  completely  absent 
from  the  membrane  of  cells  expressing  /acFfrag- 
ments  either  individually  or  together,  transport 
activity  must  result  from  an  association  between 
independently  synthesized  pieces  of  lac  per- 
mease. If  the  gene  fragments  are  expressed  indi- 
vidually, the  amino-terminal  portion  of  the 
permease  is  observed  sporadically  and  the  car- 
boxyl-terminal  portion  is  not  observed.  When  the 
gene  fragments  are  expressed  together,  polypep- 
tides identified  as  the  amino-  and  carboxyl-termi- 


nal  moieties  of  the  permease  are  found  in  the 
membrane.  The  results  indicate  that  the  amino- 
or  carboxyl-terminal  halves  of  lac  permease  are 
proteolyzed  when  synthesized  independently 
and  that  association  between  the  two  comple- 
menting polypeptides  leads  to  a  more  stable,  cata- 
lytically  active  complex.  More  recent  experi- 
ments demonstrate  that  the  permease  can  be  split 
in  a  similar  fashion,  with  comparable  results,  be- 
tween putative  helix  I  and  the  rest  of  the  protein 
or  within  putative  helix  VII. 

Notwithstanding  the  importance  of  high- 
resolution  structure,  site-directed  mutagenesis 
can  be  used  to  delineate  amino  acid  residues  im- 
portant for  active  transport.  Over  100  single- 
amino  acid  replacements  have  been  made  in  the 
permease,  and  about  70  percent  of  the  mutations 
have  no  significant  effect  on  permease  activity. 
Therefore  it  is  unlikely  that  individual  amino  acid 
replacements  result  in  large  conformational  alter- 
ations. Arg302  (putative  helix  IX)  and  Lys319, 
His,  and  Glu325  (all  putative  helix  X)  are  criti- 
cally involved  in  lactose-coupled  transloca- 
tion and/or  substrate  binding  and  recognition. 
Moreover,  molecular  modeling  studies  suggest 
that  the  four  residues  may  be  sufficiently  close  to 
form  an  H-bond  network. 

These  findings — the  specific  transport  proper- 
ties of  the  mutants,  and  construction  of  mutants 
in  which  the  putative  geometric  relationship  be- 
tween the  residues  is  altered — have  led  to  the 
suggestion  that  the  four  residues  may  function  in 
a  type  of  H^-relay  mechanism.  In  contrast,  Cys 
residues,  long  thought  to  play  a  central  role,  are 
not  involved  in  substrate  binding  or  transloca- 
tion. Thus  the  Cys  residues  in  the  permease  can 
be  mutagenized  individually  or  simultaneously, 
and  significant  activity  is  retained. 
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THE  expression  of  immunoglobulin  genes  is 
limited  to  only  one  cell  type  in  the  body, 
namely  B  lymphocytes.  In  addition  to  exhibiting 
this  cell-type  specificity,  the  expression  of  these 
genes  is  temporally  regulated  during  lymphocyte 
development,  initiating  in  the  pre-B  and  B  cell 
stages  and  later  increasing  as  those  cells  differen- 
tiate to  become  mature  plasma  cells.  Results  from 
many  laboratories  have  led  to  the  conclusion  that 
a  major  component  of  this  regulation  occurs  at 
the  level  of  transcription,  the  process  of  creating 
an  RNA  copy  of  the  gene.  Obtaining  a  detailed 
understanding  of  the  transcriptional  control  of 
immunoglobulin  genes  will  aid  in  elucidating 
the  mechanisms  of  transcription  regulation  of 
other  genes.  Thus  the  study  of  immunoglobulin 
gene  transcription  should  shed  light  on  a  variety 
of  transcriptional  control  processes,  including 
those  that  go  awry  and  result  in  deleterious  con- 
sequences (e.g.,  the  transcriptional  activation  of 
oncogenes,  leading  to  cancer) . 

Studies  of  the  structure  of  immunoglobulin 
genes  and  of  the  DNA  sequences  that  influence 
their  rate  of  transcription  in  lymphocytes  indi- 
cate that  there  are  at  least  two  major  transcrip- 
tional regulatory  signals  (elements)  within  the 
genes.  The  first  is  the  promoter,  a  conventional 
DNA  element  located  close  to  the  site  of  tran- 
scription initiation.  The  second  is  an  enhancer, 
which  stimulates  activity  of  the  promoter.  Al- 
though many  enhancers  are  found  close  to  pro- 
moters, the  enhancers  within  the  immunoglobu- 
lin genes  are  located  in  the  central  portion  of  the 
genes,  a  few  thousand  bases  from  the  promoter 
elements. 

Since  the  activities  of  the  immunoglobulin 
promoters  and  enhancers  are  restricted  to  B  lym- 
phocyte cells,  it  has  been  suggested  that  those 
cells  may  uniquely  express  transcription  factors 
(proteins)  that  activate  those  control  elements. 
One  of  our  prime  interests  is  to  understand  the 
mechanisms  underlying  the  B  cell-specific  ex- 
pression of  the  enhancer  located  in  the  immuno- 
globulin heavy-chain  gene. 

Our  previous  work  and  that  of  others  has  led  to 
the  drawing  of  a  detailed  enhancer  map.  This  map 


provides  the  precise  locations  for  the  binding  of 
these  regulatory  proteins  and  additional  informa- 
tion as  to  how  those  proteins  act.  The  enhancer  is 
a  relatively  small  (200  base  pairs),  but  exceed- 
ingly complex,  segment  of  DNA.  Many  of  the  per- 
haps nine  or  more  distinct  proteins  that  bind  the 
enhancer  are  found  in  multiple  cell  types,  even 
in  cells  where  the  enhancer  is  normally  inactive. 
It  is  assumed  that  some  of  these  proteins  act  to 
stimulate  enhancer  activity,  while  others  may 
function  to  repress  it. 

Recently  our  efforts  have  been  directed  toward 
the  cloning  and  functional  characterization  of  the 
genes  that  encode  these  enhancer-binding  pro- 
teins. Thus  far  we  have  cloned  segments  (i.e., 
cDNAs)  that  correspond  to  at  least  six  (possibly 
seven)  distinct  genes  that  encode  enhancer-bind- 
ing proteins.  We  are  using  these  gene  segments  to 
manipulate  and  characterize  the  proteins,  both 
structurally  and  functionally. 

Two  of  the  encoded  proteins  (E2-5  and  TFE3) 
are  involved  in  a  fascinating  transcriptional  regu- 
latory scheme.  In  B  cells  the  situation  is  relatively 
straightforward:  both  E2-5  and  TFE3  bind  the  en- 
hancer and  act  in  concert  to  activate  transcrip- 
tion. In  non-B  cells,  the  situation  is  more  com- 
plex. In  vivo  experiments  suggest  the  presence 
of  an  additional  repressor  protein  that  binds  the 
enhancer  and  precludes  binding  of  E2-5. 

Binding  of  this  putative  repressor  has  two  ef- 
fects. First,  the  enhancer  is  less  active  due  to  the 
absence  of  bound  E2-5  protein.  Second,  the  re- 
pressor has  the  ability  to  attenuate,  at  a  distance, 
the  ability  of  the  TFE3  protein  to  function.  Thus 
the  presence  of  the  repressor  in  non-B  cells  re- 
sults in  the  shutdown  of  both  E2-5-mediated  and 
TFE3-mediated  activation.  These  effects  can  be 
overcome  by  artificially  overproducing  the  E2-5 
protein  in  non-B  cells.  Presumably  this  overex- 
pression  is  sufficient  to  displace  the  bound 
repressor. 

Because  of  its  key  role  in  transcriptional  regula- 
tion, we  are  interested  in  identifying  and  charac- 
terizing the  repressor.  Currently,  however,  we 
have  only  genetic  evidence  that  it  exists.  Our  re- 
cent isolation  of  a  cDNA  that  encodes  yet  another 
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enhancer-binding  protein,  ZEB,  may  solve  that 
problem;  we  are  exploring  the  possibility  that 
ZEB  is  the  repressor.  It  binds  the  same  region  of 
the  enhancer  that  is  bound  by  E2-5  but  does  not 
bind  some  of  the  other  sites  (found  in  other 
genes)  that  are  bound  by  E2-5.  Moreover,  not  all 
sites  bound  by  ZEB  will  bind  E2-5.  This  overlap- 
ping, yet  distinct,  binding  specificity  should  al- 
low us  to  evaluate  whether  ZEB  (or  something 
like  ZEB)  is  the  repressor. 

We  have  demonstrated  that  one  component  of 
the  heavy-chain  enhancer's  cell-type  specificity 
results  from  a  balance  between  positive-  and  nega- 
tive-acting transcription  factors.  In  Drosophila, 
such  protein  gradients  are  thought  to  establish 
transcriptional  repertoires  that  dictate  pat- 
tern formation  during  embryonic  development. 
Hence,  although  our  findings  contradict  the 
model  initially  proposed  that  only  lymphoid 
cells  possess  the  appropriate  positive-acting  fac- 
tors, they  are  not  without  precedent.  In  fact,  it 
appears  that  the  enhancer  uses  both  types  of 
mechanisms  to  mediate  lymphoid  specificity. 
The  protein  Oct- 2  activates  the  enhancer 
(through  its  binding  site)  and  is  found  only  in 


lymphoid  cells.  Moreover,  we  have  recently  iso- 
lated the  gene  for  an  additional  B  cell-restricted 
enhancer-binding  protein.  Experiments  are 
under  way  to  evaluate  this  protein's  role  in  en- 
hancer activity. 

Although  we  are  just  beginning  to  scratch  the 
surface  in  our  understanding  of  transcriptional 
control,  the  immunoglobulin  heavy-chain  en- 
hancer has  served  as  an  important  paradigm  by 
reflecting  its  potential  complexity.  It  is  likely 
that  many  other  genes  employ  such  protein  gra- 
dients, perhaps  combined  with  cell  type- 
specific  factors,  to  mediate  their  transcriptional 
activities.  It  is  now  thought  that  a  group  of  pro- 
teins, all  highly  related  to  E2-5  (encoded  by  the 
same  gene) ,  activates  transcription  of  not  just  im- 
munoglobulin genes  in  B  cells  but  of  muscle- 
specific  and  pancrease-specific  genes  as  well. 
These  very  different  rolesfor  E2-5  may  reflect  its 
ability  to  form  heterodimers  with  a  variety  of 
other  proteins  of  similar  type  but  with  distinct 
modes  of  expression.  This  combinatorial  control 
undoubtedly  serves  to  increase  the  overall  variety 
of  transcriptional  repertoires  available  to  the 
multitude  of  cell  types  within  an  organism. 
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THE  focus  of  our  research  is  the  molecular 
basis  of  human  genetic  diseases  affecting  the 
hematopoietic  cells.  The  two  diseases  we  have 
studied  in  detail  are  sickle  cell  anemia  and  thalas- 
semia. Both  are  the  results  of  abnormal  globin 
production  and  constitute  important  health 
problems  in  the  Mediterranean  region,  Africa,  the 
Middle  East,  and  Asia.  In  the  United  States  these 
disorders  occur  frequently  among  people  of  Afri- 
can, Italian,  Greek,  and  Asian  descent.  We  are  de- 
fining the  mutations  that  give  rise  to  these  defects 
and  devising  DNA  analysis  for  their  detection.  In 
addition,  we  are  studying  the  factors  that  control 
the  expression  of  globin  genes  in  the  red  cell  pre- 
cursors and  the  signals  that  switch  from  fetal  to 
adult  globin  gene  production. 

Previously  we  demonstrated  that  the  common 
genetic  defect  in  a-thalassemia  is  deletion  of  the 
a-globin  structural  gene.  We  also  defined  some  of 
the  molecular  lesions  in  i8-thalassemia.  These 
studies  led  to  our  ability  to  detect  thalassemia  by 
analysis  of  fetal  DNA. 

We  initiated  a  new  method  of  linkage  analysis 
using  restriction  endonucleases  to  detect  poly- 
morphism in  DNA  sequences  and  applied  it  to 
tracing  the  evolution  of  the  sickle  and  thalasse- 
mia mutations.  Restriction  enzyme  site  polymor- 
phism is  now  an  important  tool  for  detecting 
many  genetic  disorders  and  for  mapping  the  ge- 
netic loci  of  many  diseases. 

We  developed  a  method  for  prenatal  diagnosis 
of  sickle  cell  anemia  and  thalassemia.  Initially, 
fetal  blood  samples  were  required.  With  the  ad- 
vent of  recombinant  DNA  technology,  mutations 
in  the  human  genome  can  be  analyzed  directly 
using  DNA  obtained  by  amniocentesis  or  chori- 
onic villus  biopsy,  permitting  early  in  utero  diag- 
nosis of  these  conditions. 

Prenatal  Diagnosis 

Currently  we  are  refining  the  prenatal  diagnos- 
tic tests  to  facilitate  diagnosis  of  /3-thalassemia  in 
those  areas  of  the  world  where  this  disease  is  an 
important  health  problem.  For  these  tests  to  be 


used  routinely,  simple  and  nonradioactive  meth- 
ods are  needed.  The  polymerase  chain  reaction 
(PGR),  which  amplifies  specific  segments  of 
DNA  many  millionfold,  has  facilitated  the  design 
of  rapid  and  nonradioactive  tests  for  sickle  cell 
anemia,  a-thalassemia,  and  (8-thalassemia. 

Although  prenatal  diagnosis  of  sickle  cell  ane- 
mia and  a-thalassemia  is  relatively  simple  be- 
cause the  genetic  defects  responsible  for  them 
are  known  and  are  readily  detectable,  diagnosis 
of  /3-thalassemia  is  more  complex.  Glose  to  100 
different  mutations  have  been  found  to  cause  the 
clinically  important  i8-thalassemia  syndromes.  It 
is  necessary  to  determine  which  mutations  pre- 
dominate in  a  particular  region  so  the  appro- 
priate DNA  test  can  be  chosen,  and  consequently 
we  have  defined  the  common  mutations  in  geo- 
graphic areas  where  thalassemia  is  prevalent  (Sar- 
dinia, other  parts  of  Italy,  Lebanon,  and  Ghina). 
With  this  knowledge,  simple  and  nonradioactive 
tests  are  being  devised  to  detect  the  various  de- 
fects in  these  different  areas. 

Control  of  Globin  Gene  Expression 

We  are  studying  the  factors  that  control  the  tis- 
sue- and  development-specific  expression  of  the 
human  globin  genes.  Synthesis  of  these  chains  is 
precisely  coordinated  during  development.  The 
embryonic  e-  and  f-globin  chains,  which  are  syn- 
thesized in  the  early  embryo,  are  replaced  in  the 
fetus  by  the  a-  and  7-gIobin  chains.  Prior  to  birth, 
the  18-globin  chain  takes  over  from  the  7-globin 
chain  as  the  predominant  globin  chain.  The  fac- 
tors that  control  the  expression  of  the  ^-globin 
gene  in  the  bone  marrow  cells  and  coordinate  the 
developmental  expression  of  the  globin  genes 
during  development  have  not  been  elucidated. 
We  are  now  studying  the  DNA  sequences  and  pro- 
tein factors  that  govern  the  expression  of  these 
genes. 

We  have  defined  some  of  the  sequences  neces- 
sary for  the  tissue-specific  expression  of  the  glo- 
bin genes.  A  DNA  segment  as  short  as  36  base 
pairs  appears  to  enhance  tissue-specific  globin 
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gene  expression.  These  regions  have  been  joined 
to  the  |8-globin  gene  and  are  capable  of  augment- 
ing globin  gene  expression  in  tissue  culture  cells 
and  in  transgenic  mice.  In  addition,  nuclear  ex- 
tracts from  cell  lines  with  erythroid  characteris- 
tics are  being  examined  for  factors  that  bind  to 
putative  control  regions  of  the  globin  genes.  A 
nuclear  factor  from  erythroid  cells  appears  to 
bind  to  a  tandem  repeat  of  a  consensus  recogni- 
tion sequence  for  a  class  of  transacting  factors 
called  API .  We  are  attempting  to  isolate  this  ery- 
throid-specific  API -like  protein.  Our  long-term 
goal  is  to  understand  the  cellular  processes  con- 
trolling the  tissue-specific  expression  of  the  dif- 
ferent globin  genes  and  the  signals  that  control 
the  switch  from  fetal  to  adult  globin.  This  knowl- 
edge may  lead  to  better  approaches  for  correcting 
the  genetic  defects  in  hereditary  hemoglobin 
disorders. 

Red  Cell  Membrane  Disorders 

Many  hereditary  hemolytic  anemias  are  accom- 
panied by  spherocytosis  or  elliptocytosis.  The  de- 
fects in  these  disorders  are  known  to  lie  in  the 
membrane  proteins,  though  the  mutations  were 


not  defined  until  recently.  To  facilitate  the  study 
of  these  groups  of  hemolytic  anemias,  we  have 
isolated  the  gene  encoding  for  protein  4.1.  We 
have  studied  families  with  hereditary  elliptocyto- 
sis due  to  protein  4.1  deficiency  and  have  de- 
fined three  different  defects  due  to  gene  rear- 
rangements: one  results  in  the  absence  of  the 
mature  form  of  protein  4.1  in  the  red  cell,  an- 
other produces  an  elongated  4.1  protein  mole- 
cule due  to  duplication  of  three  exons,  and  the 
third  causes  a  shortened  protein  4 . 1  due  to  a  two- 
exon  deletion. 

We  continue  to  define  the  alternate  splicing 
mechanism  by  which  multiple  isoforms  of  pro- 
tein 4.1  are  generated.  We  have  found  at  least  five 
different  regions  that  undergo  alternate  splicing. 
Antibodies  prepared  against  these  isoforms  are 
being  used  to  study  their  distribution  and  func- 
tion. Some  isoforms  are  more  abundant  in  nu- 
cleated cells,  and  others,  in  mature  red  cells. 
Also,  protein  4.1  is  found  in  many  cell  types  out- 
side of  the  red  blood  cell.  Thus  it  seems  that,  in 
addition  to  its  role  as  a  red  cell  cytoskeletal  pro- 
tein, protein  4.1  serves  other  cellular  functions 
that  need  to  be  defined. 
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LEARNING  is  commonly  divided  into  two  ma- 
jor types,  declarative  and  reflexive.  Declara- 
tive learning  refers  to  the  acquisition  of  informa- 
tion about  persons,  places,  or  things.  Reflexive 
learning  refers  to  the  acquisition  of  procedures 
and  motor  skills. 

In  the  past  our  laboratory  has  focused  primarily 
on  elementary  forms  of  reflexive  learning  as  man- 
ifest in  the  gill-withdrawal  reflex  of  the  marine 
snail  Aplysia.  We  showed  that  this  simple  reflex 
can  be  modified  by  both  nonassociative  and  asso- 
ciative learning,  giving  rise  to  short-  and  long- 
term  memory,  whose  duration  is  a  function  of  the 
number  of  training  trials. 

To  analyze  the  relationship  between  the  short- 
and  long-term  processes  for  nonassociative  learn- 
ing (sensitization),  we  focused  on  one  compo- 
nent of  the  neural  circuit  of  this  reflex — the 
connections  between  the  siphon  sensory  neuron 
and  the  gill  motor  neurons.  Here  we  found  that 
both  the  short-  and  long-term  processes  involve 
an  increase  in  transmitter  release.  Whereas  the 
short-term  process  reflects  enhanced  transmitter 
release  from  preexisting  synaptic  connections 
due  to  covalent  modification  of  preexisting  pro- 
teins, the  long-term  process  results  from  alter- 
ations in  gene  expression  and  the  growth  of  syn- 
aptic connections. 

What  molecular  mechanisms  contribute  to  de- 
clarative forms  of  learning?  In  an  attempt  to  com- 
pare the  biochemical  mechanisms  underlying  re- 
flexive forms  of  learning  with  those  underlying 
declarative  learning,  we  have  turned  to  the  hip- 
pocampus in  the  mammalian  brain. 

Since  Milner's  pioneering  work  in  the  late 
1950s,  the  hippocampus  has  been  known  to  be 
important  for  aspects  of  long-term  declarative 
memory  storage  in  humans  and  other  mammals. 
The  hippocampus  may  be  essential  for  initially 
storing  long-term  memory  for  days  or  weeks  be- 
fore the  memory  trace  is  consolidated  elsewhere, 
perhaps  in  the  cerebral  cortex.  In  1973  Bliss  and 
L0mo  first  demonstrated  that  a  brief,  high- 
frequency  train  of  action  potentials  in  the  perfor- 
ant  path  increases  the  excitatory  synaptic  poten- 


tial in  the  granule  cells.  The  increase  can  last  for 
hours  and,  under  some  circumstances,  even  for 
weeks.  They  called  this  facilitation  long-term  po- 
tentiation, or  LTP.  Later  studies  showed  that  LTP 
occurs  at  each  of  the  three  major  synaptic  path- 
ways in  the  hippocampus. 

Recent  studies  in  the  CAl  region,  by  Tim  Bliss, 
Charles  Stevens  (HHMI,  the  Salk  Institute), 
Richard  Tsien,  and  their  colleagues,  have  pro- 
vided insights  into  the  cellular  mechanisms  in- 
volved in  the  acquisition  and  maintenance  of 
LTP.  Despite  important  differences,  this  form  of 
synaptic  plasticity  in  the  mammalian  brain  may 
bear  certain  similarities  on  the  cellular  level  to 
presynaptic  facilitation  that  accompanies  sensiti- 
zation in  Aplysia.  In  both  cases  synaptic  connec- 
tions are  strengthened  through  an  enhancement 
of  transmitter  release,  and  both  the  short-  and  the 
long-term  changes  occur  at  the  same  synaptic 
locus. 

Therefore,  Tom  O'Dell,  Seth  Grant,  and  I  have 
begun  to  address  the  question.  What  are  the  mo- 
lecular steps  involved  in  LTP?  It  has  been  known 
for  several  years  that  calcium  influx  through  a 
glutamate  receptor  of  the  TV-methyl-o-aspartate 
type  is  critical  for  the  induction  of  LTP.  Less  is 
known,  however,  about  the  subsequent  biochem- 
ical steps  responsible  for  LTP  expression  and 
maintenance.  Several  studies  using  various  kinase 
inhibitors  have  implicated  as  important  events 
following  calcium  influx  the  activation  of  two 
serine-threonine  protein  kinases:  kinase  C  (PKC) 
and  calcium/calmodulin-sensitive  kinase  II 
(CaMK-II).  Consistent  with  the  involvement  of 
PKC  and  CaMK-II  in  LTP,  high  levels  of  these  ki- 
nases are  found  in  hippocampal  pyramidal  cells. 

In  addition  to  these  serine-threonine  kinases, 
hippocampal  neurons  also  express  high  levels  of 
protein  tyrosine  kinase  activity.  Indeed,  the  brain 
is  a  richer  source  of  tyrosine  kinases  than  any 
other  organ,  and  these  kinases  are  particularly 
enriched  in  the  hippocampus.  Subcellular  frac- 
tionation studies  showing  highest  levels  of  pro- 
tein tyrosine  kinase  activity  in  crude  synaptic  vesi- 
cle fractions  have  suggested  a  role  for  tyrosine 


233 


Cell  Biological  Studies  of  Memory 


kinases  in  aspects  of  synaptic  physiology  such  as 
transmitter  release.  Although  the  importance  of 
tyrosine  kinases  for  synaptic  transmission  is  not 
known,  the  above  findings  and  the  demonstrated 
interaction  of  tyrosine  kinases  with  serine-threo- 
nine kinases  in  various  nonneuronal  signal  trans- 
duction pathways  led  us  to  investigate  whether 
protein  tyrosine  kinases  are  involved  in  the  bio- 
chemical events  underlying  LTP. 

A  major  limitation  in  the  study  of  tyrosine  ki- 
nases in  the  brain  has  been  the  lack  of  inhibitors 
of  the  sort  that  have  proved  useful  in  the  study 
of  serine-threonine  kinases.  Recently,  several 
groups  have  developed  two  classes  of  inhibitors. 
One  class  (e.g.,  genistein)  blocks  the  ATP-bind- 
ing  domain,  and  the  other  (tyrophostin)  blocks 
the  substrate-binding  domain.  These  inhibitors 
block  tyrosine  kinase  activity  in  lymphocytes  and 
in  other  nonneural  tissues  at  concentrations  that 
have  a  negligible  effect  on  serine  and  threonine 
kinases.  To  determine  whether  these  inhibitors 
have  similar  specificity  in  the  brain,  O'Dell, 
Grant,  and  I  examined  their  action  on  hippocam- 
pal  kinase  activity.  We  found  that  there,  too,  they 
inhibit  tyrosine  kinase  activity  at  concentrations 
that  have  little  effect  on  serine-threonine  phos- 
phorylation produced  by  kinase  A,  kinase  C,  and 
CaM  kinase. 

We  next  used  four  different  inhibitors  (two 
from  each  class)  to  examine  LTP  produced  in  the 
CAi  neurons.  All  four  blocked  the  induction  of 
LTP.  Moreover,  their  ability  to  block  the  facilita- 
tion was  related  to  their  effectiveness  in  blocking 
tyrosine  kinase  activity.  This  effect  was  restricted 
to  induction;  the  inhibitors  had  no  effect  on  the 


maintenance  phase  of  LTP.  Nor  did  they  have  an 
effect  on  the  amplitude  or  duration  of  post- 
tetanic  potentiation. 

We  also  examined  the  specificity  of  the  tyro- 
sine kinase  inhibitors  electrophysiologically  and 
found  that  several  forms  of  synaptic  plasticity  at- 
tributed to  other  serine  and  threonine  kinases 
were  not  affected.  The  enhanced  firing  (anti- 
accommodation)  produced  by  norepinephrine 
(which  is  simulated  by  cAMP  and  thought  to  be 
mediated  by  kinase  A)  was  not  affected  by  the 
tyrosine  kinase  inhibitor  genistein.  Conversely, 
the  enhanced  firing  (anti-accommodation)  pro- 
duced by  muscarine  (which  is  thought  to  be  me- 
diated by  the  diacylglycerol-IPj  pathway)  is  in- 
hibited by  H-7  but  not  affected  by  genistein. 
Finally,  the  synaptic  facilitation  induced  by  phor- 
bol  esters  and  mediated  by  kinase  C,  which  was 
blocked  by  H-7,  was  not  affected  by  the  inhibi- 
tors. These  data  suggest  that  the  induction  of  LTP, 
or  the  initial  steps  in  its  maintenance,  may  in- 
volve a  tyrosine  kinase.  We  are  now  trying  to  ex- 
plore this  possibility  directly  by  examining  slices 
for  changes  in  tyrosine  phosphorylation  after  in- 
duction of  LTP. 

Given  the  known  functions  of  tyrosine  kinases 
in  inter-  and  intracellular  signaling  processes  that 
modulate  cell  growth,  it  is  notable  that  the  in- 
duction of  hippocampal  LTP,  or  learning  at  other 
synapses,  results  in  growth  and  morphological 
change  of  neurons.  With  the  identification  of  rele- 
vant kinases,  their  activators,  and  their  substrates, 
the  role  of  tyrosine  kinases  in  synaptic  transmis- 
sion and  morphological  changes  accompanying 
learning  could  be  evaluated. 
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HIGHER  animals  are  at  great  risk  of  invasion 
by  foreign  organisms.  In  order  to  avert  de- 
struction by  parasites  such  as  bacteria,  they  have 
developed  complex  protective  mechanisms, 
broadly  termed  the  immune  system.  The  opera- 
tion of  the  system  requires,  first,  that  it  recognize 
an  invading  organism  and,  second,  that  it  mount 
an  effective  response  to  destroy  the  invader. 

Our  laboratory  is  particularly  interested  in  the 
first  process,  the  means  by  w^hich  the  immune 
system  recognizes  foreign  material.  It  does  so  pri- 
marily by  use  of  lymphocytes  (white  blood  cells) 
of  three  different  types:  B  cells,  and  T  cells  bear- 
ing and  76  receptors.  We  have  concentrated 
on  ajS-bearing  T  cells. 

Mice  contain  about  200  million,  and  humans 
about  1  million  million  such  cells.  All  of  the  a/3 
receptors  on  each  cell  are  identical.  From  one 
cell  to  another,  however,  the  a/3  receptors  differ 
in  amino  acid  sequence.  This  is  because,  as  T 
cells  develop,  each  one  chooses  different  compo- 
nents from  which  to  build  its  receptors. 

The  a/3  receptors  are  made  up  of  five  variable 
components:  Va,  Ja,  V(8,  D/3,  and  J/3.  The  develop- 
ing T  cell  chooses  for  its  receptors  1  Va  from  a 
selection  of  about  50.  Likewise,  it  chooses  1  Ja 
from  about  50  Ja's,  1  from  about  50  V/3's,  1  D/3 
from  6  D/3's,  and  1  J/3  from  1 3  J;8's.  Because  each 
developing  cell  chooses,  by  chance,  a  different 
combination  of  Va's,  Ja's,  etc.,  and  because  these 
components  are  joined  together  in  slightly  differ- 
ent ways  by  different  T  cells,  the  T  cells  in  a  given 
animal  can  express  a  very  large  number  of  differ- 
ent receptors,  perhaps  as  many  as  10'°. 

When  an  invading  organism  enters  the  body, 
fragments  of  the  organism  (antigens)  appear  on 
cell  surfaces,  linked  to  one  or  more  of  a  collec- 
tion of  self  proteins,  the  products  of  the  major 
histocompatibility  complex  (MHC).  So  pre- 
sented, the  fragments  can  be  bound  by  T  cells 
bearing  the  right  receptors.  Usually  a  T  cell  has  to 
have  just  the  right  combination  of  Va,  Ja,  V|8,  D/3, 
and  J/3  in  order  to  bind  the  fragments  of  a  particu- 
lar invader.  Consequently,  for  many  such  organ- 
isms, only  a  few  T  cells — perhaps  1  in  100,000 


— can  recognize  that  the  organism  has  arrived  in 
the  body.  These  few  T  cells,  however,  are  stimu- 
lated to  divide  by  their  interaction  with  the  in- 
vader, so  in  a  few  days  they  give  rise  to  many  cells 
able  to  bind  fragments  from  the  organism.  Fi- 
nally, T  cells  differentiate  into  cells  able  to  de- 
stroy the  invader  or  to  stimulate  other  cells  to 
do  so. 

A  few  years  ago  we  discovered  a  collection  of 
foreign  materials  that  we  named  superantigens. 
These  materials  bind  to  MHC  proteins  and  stimu- 
late T  cells,  but  unlike  most  antigens  they  will 
stimulate  almost  any  T  cell  bearing  a  particular 
V/3.  Staphylococcal  enterotoxin  B,  for  example, 
will  stimulate  mouse  T  cells  bearing  the  V;S8's  or 
V/37,  or  human  T  cells  bearing  V/33.  Because  there 
is  a  reasonably  large  percentage  of  T  cells  in  mice 
bearing  the  V/38's  and  V)87,  or  in  humans  bearing 
V/33,  these  superantigens  stimulate  a  very  large 
percentage  of  T  cells  at  once,  perhaps  as  many  as 
10  percent  of  all  cells  in  a  given  individual.  This 
gross  T  cell  stimulation  causes  overactivation  of 
the  cells,  massive  release  of  T  cell  mediators,  and 
consequent  illness. 

A  number  of  organisms  make  superantigens. 
Staphylococci  and  streptococci  are  among  the 
bacteria  that  produce  them.  Superantigens  are 
also  made  by  mycoplasma  and  by  mice  them- 
selves. We  have  been  studying  how  these  super- 
antigens bind  to  MHC  and  to  the  V/3  components 
of  the  T  cell  receptors.  Our  data  show  that  the 
superantigens  made  by  mice,  and  by  staphylo- 
cocci, bind  to  V/3  at  an  exposed  face  of  the  mole- 
cule well  away  from  the  part  of  the  T  cell  recep- 
tor that  is  thought  to  bind  to  conventional 
antigens  plus  MHC.  There  are  also  some  indica- 
tions that  the  staphylococcal  toxins  bind  to  some 
human  MHC  proteins  better  than  others.  Since 
human  beings  differ  widely  in  the  sequences  of 
the  MHC  proteins  they  make,  perhaps  this  is  why 
some  individuals  get  food  poisoning  when  they 
eat  staphylococcus-contaminated  egg  salad,  for 
example,  whereas  others  at  the  same  picnic  eat 
the  same  egg  salad  without  getting  sick. 

We  have  also  been  studying  the  structures  on 
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the  bacterial  toxins  that  cause  the  toxins  to  bind 
to  MHC  and  V(8's.  So  far  we  have  identified  three 
sites,  one  that  seems  to  be  concerned  with  bind- 
ing to  MHC  and  two  that  seem  to  be  involved  with 
VjS  binding. 

Recently  we  identified  the  superantigen  mole- 
cules made  by  mice.  These  turn  out  to  be  the 
products  of  viruses  that,  over  the  course  of  evolu- 
tion, have  become  integrated  into  mouse  DNA.  A 


similar  virus  is  transmitted  from  mice  to  their  ba- 
bies in  milk,  causing  breast  cancer.  It  is  likely  that 
T  cell  stimulation  via  the  viral  superantigen  is  an 
essential  middle  step  in  mouse  breast  cancer  de- 
velopment. This  odd  interaction  between  vi- 
ruses, T  cells,  and  various  seemingly  unrelated 
diseases  may  not  be  unique  to  mice.  Currently  we 
are  trying  to  find  out  whether  a  similar  scenario 
applies  to  some  viruses  in  humans. 
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VISION  is  accomplished  through  the  integra- 
tion of  signals  received  by  the  eye  and  com- 
municated to  the  brain.  The  neurons  of  the  eye, 
or  photoreceptor  cells,  elaborate  long  processes 
called  axons  that  connect  in  specific  patterns 
with  central  target  areas  in  the  brain  to  generate  a 
map  of  the  visual  world.  We  are  interested  in  un- 
derstanding the  mechanisms  by  which  photore- 
ceptor cells  are  specified  and  connect  with  their 
targets  during  development. 

The  eye  of  the  fruit  fly  Drosophila  melanogas- 
ter  has  proved  to  be  a  particularly  suitable  sub- 
ject for  studies  of  this  nature.  Unlike  the  mamma- 
lian eye,  the  fly  eye  is  a  compound  structure 
composed  of  repeating  units  called  ommatidia, 
which  each  contain  eight  photoreceptor  cells. 
Six  of  these  cells,  Rl-6,  send  axon  bundles  into 
the  optic  lobe  (a  part  of  the  brain  specialized  for 
visual  processing)  that  terminate  in  the  first  syn- 
aptic zone,  the  lamina.  The  central  two  photore- 
ceptors, R7  and  R8,  send  their  axons  past  the  lam- 
ina into  a  second  zone,  the  medulla,  where  they 
form  their  connections  in  two  distinct  layers.  Our 
eff'orts  have  focused  on  understanding  the  genera- 
tion of  connections  by  the  R7  and  R8  cell  types. 

To  visualize  the  photoreceptor  axons,  we  fill 
those  originating  from  any  selected  population  of 
ommatidia  with  a  dye  (or  an  enzyme  that  can  be 
viewed  by  its  reaction  product) .  The  pathway  and 
termination  zone  of  each  R7  and  R8  axon  can  be 
scored  individually,  and  the  path  followed  dur- 
ing development  can  be  reconstructed.  By  visual- 
izing the  axons  in  this  manner,  those  that  have 
followed  unusual  routes  or  terminated  in  incor- 
rect locations  can  be  identified. 

We  have  been  studying  a  mutation,  nac  (neu- 
rally  altered  carbohydrate),  that  causes  a  cold- 
sensitive  disruption  of  the  diff'erentiation  of  the 
fly's  photoreceptor  cells  and  other  selected  neu- 
rons. We  have  shown  that  this  mutation  affects 
the  expression  of  a  neural-specific  glycan  in  all 
neurons  of  the  fly,  although  mutant  flies  are  via- 
ble and  show  only  selective  defects  in  sensory 


behavior.  When  nac  photoreceptor  cells  are 
filled  with  dye,  the  axons  of  R7  and  R8  can  be 
seen  to  enter  the  medulla  correctly,  but  they 
show  increasing  defects  as  they  approach  their 
terminal  zones.  Some  axons  miss  their  targets  al- 
together. Conversely,  small  groups  of  ectopic 
fibers  have  been  seen  to  separate  from  the  main 
axon  bundles  and  take  unorthodox  routes  across 
the  medulla,  nonetheless  arriving  in  their  correct 
terminal  zones.  These  behaviors  are  likely  to  re- 
sult from  both  an  altered  terrain  and  an  alteration 
on  the  surface  of  the  photoreceptor  axons. 

To  identify  additional  mutations  that  might  af- 
fect these  processes  more  specifically,  we  com- 
bined our  axon-filling  technique  with  a  genetic 
strategy.  Mutations  were  generated  by  the  in- 
sertion of  a  mobile  piece  of  DNA,  called  a  trans- 
posable  element.  When  the  element  inserts  into 
a  gene,  it  inactivates  it,  causing  a  mutant 
phenotype. 

The  expression  of  the  mutated  gene  can  be  in- 
ferred from  the  expression  of  a  reporter  gene 
carried  by  the  transposable  element,  which  ap- 
pears to  be  regulated  by  the  same  controls  that 
tell  the  endogenous  gene  where  and  when  to  be 
expressed.  We  first  selected  stocks  of  flies  in 
which  the  reporter  gene  was  expressed  in  the  de- 
veloping photoreceptor  cells  or  in  their  target 
cells  in  the  optic  lobe.  We  then  asked  if  the  inser- 
tion of  the  mobile  element  had  caused  a  mutation 
that  affected  the  diff'erentiation  of  the  photore- 
ceptor cells  or  the  ability  of  their  axons  to  reach 
the  correct  target  sites  in  the  brain.  Several  inter- 
esting mutations  have  been  isolated  in  this 
screen. 

The  manner  in  which  these  mutations  were 
generated — by  the  insertion  of  a  marked  piece  of 
DNA — has  furthered  genetic  and  molecular  stud- 
ies now  in  progress.  In  addition,  we  are  analyzing 
these  mutant  phenotypes  with  independently 
generated  markers  that  highlight  various  portions 
of  the  axon  pathways  to  study  the  rules  that  allow 
the  generation  of  a  visual  map. 
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OUR  continuing  goal  is  to  understand  how  the 
essentially  one-dimensional  linear  informa- 
tion encoded  in  the  DNA  molecule  is  elaborated 
into  a  three-dimensional  organism  during  the  pro- 
cess of  development.  As  a  model  system  in  this 
analysis,  we  are  using  the  fruit  fly  Drosophila 
melanogaster  and  are  concentrating  on  a  group 
of  adjacent  genes  in  a  small  region  of  one  chro- 
mosome arm.  Mutations  in  these  homeotic  genes 
transform  one  organ  of  the  fly  embryo,  larva,  or 
adult  into  a  homologous  structure.  For  example, 
mutations  in  the  Antennapedia  gene  transform 
the  antenna  of  the  adult  fly  into  a  leg,  while  muta- 
tions in  the  proboscipedia  gene  transform  the 
mouthparts  into  legs. 

A  combined  genetic  and  developmental  analy- 
sis of  this  region  of  the  Drosophila  chromosome 
has  revealed  the  presence  of  five  such  genes  that 
are  collectively  referred  to  as  the  Antennapedia 
gene  complex  (ANT-C) .  These  five  genes  collec- 
tively specify  the  proper  identity  of  the  anterior 
end  of  the  fruit  fly  and  can  be  viewed  as  develop- 
mental switches  that  make  either/or  decisions  in 
cell  fate  at  specific  times  and  places  in  the  em- 
bryo and  larva. 

Recombinant  DNA  technology  has  allowed  the 
cloning  of  the  genes  and  has  revealed  the  nature 
of  their  protein  products.  All  of  the  homeotic 
genes  contain  a  specific  DNA  sequence  that  en- 
codes a  common  protein  motif  called  the  homeo- 
box.  It  has  been  shown  that  this  motif  acts  as  a 
DNA-binding  domain  and  that  these  homeotic 
proteins  are  associated  with  the  chromosomes  in 
the  nuclei  of  the  developing  fly.  Therefore  we 
now  know  that  these  homeotic  genes  act  as  devel- 
opmental switches  because  their  protein  prod- 
ucts serve  to  regulate  directly  the  expression  of 
other  genes.  What  we  do  not  fully  understand  is 
how  these  homeotic  genes  are  so  elegantly  or- 
chestrated in  their  proper  spatiotemporal  pat- 
terns and  what  is  the  nature  of  the  battery  of  genes 
that  are  in  turn  regulated  by  the  protein  products 
of  the  five  members  of  the  ANT-C.  Our  current 
research  is  focused  on  these  two  questions. 

We  are  using  three  of  the  five  genes  in  the  com- 
plex— labial  (Jab) ,  proboscipedia  (pb) ,  and  Sex 


combs  reduced  (5cr) — in  our  attempts  to  under- 
stand spatiotemporal  patterning.  These  were 
chosen  because  of  the  unique  properties 
each  displayed  during  our  initial  characteriza- 
tion of  their  respective  roles  in  Drosophila 
development. 

The  labial  Gene 

Using  a  minigene  construct  and  P-element-me- 
diated  transformation,  we  have  been  able  to  ame- 
liorate completely  the  embryonic  defects  asso- 
ciated with  lab  deficiency;  however,  adult 
transgenic  animals  show  severe  deformities,  with 
thoracic  structures  replacing  portions  of  the  head 
capsule.  We  have  now  shown  that  this  defect  is 
caused  not  by  a  failure  of  lab  activity  in  the  devel- 
oping adult  head  but  by  the  ectopic  expression  of 
the  minigene  in  the  head  anlagen,  which  is  also 
associated  with  a  failure  to  express  the  Deformed 
(Dfd)  and  Scr  loci  in  their  normal  pattern  in  this 
same  tissue.  Thus  it  appears  that  lab  can  act  as  a 
transregulator  of  these  other  homeotic  members 
of  the  ANT-C.  This  is  in  marked  contrast  to  our 
observation  that  cross-regulatory  interactions  do 
not  take  place  among  any  of  the  ANT-C  homeotic 
loci  in  the  embryo.  It  would  appear  therefore  that 
the  regulatory  hierarchy  in  the  embryo  and  adult 
stages  is  difi'erent. 

The  fact  that  the  minigene  shows  ectopic  ex- 
pression only  in  a  lab~  background  points  up  an- 
other regulatory  phenomenon.  Since  the  resident 
lab  gene  is  able  to  prevent  ectopic  expression, 
the  native  protein  appears  to  be  able  to  influence 
the  expression  of  the  minigene  negatively:  i.e., 
lab  shows  autogenous  negative  regulation.  The 
product  of  the  minigene,  however,  does  not  per- 
form this  function.  In  the  construction  of  the 
minigene  we  used  a  cDNA  fragment  that  did  not 
include  a  minor  alternate  splice  form.  This  alter- 
nate RNA  product  would  produce  a  protein  prod- 
uct that  is  six  amino  acids  longer  than  the  single 
protein  encoded  by  the  minigene.  The  resident 
gene  can,  of  course,  make  both  proteins.  These 
observations  lead  to  the  exciting  possibility  that 
the  longer  protein  product  is  responsible  for  the 
negative  regulation  and  that  the  short  form  is  inca- 
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pable  of  this  activity.  We  are  currently  testing  this 
hypothesis  in  transgenic  animals. 

The  proboscipedia  Gene 

As  is  the  case  for  the  lab  locus,  we  have  con- 
structed a  minigene  of  the  pb  transcription  unit, 
returned  it  to  flies,  and  alfected  a  rescue  of  the 
adult  mouthpart-to-leg  transformation.  However, 
unlike  lab  we  were  unable  to  identify  sequences 
upstream  of  the  start  of  transcription  that  were 
sufficient  to  drive  expression  of  a  |8-galactosidase 
reporter  gene.  We  have  now  found  the  fragments 
necessary  for  a  pb  pattern  of  expression  in  the 
major  intron  of  the  gene.  Specifically,  a  2-kb  frag- 
ment from  the  intron  in  combination  with  7  kb  of 
upstream  sequence  is  sufficient  to  drive  reporter 
expression  in  a  pb  pattern.  The  intronic  regula- 
tory elements  are  evolutionarily  highly  con- 
served, and  identical  DNA  sequences  can  be 
found  in  the  same  position  in  the  pb  loci  of  two 
other  distantly  related  Drosophila  species.  We 
are  currently  defining  the  extent  of  the  pb  pro- 
moter through  a  deletion  analysis  of  the  appro- 
priate sequences  in  both  the  minigene  and  re- 
porter constructs.  Moreover,  we  are  testing  the 
ability  of  the  homologous  sequences  from  the 
two  related  flies  to  drive  pb  expression  in  D. 
melanogaster. 

The  Sex  combs  reduced  Gene 

We  have  shown  that  Scr  locus  expression  in  the 
adult  is  regulated  by  a  transvection-like  mecha- 
nism. Normally  5cr  protein  is  accumulated  in  the 
labial  and  first  thoracic  imaginal  discs.  Transvec- 
tion  mutants  also  express  the  gene  in  the  second 
and  third  thoracic  segments.  This  deregulation 


occurs  by  a  misexpression  of  the  copy  of  the  gene 
held  in  trans  to  the  chromosome  that  bears  the 
mutant  allele.  To  locate  the  sequences  that  re- 
spond to  the  transvection  signal,  we  have  cloned 
portions  of  the  Scr  regulatory  DNA  into  appro- 
priate reporter  constructs.  These  are  now  in  flies, 
and  we  are  in  the  process  of  determining  their 
expression  patterns  in  both  normal  and  transvec- 
tion-inducing  mutant  backgrounds. 

Regulated  Ectopic  Expression  of  the 
ANT-C  Homeotics 

We  have  built  constructs  in  which  each  of  the 
ANT-C  homeotic  loci  are  driven  by  the  hsp70 
promoter  and  have  determined  the  effects  of 
global  expression  of  each  of  these  genes  on  mor- 
phology and  the  expression  of  the  other  home- 
otic  loci.  Now  we  are  using  a  two-element  system 
developed  in  the  laboratory  of  Norbert  Perrimon 
(HHMI,  Harvard  Medical  School)  to  fine-tune  this 
approach.  The  promoter  elements  of  lab  and  pb 
have  been  cloned  in  front  of  the  yeast  GAL4  struc- 
tural gene.  Additionally  cDNAs  and  genomic  frag- 
ments from  all  five  ANT-C  homeotics  have  been 
cloned  downstream  of  a  promoter  that  is  regu- 
lated by  the  GAL4  protein.  These  constructs  will 
be  transformed  separately  into  flies.  It  will  then 
be  possible  to  express  each  of  the  homeotic  struc- 
tural genes  in  the  domains  of  the  lab  and  pb  pro- 
moters by  crossing  flies  containing  either  of  the 
promoter-GAL4  constructs  to  animals  carrying 
each  of  the  receptor- homeotic  chimeras.  These 
constructs  and  crosses  will  allow  us  to  determine 
if  the  precise  ectopic  expression  of  one  homeotic 
gene  has  any  effect  (interference  or  ameliora- 
tion) on  the  locus  normally  expressed  in  that  spa- 
tiotemporal  environment. 
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Images  (top  and  side  views)  of  the  x-ray  crystal  structure  of  a  peptide  corre- 
sponding to  the  leucine  zipper  of  the  yeast  transcriptional  activator  GCN4, 
determined  at  1.8  A  resolution,  in  collaboration  with  Thomas  Alber's  research 
group  at  the  University  of  Utah  School  of  Medicine.  The  leucine  zipper  folds  as  a 
parallel  coiled  coil,  with  two  helices  supercoiled  around  each  other.  The  cross- 
ing angle  of  the  helices  is  about  20° . 
Research  of  Peter  S.  Kim. 
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INFORMATION  transfer  in  biology  generally 
proceeds  from  DNA  to  RNA  (transcription)  and 
then  from  RNA  to  protein  (translation) .  The  lin- 
ear, unfolded  protein  chains  made  during  transla- 
tion must  fold  into  a  three-dimensional  shape  to 
be  functional.  Although  the  basic  mechanisms  of 
transcription  and  translation  are  understood,  at 
least  in  outline,  the  transfer  of  information  from 
one  to  three  dimensions — i.e.,  protein  folding — 
remains  a  major  unsolved  problem  in  molecular 
biology.  To  understand  protein  folding  is  a  prime 
objective  of  this  laboratory. 

A  second  effort  is  aimed  at  understanding  the 
principles  of  macromolecular  recognition:  spe- 
cific protein-protein  interactions  and  interac- 
tions between  protein  molecules  and  DNA.  These 
interactions  are  central  to  much  of  molecular 
physiology  and  developmental  biology.  We  have 
focused  on  a  structural  motif  called  the  leucine 
zipper,  v^hich  occurs  in  several  different  DNA- 
binding  proteins,  including  the  products  of  some 
nuclear  oncogenes. 

Protein  Folding 

Much  of  our  work  in  this  area  is  centered  on 
bovine  pancreatic  trypsin  inhibitor  (BPTI) ,  argu- 
ably the  protein  most  thoroughly  characterized 
in  biophysical  terms.  In  particular,  nuclear  mag- 
netic resonance  (NMR)  assignments  have  been 
made  for  essentially  every  proton  in  BPTI  by  Kurt 
Wiithrich  and  co-workers  (Eidgenossische  Tech- 
nische  Hochschule,  Switzerland),  and  several 
high-resolution  crystal  structures  of  the  protein 
are  available.  In  addition,  Thomas  Creighton  (Eu- 
ropean Molecular  Biology  Organization,  Ger- 
many) has  characterized  the  folding  of  BPTI  in 
terms  of  disulfide  bond  formation. 

It  is  difficult  to  determine  the  structures  of 
protein  folding  intermediates  because  protein 
folding  is  a  cooperative  process.  Indeed,  a  high- 
resolution  structure  has  not  yet  been  determined 
for  any  protein  folding  intermediate.  Trapped  di- 
sulfide-bonded  intermediates,  such  as  those  iden- 
tified in  the  early  folding  steps  of  BPTI,  are  often 
rather  insoluble;  this  hinders  detailed  structural 
characterization  by  NMR.  We  have  developed  a 


peptide  model  approach  that  circumvents  the 
problem  of  cooperativiry  and  improves  solubil- 
ity, so  that  the  structures  contained  within  pro- 
tein folding  intermediates  can  be  characterized 
in  detail. 

Peptide  models  that  simulate  two  crucial  early 
intermediates  in  the  folding  of  BPTI  have  been 
designed  and  synthesized  chemically.  By  using 
two-dimensional  NMR,  we  find  that  the  struc- 
tures contained  within  these  peptide  models  are 
remarkably  native-like,  corresponding  to  subdo- 
mains  of  BPTI.  It  appears  that  much  of  the  folding 
pathway  of  BPTI  can  be  explained  by  the  forma- 
tion of  native-like  subdomains  in  these  two  inter- 
mediates. If  our  results  are  general,  and  native- 
like subdomains  turn  out  to  be  key  determinants 
of  protein  folding,  then  solving  the  protein  fold- 
ing problem  might  be  reduced  in  large  part  to 
identifying  and  understanding  subdomains  of  na- 
tive proteins. 

In  addition  to  peptide  models,  we  are  studying 
recombinant  BPTI  molecules  that  contain  a  sub- 
set of  the  native  cysteine  residues.  We  are  testing 
our  understanding  of  the  folding  pathway  by 
making  mutant  BPTI  molecules  that  should  alter 
it  and  are  developing  new  methodologies  for 
studying  it.  The  overall  goal  is  to  learn  in  detail 
how  this  small  protein  folds,  in  structural,  ther- 
modynamic, and  kinetic  terms. 

Theoretical  attempts  to  model  the  protein  fold- 
ing process  from  first  principles  are  severely  hin- 
dered because  of  the  enormous  number  of  inter- 
actions to  consider  and  because  the  calculations 
must  necessarily  be  very  accurate  (the  stability  of 
most  proteins  is  determined  by  a  tiny  difference 
between  large  energies  favoring  and  opposing 
folding).  One  of  the  major  complications  is  sol- 
vent: interactions  between  water  and  protein  mol- 
ecules, and  between  water  molecules  them- 
selves, are  numerous  and  difficult  to  calculate 
accurately.  We  are  therefore  trying  to  study  pep- 
tides in  the  gas  phase.  Using  a  relatively  new 
method  called  laser  desorption,  we  are  putting 
peptides  into  the  gas  phase  for  study  by  optical 
measurements — e.g.,  fluorescence  and  circular 
dichroism.  Experimental  studies  of  the  structure 
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and  dynamics  of  short  peptides  in  isolated,  well- 
defined  conditions  such  as  the  gas  phase  will  pro- 
vide a  valuable  bridge  to  theoretical  simulations. 

Macromolecular  Recognition 

In  this  area,  we  have  focused  on  the  leucine 
zipper  class  of  DNA-binding  transcriptional  acti- 
vator proteins,  originally  identified  by  Steven 
McKnight  and  co-workers  (HHMI,  the  Carnegie 
Institution) .  The  leucine  zipper  regions  of  these 
proteins  are  important  for  homodimer  or  specific 
heterodimer  formation. 

Our  approach  in  this  work  is  to  use  "protein 
dissection."  GCN4,  a  homodimeric  transcription 
factor,  serves  as  a  prototype  protein.  A  synthetic 
peptide  corresponding  to  the  33-residue  leucine 
zipper  region  folds  as  a  parallel  pair  of  helices. 
This  led  us  to  propose  that  leucine  zippers  are 
actually  short  coiled  coils.  Recent  x-ray  crystallo- 
graphic  studies  of  this  peptide  (with  Tom  Alber's 
group,  University  of  Utah)  confirm  that  the  leu- 
cine zipper  of  GCN4  is  a  coiled  coil  and  provide 
the  first  high-resolution  structure  of  a  two- 
stranded  parallel  coiled  coil.  We  are  using  NMR 
to  investigate  the  dynamics  of  the  leucine  zipper 
dimer  by  measuring  amide  proton  exchange 
rates.  Combinatorial  mutagenesis  is  being  used 


(with  Robert  Sauer's  group,  Massachusetts  Insti- 
tute of  Technology)  to  investigate  the  sequence 
requirements  for  dimerization  by  this  leucine 
zipper. 

Proper  biological  function  requires  that  recog- 
nition between  many  different  macromolecules 
in  the  cell  occurs  with  exquisite  specificity.  We 
have  found  that  the  isolated  leucine  zipper  re- 
gions from  the  nuclear  oncogene  products  Fos 
and  Jun  are  sufficient  to  mediate  specific  hetero- 
dimer formation.  This  provides  a  very  simple 
model  system  for  studying  the  specificity  of  pro- 
tein-protein interactions:  two  helices  that  prefer 
to  interact  with  each  other  rather  than  with  them- 
selves. We  are  investigating  the  mechanism  and 
structural  basis  of  this  specificity  in  detail. 

A  region  of  GCN4  rich  in  basic  amino  acid  resi- 
dues, immediately  adjacent  to  the  leucine  zipper, 
is  involved  in  DNA  recognition.  We  find  that  this 
basic  region  by  itself,  when  dimerized  via  a  flexi- 
ble disulfide  linker  in  place  of  the  leucine  zip- 
per, is  also  capable  of  sequence-specific  DNA 
binding.  In  addition  to  simplifying  structural 
analysis  of  this  new  DNA-binding  motif,  the  find- 
ing provides  a  new  strategy  for  the  design  of  DNA- 
binding  peptides. 
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FOR  numerous  viruses  and  other  subcellular 
parasites,  RNA  rather  than  DNA  is  the  mole- 
cule used  for  storage  and  transmission  of  genetic 
information.  We  are  interested  in  the  genetic  and 
biochemical  implications  of  this  fact  for  a  virus, 
and  in  any  mechanistic  similarities  or  differences 
in  genetic  processes  between  organisms  with 
RNA  and  DNA  genomes.  In  addition,  we  are  ex- 
ploring the  mechanisms  of  RNA  packaging,  RNA 
replication,  and  RNA  recombination  in  the  ge- 
nome of  poliovirus  and  other  viruses.  We  are  also 
interested  in  the  interactions  between  viruses  and 
their  host  cells,  especially  in  the  area  of  RNA- 
protein  biochemistry. 

Many  of  our  genetic  studies  utilize  poliovirus, 
a  small  icosahedral  virus  with  an  RNA  genome 
only  7,500  nucleotides  in  length.  We  have 
shown,  for  example,  that  RNA  recombination 
among  poliovirus  genomes  occurs  at  sufficient 
frequency  that  1  out  of  every  25  is  a  recombinant. 
In  contrast  to  the  breaking  and  joining  of  preex- 
isting molecules  that  leads  to  DNA  recombina- 
tion, RNA  recombination  occurs  during  RNA  syn- 
thesis. This  results  from  the  switching  of  parental 
template  RNAs  by  RNA  polymerase.  We  are  devel- 
oping an  in  vitro  RNA  recombination  system  with 
a  view  to  asking  more  detailed  mechanistic  ques- 
tions. For  example,  can  the  RNA  replication  pro- 
teins switch  RNA  strands  themselves  or  are  other 
proteins  required?  How  do  the  RNA  strands  in- 
volved in  a  recombination  event  find  each  other 
and  align  in  the  proper  way?  Can  RNA  polymer- 
ases from  RNA  viruses  other  than  poliovirus  also 
accomplish  recombination? 

Further  investigation  into  the  incidence  of  RNA 
recombination  among  other  RNA  genomes  be- 
sides that  of  poliovirus  will  be  facilitated  by  the 
use  of  physical  rather  than  genetic  assays.  We  are 
using  the  polynucleotide  chain  reaction  to  screen 
the  progeny  of  crosses  of  various  RNA  viruses, 
phage,  and  subviral  parasites  that  are  not  amena- 
ble to  direct  genetic  analysis.  We  hope  to  in- 
crease understanding  of  the  prevalence  and  mech- 
anism of  genetic  recombination  among  RNA 
genomes,  a  process  that  is  certainly  responsible 


for  much  of  the  variability  and  rapid  evolution  of 
RNA  viruses. 

Using  x-ray  crystallography,  Jim  Hogle  at 
Scripps  Clinic  has  determined  the  three-dimen- 
sional structure  of  the  poliovirion.  However,  an 
appreciation  of  functional  interactions  between 
the  viral  RNA  and  the  virion  proteins  calls  for  the 
application  of  genetics  as  well  as  structural  bio- 
chemistry. We  do  not  know,  for  example,  exactly 
which  subviral  protein  particles  package  the 
virus  RNA  into  the  final  virion  structure,  nor  do 
we  know  the  structural  requirements  of  the  RNAs 
and  proteins  participating  in  the  packaging  reac- 
tions. Is  the  viral  RNA  threaded  into  an  intact, 
preformed  icosahedral  capsid,  or  do  smaller  parts 
of  the  capsid  condense  around  the  viral  RNA  to 
form  the  final  icosahedral  structure? 

We  have  constructed  several  mutants  in  the  po- 
liovirus RNA  genome  and  have  characterized 
them  in  great  detail.  Two  of  these  have  pointed 
out  a  region  of  the  viral  capsid,  quite  internal  to 
the  virion,  that  is  involved  both  in  RNA  packaging 
and  RNA  uncoating. 

We  are  examining  the  RNA-binding  properties 
of  intact  empty  particles  and  smaller  subviral  par- 
ticles from  both  mutant  and  wild-type  poliovirus- 
infected  cells.  We  hope  to  find  a  correlation  be- 
tween the  mutant  defects  in  RNA  packaging  and  a 
defect  in  the  binding  of  one  of  these  particles  to 
RNA  in  vitro.  Such  a  correlation  would  certainly 
strengthen  the  idea  that  the  particle  is  an  inter- 
mediate in  viral  assembly,  possibly  directly  re- 
sponsible for  packaging  the  viral  RNA. 

We  are  also  using  in  vitro  RNA-binding  assays 
to  examine  the  sequence  specificity  of  the  viral 
RNA  binding  to  the  subviral  protein  particles  that 
are  candidate  intermediates  in  viral  assembly,  to 
understand  any  RNA  structure  or  sequence  speci- 
ficity of  viral  packaging. 

To  investigate  the  role  of  host  cells  in  the  prop- 
agation of  the  genomes  of  RNA  parasites,  we  are 
extending  the  study  of  RNA  genetics  to  yeast. 
Yeast  cells,  unlike  the  primate  cells  in  which  po- 
liovirus and  other  RNA  viruses  of  medical  interest 
are  propagated,  are  amenable  to  elegant  genetic 
analysis,  making  it  possible  to  identify  quite 
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quickly  the  cellular  molecules  that  are  involved 
in  any  given  process.  Virus-like  particles  contain- 
ing double-stranded  RNA  genomes  replicate  in 
the  cytoplasm  of  yeast  cells  in  large  numbers. 
Their  genetic  analysis,  however,  will  depend  on 
our  ability  to  make  defined  mutations  in  their 
RNA  genomes.  To  this  end,  we  are  constructing 
perfect  DNA  replicas  of  these  RNA  genomes. 

It  has  been  necessary  to  develop  the  technol- 
ogy of  introducing  RNA  directly  into  yeast  cells. 


Using  RNA  molecules  encoding  the  luciferase 
protein  of  fireflies,  we  have  been  able  to  demon- 
strate the  direct  introduction  of  RNA  into  yeast, 
and  to  determine  how  best  to  transfect  RNA  viral 
genomes  into  yeast  cells.  By  introducing  into 
cells  mutant  RNA  viral  genomes,  we  will  be  able 
to  study  any  mutation-induced  defects  in  the  re- 
sulting infection.  And,  by  studying  the  effects  of 
yeast  mutants,  we  will  be  able  to  assess  contribu- 
tions of  the  host  cell  to  the  infectious  process. 
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THE  autonomic  nervous  system  serves  as  the 
master  control  center  for  the  cardiovascular 
system.  It  monitors  the  effectiveness  of  the  latter 
system  in  providing  nutrients  and  oxygen  to  the 
rest  of  the  body  and  appropriately  adjusts  the 
heart  rate,  blood  pressure,  and  blood  flow.  These 
adjustments  are  made  via  nerves  that  serve  the 
heart,  blood  vessels,  and  kidneys. 

Adrenergic  receptors  form  the  interface  be- 
tween these  nerves  (of  the  sympathetic  sub- 
system) and  the  organs  they  innervate.  Cate- 
cholamines released  from  sympathetic  nerve 
terminals  bind  to  adrenergic  receptors  on  the  sur- 
face of  target  cells,  and  the  activated  receptors 
modify  the  function  of  these  cells. 

When  a  catecholamine  occupies  its  binding 
site,  the  receptor  activates  a  GTP-binding  protein 
(G  protein)  inside  the  cell.  The  activated  G  pro- 
tein may  then  modulate  the  activity  of  a  cellular 
enzyme  or  ion  channel.  The  genes  (or  corre- 
sponding cDNAs)  for  eight  types  of  adrenergic 
receptors  have  been  cloned.  These  receptors 
have  different  functional  properties  and  play  dif- 
ferent roles  in  the  sympathetic  nervous  system. 

All  of  these  receptors,  however,  are  struc- 
turally similar,  having  seven  hydrophobic  do- 
mains that  are  thought  to  be  membrane  spanning. 
Mutagenesis  studies  have  revealed  that  the  hydro- 
phobic domains  are  involved  in  forming  the  cate- 
cholamine-binding  pocket.  The  cytoplasmic  do- 
mains, which  are  hydrophilic,  are  involved  in 
activating  G  proteins  and  in  desensitizing  the 
receptors. 


Adrenergic  Receptor  Structure 

A  major  focus  in  my  laboratory  is  to  learn  more 
about  the  three-dimensional  structure  of  adrener- 
gic receptors  and  to  determine  how  they  transmit 
signals  across  the  cell  membrane's  lipid  bilayer. 
We  are  taking  several  approaches  to  study  the  re- 
ceptor structure.  Mutagenesis  studies  are  identi- 
fying amino  acid  residues  involved  in  binding 
subtype-specific  ligands.  These  studies  will  help 
to  define  the  boundaries  of  the  ligand-binding 


pocket  and  possibly  provide  insight  into  the  dif- 
ferences between  agonist  and  antagonist  binding. 

A  long-range  goal  in  my  laboratory  is  to  charac- 
terize the  three-dimensional  structure  of  the  182' 
adrenergic  receptor  and  to  understand  how  the 
structure  changes  during  signal  transduction. 
Our  current  efforts  are  focused  on  developing 
methods  to  produce  large  quantities  of  pure, 
functional  receptor  protein  for  biochemical  and 
biophysical  studies.  We  are  attempting  to  in- 
crease production  of  ^2  receptor  and  to  improve 
the  purification  procedure. 

Efforts  include  designing  expression  vectors 
and  determining  the  optimum  conditions  for 
growth  of  tissue  culture  cells.  Efforts  at  im- 
proving the  efficiency  of  purification  involve  de- 
veloping new  ligand  affinity  resins  and  making 
recombinant  receptors  with  minor  structural 
modifications  that  facilitate  purification. 

Receptor  Biosynthesis 

The  primary  amino  acid  sequence  of  a  receptor 
contains  all  of  the  essential  information  needed 
for  the  receptor's  proper  folding,  post-transla- 
tional  processing,  and  cellular  targeting.  Under- 
standing the  process  by  which  receptors  are 
folded  and  processed  should  provide  insights 
into  receptor  structure  and  may  identify  factors 
that  will  enhance  the  production  of  functional 
receptor  protein. 

We  have  developed  a  cell-free  expression  sys- 
tem capable  of  synthesizing  functional  iS2-adren- 
ergic  receptor.  This  system  was  used  to  study  the 
process  by  which  the  182  receptor  is  inserted  into 
the  endoplasmic  reticulum  and  folded  into  a 
functional  protein.  Research  thus  far  has  shown 
that  the  receptor  is  nonfunctional  immediately 
after  translation  and  translocation  into  the  endo- 
plasmic reticulum.  To  produce  a  functional  re- 
ceptor, additional  processing  is  needed.  ATP, 
intact  microsomal  membranes,  and  a  high-molec- 
ular-weight cytosolic  factor  are  required  for  this 
processing.  We  are  attempting  to  determine  the 
nature  of  this  processing  by  identifying  structural 
differences  between  a  functional  receptor  and 
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newly  synthesized  receptors  that  have  not  under- 
gone the  post-translational  processing  necessary 
to  produce  functional  protein.  We  hope  to  iden- 
tify the  requisite  cytosolic  and  membrane  factors. 

Cellular  Biology  of  /32-Adrenergic  Receptors 

-Following  prolonged  exposure  to  catechol- 
amines, the  /32-adrenergic  receptor  becomes  de- 
sensitized and  is  less  efficient  in  activating  adeny- 
lyl  cyclase.  Several  mechanisms  contribute  to  the 
process  of  desensitization,  including  receptor 


phosphorylation  and  the  removal  of  receptors 
from  the  plasma  membrane.  A  great  deal  has  been 
learned  about  the  role  here  of  phosphorylation, 
but  the  mechanisms  for  receptor  removal  are 
poorly  understood.  We  have  obtained  antibodies 
to  the  /Sj-adrenergic  receptor  and  are  using  them 
to  characterize  the  process  of  agonist-mediated 
receptor  internalization.  We  hope  to  learn  about 
the  molecular  mechanism  by  which  these  ago- 
nist-occupied receptors  are  identified  and  tar- 
geted, either  for  reversible  sequestration  into  an 
endosomal  compartment  or  for  destruction. 
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GENES  that  encode  receptors  for  foreign  anti- 
gens have  provided  our  most  pivotal  insights 
into  early  lymphocyte  development  and  lym- 
phoid malignancies.  The  genes  for  immunoglobu- 
lin (Ig),  or  antibodies,  and  for  the  T  cell  receptor 
(TCR)  encode  the  antigen  receptors  for  B  cells 
and  T  cells,  respectively.  During  early  lympho- 
cyte development,  recombination  at  the  DNA 
level  assembles  these  genes  to  create  a  wide  rep- 
ertoire of  receptor  specificities. 

Much  of  what  we  know  about  these  genes  has 
been  gleaned  from  studies  of  lymphoid  tumors. 
These  malignancies  are  clonal  expansions  of  a 
single  cell  and  provide  multiple  identical  copies 
of  these  genetic  events.  Provocatively,  the  char- 
acteristic interchromosomal  translocations  that 
typify  B  cell  malignancies  break  at  the  Ig  genes, 
while  those  of  T  cell  tumors  often  occur  at  the 
chromosomal  home  of  the  TCR  genes.  We  have 
exploited  this  geography  to  clone  the  DNA  at 
these  illegitimate  interchromosomal  junctures. 
This  serves  as  a  bridge  from  the  antigen  receptor 
loci  to  the  other  chromosomal  partner,  which  has 
often  introduced  a  new  cancer-promoting  gene. 

As  a  prototype,  we  have  cloned  the  juncture 
between  chromosomes  1 4  and  1 8  that  is  present 
in  the  most  frequent  form  of  human  lymphoma, 
follicular-type  B  cell  lymphoma.  This  transloca- 
tion occurs  early  in  the  development  of  a  B  cell 
and  introduces  a  newly  discovered  gene,  Bel- 2, 
into  the  Ig  locus.  A  hybrid  Bcl-2-lg  fusion  gene  is 
created,  resulting  in  the  overproduction  of  Bel- 2. 
Transgenic  mice  were  created  that  possess  a  copy 
of  the  abnormal  Bel-2-lg  fusion  gene  in  their  ge- 
netic material.  They  progressed  from  an  indolent 
expansion  of  resting  B  cells  to  high-grade  life- 
threatening  lymphomas,  recapitulating  the  natu- 
ral course  of  the  human  disease  and  proving  that 


this  translocation  causes  malignancy.  The  Bel-2 
protein  is  unique  among  proto-oncogenes  by  be- 
ing located  in  mitochondria.  Moreover,  it  has  a 
novel  function  in  that  it  blocks  the  programmed 
death  of  cells  independent  of  promoting  their 
growth.  When  deregulated,  fic/-2  extends  the  sur- 
vival of  B  cells  normally  destined  to  die.  Bel-2 
constitutes  the  first  member  of  a  new  oncogene 
category,  regulators  of  cell  death. 

In  a  parallel  set  of  experiments,  unanticipated 
rearrangements  into  the  b  TCR  locus  on  chromo- 
some 14  have  identified  the  interchromosomal 
translocation  sites  that  typify  early  T  cell  acute 
lymphoblastic  leukemias.  Two  of  these  new 
genes,  Ttg- 1  from  chromosome  1 1  and  Tel-3  from 
chromosome  10,  are  not  normally  expressed  in  T 
cells.  Instead  their  function  is  diverted  from  their 
normal  sites  to  T  cells.  This  provides  an  impor- 
tant opportunity  to  unravel  how  the  redirection 
of  these  regulatory  genes  into  another  cell  type 
causes  malignancy. 

The  majority  of  chromosomal  defects,  how- 
ever, have  no  candidate  gene  at  either  side  of  the 
juncture.  The  responsible  gene  lies  a  consider- 
able distance  from  known  genes.  In  characteriz- 
ing such  defects,  the  challenge  is  to  develop  tech- 
nologies that  analyze  large  expanses  of  the 
human  genome.  One  promising  approach  uti- 
lizes portions  of  chromosomes  from  yeast  organ- 
isms to  obtain  and  propagate  long  stretches  of 
human  DNA.  This  makes  it  possible  to  clone  and 
map  entire  segments  of  human  chromosomes, 
precisely  linking  known  genes,  and  generating 
probes  to  search  for  new  disease  loci. 

This  group  of  studies  aims  to  improve  our  un- 
derstanding of  the  genetic  pathways  of  early  T 
and  B  cell  development,  as  well  as  the  aberran- 
cies  that  result  in  malignancy. 
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OVER  the  years  our  laboratory  has  worked  on 
how  abnormalities  of  the  protein  dystrophin 
result  in  the  clinical  symptoms  of  Duchenne  and 
Becker  muscular  dystrophy.  From  our  initial 
identification  of  the  dystrophin  gene,  we  have 
turned  to  the  normal  protein  function  and  to  how 
this  might  be  restored  to  mitigate  disease  symp- 
toms. Our  efforts  have  centered  on  proteins  re- 
lated to  dystrophin  by  structure  or  function.  Not 
only  may  these  replace  absent  dystrophin  in  dis- 
eased muscle,  but  they  are  themselves  prime  can- 
didates for  disease-producing  alteration. 

Over  the  past  year  we  cloned  two  dystrophin 
relatives  and  studied  their  function  in  diseased 
and  normal  persons.  We  also  designed  more  accu- 
rate ways  of  detecting  mutations  within  the  dys- 
trophin gene  and  pinpointed  dystrophin  in  the 
central  nervous  system.  We  have  also  hypothe- 
sized a  role  abnormal  dystrophin  might  play  in 
cognitive  function. 

After  dystrophin  had  been  cloned  and  se- 
quenced, it  was  seen  to  belong  to  a  family  of  pro- 
teins with  cytoskeletal  function.  A  group  at  Ox- 
ford University,  using  clones  for  dystrophin,  was 
able  to  isolate  a  part  of  a  member  of  the  family. 
This  protein,  encoded  by  a  locus  on  chromosome 
6,  was  partly  sequenced  and  shown  to  be  85  per- 
cent identical  to  dystrophin. 

Using  the  published  sequence,  we  cloned  a 
part  of  this  locus  and  expressed  the  protein  in 
bacterial  cells.  Antibodies  directed  against  the 
new  protein  were  produced.  Used  as  a  probe 
against  normal  muscle,  the  antibodies  detected  a 
protein  the  same  size  as  dystrophin,  and  it  was 
also  found  in  the  muscle  of  Duchenne  patients. 
We  called  it  dystrophin-related  protein,  or  DRP. 

Using  the  antibodies  to  probe  for  DRP,  we  were 
able  to  show  that  many  different  tissues  ex- 
pressed it  and  that  the  highest  levels  were  in  de- 
veloping fetal  muscle.  By  microscopic  localiza- 
tion, we  showed  further  that  DRP  colocalized  in 
muscle  with  the  acetylcholine  receptor.  In  a  de- 
veloping muscle  the  protein  colocalizes  with 
dystrophin,  but  as  the  muscle  matures,  DRP  mi- 
grates along  with  the  acetylcholine  receptor  to 
the  neuromuscular  junction. 


We  are  currently  trying  to  unravel  why  a  pro- 
tein that  is  so  similar  to  dystrophin  has  such  a 
specialized  function  in  muscle  and  how  we  might 
interfere  with  normal  function  to  make  DRP  work 
more  like  dystrophin.  We  are  also  searching  for  a 
neuromuscular  disease  that  might  be  caused  by 
abnormalities  of  DRP. 

Our  antibodies  directed  against  dystrophin  de- 
tected a  second  protein,  which  we  have  identi- 
fied and  partly  cloned.  This  large  molecule  was 
found  exclusively  in  the  brain.  Using  our  dystro- 
phin antibodies  as  a  probe,  we  were  able  to  clone 
pieces  of  cDNA  that  encoded  this  protein.  As  part 
of  our  strategy  in  characterizing  new  members  of 
the  dystrophin  family,  we  mapped  the  encoding 
locus  to  chromosome  5.  A  search  of  the  human 
linkage  map  revealed  that  the  gene  altered  to 
cause  spinal  muscular  atrophy  has  been  localized 
to  chromosome  5ql  2.  We  have  initiated  a  collab- 
orative effort  with  David  Ward  (Yale  University) 
to  localize  further  our  dystrophin-related  protein 
on  chromosome  5 .  We  are  also  attempting  to  gen- 
erate highly  polymorphic  markers  surrounding 
the  locus  to  map  this  gene  relative  to  SMA  muta- 
tions. This  latter  effort  will  be  in  collaboration 
with  Conrad  Gilliam  (Columbia  University) ,  who 
was  involved  in  the  original  mapping  of  SMA  to 
chromosome  5. 

One  of  the  curious  features  of  some  Duchenne 
patients  is  mental  impairment.  Because  virtually 
all  patients  have  almost  no  dystrophin,  it  was  dif- 
ficult to  explain  the  impairment  in  some.  We 
showed  that  dystrophin  was  expressed  in  the 
brain,  but  under  different  regulatory  control 
from  that  found  in  muscle.  In  the  course  of  devel- 
oping more  efficient  dystrophin-detecting  anti- 
bodies, we  found  one  that  was  highly  sensitive  for 
dystrophin  and  showed  little  cross-reaction  with 
other  brain  proteins. 

We  used  this  antibody  to  localize  dystrophin  in 
the  brain,  both  with  the  light  microscope  and  the 
electron  microscope.  Unlike  results  in  muscle, 
where  all  types  of  muscle  reveal  dystrophin,  it 
was  only  found  in  a  subset  of  neurons  of  the  cere- 
bellum and  cerebral  cortex.  We  hypothesize  that 
an  absence  of  dystrophin  in  the  cortical  neurons 
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might  yield  a  mild  memal  impairment,  explain- 
ing the  mental  phenotype  observed  in  Duchenne 
patients. 

Abnormalities  of  dystrophin  are  easily  detected 
at  the  protein  level,  and  nearly  70  percent  of  mu- 
tations that  cause  these  abnormalities  have  been 
shown  to  be  deletions  or  duplications  of  some 
part  of  this  extremely  large  locus.  The  remaining 
mutations  are  probably  single-base  changes  that 
either  disrupt  translation  of  a  functional  protein 
or  affect  the  processing  of  the  large  primary  dys- 
trophin transcript. 

We  have  designed  primers  from  dystrophin's 
nucleotide  sequence  to  allow  polymerase  chain 
reaction  (PGR)  amplification  of  specific  regions 
of  the  protein's  transcript.  By  looking  at  the  tran- 


script, we  should  be  able  to  determine  the  nature 
of  these  other  mutations  and  to  follow  them  in 
families  at  risk  of  having  inherited  such  muta- 
tions. This  ability  will  improve  the  diagnosis  of 
Duchenne  dystrophy  and  complement  existing 
methods. 

Our  aim  for  the  future  year  is  to  build  on  work 
already  in  progress.  We  will  continue  to  identify, 
clone,  and  study  new  dystrophin  relatives  with  an 
eye  toward  their  possible  role  in  mitigating  the 
effects  of  abnormal  dystrophin.  We  will  also 
study  any  possible  role  these  proteins  might  play 
in  other  neuromuscular  diseases.  As  more  disease 
loci  are  mapped  to  regions  of  human  chromo- 
somes, our  candidate  gene  approach  should  help 
in  the  rapid  characterization  of  these  diseases. 
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OUR  interests  are  in  characterizing  the  struc- 
tures of  proteins,  using  crystallography — 
namely,  x-ray  diffraction  experiments — and 
computer  simulations.  Crystallography  and 
computer  simulations  are  complementary  ap- 
proaches to  understanding  protein  function.  The 
x-ray  experiments  provide  three-dimensional 
structures,  which  are  currently  impossible  to  gen- 
erate from  theoretical  considerations  alone,  and 
the  simulations  allow  us  to  visualize  the  effects  of 
extrapolations  that  are  not  experimentally  feasi- 
ble. We  apply  the  knowledge  gained  from  these 
studies  toward  the  design  of  mutations  and  inhibi- 
tors to  modify  protein  activity. 

Part  of  our  work  involves  applications  of  the 
molecular  dynamics  (MD)  method  to  problems 
in  protein  crystallography.  This  powerful  tool  for 
simulating  protein  structure  and  structural  rela- 
tionships involves  computing  the  forces  between 
all  the  atoms  in  a  protein  and  generating  trajec- 
tories of  the  atomic  motion  in  response  to  these 
forces.  Reasonably  realistic  MD  simulations  of 
solvated  enzymes  or  proteins  in  crystal  lattices 
are  now  possible.  The  simulations,  however,  rely 
on  rather  approximate  interatomic  potential 
functions  and,  even  with  the  fastest  computers 
available  today,  are  limited  to  very  short  time- 
scales.  We  are  carefully  testing  and  evaluating  the 
results  of  MD  calculations  through  simulation  of 
small  proteins,  such  as  crambin  and  ribonucle- 
ase,  in  the  crystal  environment,  thus  permitting 
detailed  comparison  with  high-resolution  x-ray 
diffraction  data. 

In  related  work,  we  are  developing  methods 
for  the  better  treatment  of  dynamics  in  refining 
crystallographic  structure. 

Our  projects  in  the  determination  of  crystallo- 
graphic structure  encompass  two  major  areas-, 
one  involves  redox  proteins  and  the  transcrip- 
tional response  to  oxidative  stress  (in  collabora- 
tion with  Peter  Model  and  Anthony  Cerami  at  the 
Rockefeller  University  and  with  Gisela  Storz  at 
the  National  Institutes  of  Health),  and  the  other  is 
concerned  with  transcription  factors  in  Drosoph- 
ila  development  (in  collaboration  with  Claude 
Desplan,  HHMI,  the  Rockefeller  University).  We 
focus  here  on  just  the  first  topic. 


One  of  the  efforts  in  this  area  is  aimed  at  devel- 
oping an  understanding  of  the  catalytic  and  sub- 
strate recognition  mechanisms  of  two  related 
redox  enzymes,  thioredoxin  reductase  and  try- 
panothione  reductase.  These  are  members  of  a 
widely  distributed  family  that  channel  the  reduc- 
tive power  of  NAD(P)H  via  a  protein-bound  fla- 
vin (FAD),  to  an  active-site  disulfide  bond.  Other 
well-known  members  include  glutathione  reduc- 
tase (for  which  a  high-resolution  x-ray  structure 
was  first  determined),  lipoamide  dehydrogenase, 
and  mercuric  ion  reductase. 

Thioredoxin  is  a  small  redox-active  protein 
that  is  the  reductant  of  ribonucleotide  reductase 
in  the  DNA  synthesis  pathway.  It  has  diverse  other 
functions,  ranging  from  the  light-activated  regula- 
tion of  enzyme  pathways  in  plants  to  the  catalysis 
of  protein  disulfide  isomerization.  In  Esche- 
richia coli,  thioredoxin  is  maintained  in  the  re- 
duced state  by  the  action  of  NADPH-dependent 
thioredoxin  reductase  (TR).  Previous  x-ray  dif- 
fraction studies  on  the  related  enzyme  glutathi- 
one reductase  (GR)  revealed  a  dimeric  structure 
(Af,  =  2  X  52,400)  with  four  domains  within 
each  molecule:  the  FAD-  and  NADPH-binding  do- 
mains, the  "central"  domain  that  is  also  responsi- 
ble for  binding  FAD,  and  the  carboxyl-terminal 
domain  that  provides  the  dimer  interface  as  well 
as  residues  that  are  critical  for  substrate  binding 
and  catalytic  activity. 

Although  TR  and  GR  both  catalyze  the  same 
chemical  reaction,  that  of  disulfide  reduction  in 
their  substrates,  TR  is  significantly  smaller  (M^  = 
2  X  34,500)  and  lacks  the  interface  domain  that 
in  GR  forms  part  of  the  active  site.  Another  funda- 
mental difference  in  their  sequences  involves  the 
redox-active  disulfide,  which  in  GR  is  part  of  a 
highly  conserved  hexapeptide  (Cys-Val-Asn-Val- 
Gly-Cys)  in  the  FAD  domain,  and  in  TR  is  a  shorter 
segment  (Cys-Ala-Thr-Cys)  found  instead  in  the 
NADPH  domain. 

The  three-dimensional  structure  of  TR,  deter- 
mined at  3.0  A  resolution  by  multiple  isomor- 
phous  replacement  (in  collaboration  with  Peter 
Model  and  with  Charles  Williams  at  the  Univer- 
sity of  Michigan)  and  initially  refined  at  a  resolu- 
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tion  of  2  A  using  synchrotron  x-ray  data,  shows 
that  despite  these  differences  the  tertiary  struc- 
ture of  TR  is  closely  similar  to  that  of  GR.  How- 
ever, the  quaternary  structures  and  active-site  ar- 
chitectures are  unrelated,  and  the  two  enzymes 
appear  to  have  acquired  the  disulfide  reductase 
activity  independently.  These  results  provide  a 
striking  example  of  modular  evolution  of  three- 
dimensional  protein  structure  and  enzyme 
function. 

Trypanothione  reductase  (TrypR)  represents 
an  enzyme  target  for  drug  intervention  in  African 
trypanosomiasis,  Chagas'  disease,  and  leishman- 
iasis. The  protozoan  parasites  do  not  possess  GR, 
and  instead  use  a  glutathione-based  peptide,  try- 
panothione, for  the  essential  reduction  of  gluta- 
thione. Trypanothione  is  in  turn  reduced  by 
TrypR,  which  was  first  characterized  from  an  in- 
sect trypanosomatid,  Crithidia  fasciculata.  This 
analysis  revealed  that  it  is  very  similar  to  GR  in 
size,  catalytic  mechanism,  and  active-site 
structure. 

The  two  enzymes,  however,  are  mutually  ex- 
clusive for  substrate.  This,  combined  with  the 
known  susceptibility  of  trypanosomatids  to  oxi- 
dative stress  in  the  absence  of  reduced  glutathi- 
one, makes  TrypR  a  promising  target  for  the  devel- 
opment of  therapeutic  agents  that  are  not  toxic  to 
the  host  cell.  We  have  obtained  nicely  diffracting 
single  crystals  of  trypanothione  reductase  from  C. 
fasciculata  and  have  recently  solved  the  struc- 
ture to  2.4  A  resolution  by  molecular  replace- 
ment, using  the  structure  of  human  erythrocyte 


GR  as  a  search  model  (in  collaboration  with 
Anthony  Cerami) .  We  are  in  the  process  of  identi- 
fying the  residues  that  are  responsible  for  the  1 0^ 
enhancement  in  the  turnover  of  glutathione  for 
trypanothione  in  the  parasite  enzyme. 

An  extremely  interesting  and  well-character- 
ized redox-based  regulatory  system  is  the  bacte- 
rial response  to  oxidative  stress.  Exposure  of  E. 
colt  or  Salmonella  typhimurium  to  low  levels  of 
hydrogen  peroxide  results  in  subsequent  resis- 
tance to  high  levels.  This  is  due  to  the  induction 
of  30  proteins.  The  OxyR  protein  is  responsible 
for  the  peroxide  induction  of  nine  of  these,  in- 
cluding GR,  catalase,  and  an  alkyl  hydroperoxide 
reductase  that  is  similar  in  sequence  to  TR. 

Unlike  many  bacterial  regulatory  systems, 
where  distinct  proteins  are  involved  in  sensing 
the  environmental  change  and  in  activating  tran- 
scription, OxyR  is  both  the  sensor  and  the  tran- 
scriptional activator.  The  levels  of  OxyR  do  not 
change  significantly  when  the  cell  is  challenged 
with  peroxide.  Rather,  OxyR  undergoes  rapid 
and  reversible  changes  in  its  DNA-binding  proper- 
ties. Depending  on  which  site  it  is  bound  to, 
OxyR  acts  as  either  an  activator  or  a  repressor  of 
transcription.  We  have  entered  into  a  collabora- 
tion with  Gisela  Storz  to  crystallize  OxyR  and  de- 
termine its  three-dimensional  structure,  its  mode 
of  binding  to  DNA,  and  the  mechanism  of  its  re- 
sponse to  oxidative  stress.  Since  OxyR  is  member 
of  a  large  family  of  bacterial  regulatory  proteins 
of  as  yet  unknown  structure  or  mechanism,  our 
studies  are  likely  to  have  broad  implications. 
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WE  are  interested  in  the  two  major  aspects  of 
Down  syndrome,  namely  the  etiology  and 
pathogenesis  of  this  disorder. 

Etiology 

Why  does  a  woman's  risk  of  having  an  offspring 
with  Down  syndrome  increase  dramatically  with 
age?  There  are  two  competing  theories.  The 
"older  egg"  model  states  that  as  women  age,  they 
produce  more  abnormal  (aneusomic)  eggs.  The 
"relaxed  selection"  model  states  that  as  women 
age,  they  lose  the  ability  to  abort  abnormal  con- 
ceptuses.  To  determine  which  of  these  hypothe- 
ses, or  whether  a  combination  of  the  two,  is 
correct,  it  is  necessary  to  study  many  cases  of 
Down  syndrome. 

Our  laboratory  examines  the  molecular  aspects 
of  such  a  study.  We  utilize  polymerase  chain  reac- 
tion (PGR)  techniques  that  analyze  polymorphic 
variability  due  to  differences  in  the  quantity  of 
interspersed  d(A,C)  :d(G,T)  sequences  and  to  dif- 
ferences in  the  length  and/or  sequence  of  the  3' 
end  of  human  interspersed  highly  repetitive  Alu 
sequences.  Two  major  advantages  of  these  tech- 
niques are  the  extensive  variability  of  these  se- 
quences and  the  requirement  of  only  small 
amounts  of  subject  DNA  for  the  PCR-based  analy- 
sis. The  latter  point  is  particularly  important,  as 
blood  from  even  small  children  becomes  almost 
an  inexhaustible  resource  with  this  technology. 

To  detect  useful  variability  on  chromosome 
21,  we  hybridized  cosmids  containing  flow- 
sorted  DNA  enriched  for  this  chromosome  (cour- 
tesy of  Lawrence  Livermore  Laboratories  and  Im- 
perial Cancer  Research  Fund)  with  a  poly 
d(A,C):d(G,T)  probe.  In  addition,  we  isolated 
several  cosmids  and  yeast  anificial  chromosomes 
(YACs)  that  contain  variable  oligo  d(A,C)  :d(G,T) 
and  Alu  sequences  near  the  centromere  of  21q. 
Variability  detected  by  oligomers  that  abut  these 
sequences  on  2  Iq  along  with  variability  detected 
by  more  standard  restriction  fragment  length 
polymorphisms  has  resulted  in  two  important 
(and  unexpected)  findings:  1)  94  percent  of 
nondisjunction  errors  are  maternal  in  origin,  a  sig- 
nificantly higher  percentage  than  the  75  percent 


figure  based  on  subjective  cytogenetic  observa- 
tions that  must  have  been  in  error;  2)  a  significant 
plurality  of  Down  syndrome  cases  show  no  detect- 
able crossing  over  on  chromosome  2 1  in  the  par- 
ent responsible  for  nondisjunction. 

It  will  be  especially  important  to  distinguish 
between  meiosis  I  and  II  nondisjunction  errors. 
To  accomplish  this,  we  have  performed  locus  ex- 
pansion on  D21S16  and  D21S120  and  isolated 
various  YACs  that  in  aggregate  comprise  the  most 
proximal  known  markers  on  21q.  We  are 
currently  extracting  variable  markers  from  these 
clones.  The  establishment  of  molecular  variabil- 
ity near  the  centromere  of  chromosome  2 1  will 
enable  us  to  determine  whether  an  error  is  mater- 
nal meiosis  I,  maternal  meiosis  II,  paternal  meio- 
sis I,  or  paternal  meiosis  II.  By  comparing  these 
rates  among  women  of  different  age  groups,  it 
will  be  possible  to  determine  whether  the  older 
egg  or  relaxed  selection  model  (or  a  combination 
of  the  two)  is  correct.  Furthermore,  it  is  too  early 
to  determine  whether  the  lack  of  crossing  over 
observed  in  a  plurality  of  cases  reflects  no  chias- 
mata  or  terminalization  of  chiasmata  and  whether 
this  depression  of  crossing  over  is  associated  with 
advanced  maternal  age.  To  answer  these  ques- 
tions, it  will  be  necessary  to  examine  variable  mo- 
lecular probes  that  span  the  long  arm  of  chromo- 
some 21,  including  markers  at  the  distal  tip  of 
2  Iq.  Whatever  the  results,  the  biology  of  nondis- 
junction is  of  even  greater  interest  and  complex- 
ity than  imagined  originally. 

Pathogenesis 

To  isolate  genes  encoded  by  chromosome  2 1 , 
we  use  a  novel  recombination-based  methodol- 
ogy. Genomic  fragments  isolated  from  chromo- 
some 2 1  are  cloned  into  a  plasmid  vector  with 
the  genetic  marker  supY.  RNA  isolated  from  a  vari- 
ety of  human  tissues  was  used  to  construct  bacte- 
riophage X  cDNA  libraries,  which  are  then  in- 
fected into  cells  harboring  supY  plasmids 
carrying  individual  nonrepeated  genomic  frag- 
ments on  chromosome  2 1 .  If  any  member  of  the 
genie  cDNA  library  shares  homology  with  the  ge- 
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nomic  DNA  fragment,  then  recombination  me- 
diated by  that  homology  will  ensue. 

Following  recombination  between  the  plasmid 
and  the  bacteriophage,  selection  for  bacterio- 
phages carrying  a  given  plasmid  with  swpF  will 
result  in  selection  for  bacteriophages  carrying  a 
cDNA  that  is  homologous  to  a  genomic  DNA 
cloned  in  the  plasmid.  In  other  words,  the  system 
is  designed  to  select  for  genomic  sequences  that 
are  transcribed.  The  system  is  also  designed  to 
stand  alone  or  to  interdigitate  with  the  genomic 
initiative  as  it  proceeds.  In  the  latter  case,  as  se- 
quencing detects  open  reading  frames,  the  recom- 
bination-based assay  is  designed  to  delineate  the 
tissue  and  timing  of  transcription  quickly  and  ac- 
curately and  isolate  the  transcribed  sequence. 

This  methodology  has  worked  in  a  model  ex- 
periment and  resulted  in  the  isolation  of  at  least 
one  gene  on  chromosome  2 1 .  However,  in  per- 
forming these  studies,  two  arcane  problems  were 
noted. 

1.  Once  recombination  indicating  transcrip- 
tion has  occurred,  further  recombination  be- 
tween the  X  phage  of  the  cDNA  library  and  the  X 
lysogen  used  originally  to  insert  a  single  copy  of 
the  PI  ban  gene  results  in  frequent  scrambling  of 


the  X  phage,  carrying  the  cDNA  into  an  unrecog- 
nizable state.  This  can  be  circumvented  by  clon- 
ing a  copy  of  the  V\  ban  gene  in  another  way  than 
as  a  X  lysogen.  The  final  successful  strategy 
(which  we  accomplished  recently)  was  to  con- 
struct a  bacterial  strain  (DK43)  carrying  a  low- 
copy  PI  lysogen  with  a  functioning  ban  gene  and 
the  PI  restriction  system  inactivated  insertion- 
ally.  This  strain  can  now  be  used  to  isolate  genes 
on  chromosome  21. 

2.  The  bulk  of  cDNA  (gene)  libraries,  made  in 
other  laboratories  for  other  purposes,  are  contam- 
inated with  small  amounts  of  ubiquitous  plasmid 
pBR322  DNA  sequences.  These  sequences  pre- 
vent the  screening  of  such  libraries  with  genomic 
clones  inserted  in  ColEl  (pBR322)-based  plas- 
mids  by  recombination.  To  circumvent  this  prob- 
lem, we  are  cloning  the  sup¥  gene  into  an  R6K- 
derived  plasmid  that ,  lacks  homology  with 
pBR322.  Since  the  R6K  replicon  is  used  rarely 
and  need  not  be  present  in  the  strains  used  for 
propagation  of  Xgtl  1  libraries,  contamination  of 
cDNA  libraries  with  this  plasmid  will  not  occur. 
This  will  allow  the  screening  of  a  wide  variety  of 
extant  cDNA  libraries  for  transcription  by  chro- 
mosome-specific elements. 
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MEDICAL  history  is  marked  by  extraordinary 
successes  against  viral  infections,  but  it  is 
also  punctuated  by  the  continual  emergence  of 
new  viruses.  Since  viruses,  in  general,  contain 
very  limited  genetic  information,  they  must  rely 
upon  host  cells  for  their  own  growth.  How  they 
cause  diseases  and  how  they  continue  to  flourish 
in  nature  are  not  only  interesting  subjects  in 
themselves,  but  offer  a  lesson  in  the  everyday 
workings  of  normal  cells.  Our  laboratory  is  inter- 
ested in  RNA  viruses,  replicating  entities  in 
which  RNA,  in  contrast  to  DNA,  is  the  genetic 
material.  We  are  exploring  how  these  viruses  rep- 
licate and  cause  diseases. 

One  of  those  we  are  studying  is  the  corona- 
virus,  named  for  its  similar  appearance  to  the  co- 
rona of  the  sun.  The  virus  causes  the  common 
cold  in  humans  and  a  variety  of  gastrointestinal 
and  respiratory  problems  in  animals.  It  also 
causes  symptoms  very  similar  to  those  of  multiple 
sclerosis,  thus  providing  a  model  system  for  study- 
ing this  disease.  It  has  an  RNA  genome  of  3 1 ,000 
nucleotides,  which  is  the  longest  known  stable 
RNA.  We  are  interested  in  learning  how  this  un- 
usually large  RNA  expresses  its  genes  and  main- 
tains its  genetic  stability,  despite  an  overwhelm- 
ingly high  frequency  of  error  in  RNA  synthesis. 
We  have  recently  determined  the  complete  se- 
quence of  the  genome,  giving  us  a  glimpse  of 
how  the  viral  genes  express  themselves. 

The  virus  utilizes  a  novel  RNA  synthesis  mecha- 
nism, a  discontinuous  process  that  fuses  a  leader 
RNA  to  a  gene  located  some  distance  from  it.  This 
unusual  mechanism  allows  the  leader  RNA  to 
control  the  expression  of  viral  genes  and  change 
the  biological  properties  and  pathogenicity  of 
the  virus  as  a  result.  This  means  that  the  enzyme 
catalyzing  coronaviral  RNA  synthesis  is  also  un- 
usual, which  is,  indeed,  suggested  from  the  se- 
quence of  the  gene  encoding  the  enzyme.  Our 
laboratory  is  investigating  this  novel  RNA  synthe- 
sis mechanism. 

Another  unusual  characteristic  of  coronavirus 
RNA  has  been  revealed  in  our  findings:  it  can  un- 
dergo genetic  exchange  (RNA-RNA  recombina- 
tion) at  an  extraordinarily  high  rate.  RNA-RNA  re- 


combination was  previously  thought  to  be  a  rare 
event  in  nature.  We  demonstrated,  however,  that 
it  occurs  readily  between  coronaviruses.  This  re- 
combination can  take  place  almost  anywhere  in 
the  RNA  genome,  both  in  tissue  culture  cells  and 
during  animal  infections. 

Furthermore,  we  demonstrated  that  recombi- 
nant viruses  could  become  a  predominant  virus 
population  under  certain  conditions,  replacing 
the  parental  viruses  by  a  simple  process  of  natural 
selection.  Thus  recombination  represents  a  pow- 
erful evolutionary  tool  for  RNA  viruses.  From  the 
standpoint  of  viral  biology,  RNA  recombination 
may  be  the  genetic  mechanism  by  which  corona- 
viruses  weed  out  defective  RNA  sequences  gener- 
ated by  errors  in  RNA  synthesis.  Coronavirus  is 
thus  able  to  maintain  an  RNA  genome  larger  than 
was  thought  theoretically  possible.  RNA  recombi- 
nation has  now  been  demonstrated  in  many  dif- 
ferent viruses,  suggesting  its  important  role  in 
virus  evolution. 

This  genetic  phenomenon  also  has  an  impor- 
tant implication  in  vaccine  development  for  dis- 
eases such  as  AIDS  (acquired  immune  deficiency 
syndrome) ,  since  genetic  exchanges  may  lead  to 
genetic  instability  of  attenuated  virus  vaccines. 
We  are  continuing  to  study  the  RNA  recombina- 
tion mechanism  and  attempting  to  use  it  as  a  ge- 
netic tool  in  determining  how  viruses  cause 
disease. 

Another  virus  we  are  studying  is  hepatitis  delta 
virus  (HDV) ,  a  human  hepatitis  virus  commonly 
associated  with  a  severe  form  of  hepatitis.  HDV, 
by  itself,  does  not  infect  humans  because  it  is  de- 
fective and  requires  another  viral  agent,  hepatitis 
B  virus  (HBV),  to  supply  an  essential  envelope 
protein  in  order  to  infect  liver  cells.  The  HDV  has 
been  shown  to  cause  epidemics  of  fulminant  hep- 
atitis in  many  parts  of  the  world.  In  the  United 
States,  it  is  prevalent  among  intravenous  drug 
abusers. 

The  virus  contains  a  circular,  single-stranded 
RNA  genome  of  only  17,000  nucleotides.  It  is  the 
only  animal  virus  with  a  circular  RNA.  This  ge- 
nome structure  is  reminiscent  of  a  group  of  plant 
pathogens,  viroids  or  virusoids,  which  cause  a  va- 
riety of  plant  diseases.  Indeed,  the  similarity  be- 
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tween  HDV  RNA  and  plant  viroid  RNAs  goes 
beyond  their  circular  RNA  structure.  There  are 
several  structural  and  biochemical  features  that 
suggest  a  close  evolutionary  relationship  be- 
tween HDV  and  plant  viroid  RNAs.  Both  RNA 
groups  contain  a  "ribozyme"  activity,  in  which 
the  RNA  serves  as  an  enzyme  that  cleaves  and  li- 
gates  the  RNA  itself.  Thus  HDV  RNA  stands  at  a 
peculiar  place  in  the  evolutionary  ladder:  it  may 
have  been  derived  from  a  plant  pathogen  by  re- 
combination with  a  gene  that  gave  it  the  ability  to 
infect  human  cells  and  cause  diseases.  Our  labora- 
tory is  studying  the  properties  of  this  ribozyme 
activity.  We  have  shown  that  HDV  RNA  represents 
a  new  class  of  ribozyme,  which  is  distinct  from 
any  others  known. 

One  important  difference  between  HDV  RNA 
and  plant  viroid  RNAs  is  the  ability  of  the  former 
to  synthesize  a  protein,  hepatitis  delta  antigen 
(HDAg),  the  HDV  signature  protein.  HDAg  is  re- 
quired for  HDV  RNA  synthesis.  We  have  been 


studying  this  protein's  properties  and  functions 
and  have  shown  that  it  interacts  with  HDV  RNA  in 
a  specific  way.  HDAg  may  be  the  reason  the  virus 
causes  diseases. 

What  is  the  role  of  this  protein,  HDAg,  in  HDV 
RNA  synthesis? — an  unusual  synthesis  because 
HDV  RNA  is  so  small  (1,700  nucleotides)  that  it 
lacks  capacity  to  provide  its  own  synthesizing  en- 
zymes. Therefore,  HDV  most  likely  borrows  cel- 
lular enzymes  to  do  the  job,  which  is  uncharac- 
teristic of  RNA  viruses.  Most  RNA  viruses  must 
make  their  own  enzymes,  since  normal  cells  do 
not  appear  to  have  this  type. 

HDV  thus  provides  a  new  perspective  from  the 
small  end  of  the  RNA  spectrum.  Our  laboratory  is 
studying  one  of  the  largest  RNA  viruses  (corona- 
virus)  and  the  smallest  (HDV),  which  utilize  dif- 
ferent principles  for  viral  replication.  Our  stud- 
ies not  only  offer  insights  into  how  these  viruses 
cause  diseases,  but  also  into  fundamental  mecha- 
nisms of  RNA  synthesis  and  RNA  evolution. 
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MY  laboratory  is  studying  the  molecular  biol- 
ogy of  human  papillomaviruses  types  16 
and  18  and  their  relationship  to  neoplastic  dis- 
ease. HPV-16  and  -18  are  the  probable  causative 
agents  of  the  many  cancers  of  the  urogenital  re- 
gion, in  particular  those  of  the  cervix  of  the 
uterus.  Infection  by  HPVs  is,  however,  not  suffi- 
cient by  itself  to  induce  cervical  cancer;  other 
cellular  events,  such  as  the  activation  of  onco- 
genes, may  also  be  required.  We  are  currently  ana- 
lyzing the  molecular  mechanisms  by  which  HPVs 
contribute  to  the  development  of  malignancy. 

Human  papillomaviruses  are  classified  into 
various  subtypes,  depending  on  sequence  homol- 
ogy. If  an  isolate  shares  less  than  50  percent  DNA 
homology  with  other  subtypes,  it  is  classified  as  a 
new  viral  type.  More  than  66  distinct  HPV  sub- 
types have  been  identified.  The  majority  of  papil- 
lomaviruses (numbered  according  to  chronologi- 
cal order  of  discovery)  cause  benign  skin  warts; 
only  a  limited  subset  is  associated  with  cervical 
cancer  (HPV-16,  -18,-31,  and  -33).  HPV-6  is  re- 
sponsible for  many  genital  warts;  HPV- 1 6  and  - 1 8 
are  found  in  only  a  small  proportion  of  benign 
lesions.  In  contrast,  more  than  95  percent  of  all 
malignant  lesions  contain  HPV-16,  -18,  and  -31 
sequences.  An  important  question  that  our  labora- 
tory is  studying  is  what  factors  differentiate  a 
virus  that  causes  benign  lesions  from  one  capable 
of  inducing  malignancies.  Although  the  number 
of  infected  individuals  has  increased  10-fold  in 
the  last  10  years,  effective  monitoring  proce- 
dures such  as  the  PAP  smear  have  limited  the  in- 
crease in  cervical  cancers. 

HPV- 1 6  and  - 1 8  have  a  latent  period  between 
the  time  of  infection  and  the  appearance  of  a  neo- 
plastic state.  In  the  benign  state  the  virus  is 
usually  found  as  an  episome;  in  the  malignant 
state  the  genome  is  usually  found  integrated  into 
the  host  chromosome.  One  model  suggests  that 
this  integration  process  is  important  in  the  devel- 
opment of  the  cancerous  state,  perhaps  through 
the  removal  of  a  dominant  inhibitor  of  transfor- 
mation. Thus  viral  integration  into  the  chromo- 
some may  be  the  first  step  of  a  multistep  process 
in  the  development  of  neoplasia. 


We  have  been  studying  how  HPV  viral  genes 
contribute  to  the  transformation  process.  Using 
in  vitro  protocols,  we  have  transformed  both  hu- 
man and  mouse  cells  in  tissue  culture.  In  one  set 
of  experiments,  we  introduced  HPV  DNA  into  the 
immortalized  rodent  cell  line  NIH  3T3  and  found 
that  these  cells  could  induce  tumors  after  injec- 
tion into  mice.  Since  NIH  3T3  cells  by  themselves 
do  not  induce  tumors,  this  demonstrates  the  tu- 
morigenic  potential  of  HPVs. 

We  have  also  investigated  the  ability  of  HPV  to 
transform  primary  human  epithelial  cells,  the  nat- 
ural host  cell  for  HPV  infection.  HPV  can  immor- 
talize these  cells,  which  normally  only  have  a  lim- 
ited life  span  in  vitro.  In  addition,  using  a  system 
that  mimics  the  differentiation  properties  of  epi- 
thelial cells  in  vitro,  we  have  shown  that  HPV 
sequences  alter  the  ability  of  epithelial  cells  to 
differentiate.  Although  cells  transfected  by  HPV- 
16  and  -18  continue  to  stratify,  the  expression  of 
viral  genes  disrupts  the  normal  differentiation 
pattern. 

The  morphological  changes  that  we  observe  in 
this  tissue  culture  system  are  similar  to  those  seen 
in  low-grade  cervical  neoplasias.  In  tissue  cul- 
ture, our  HPV  cell  lines  quickly  lose  the  ability  to 
differentiate  and  develop  the  appearance  of  high- 
grade  neoplasias.  Two  HPV  genes,  E6  and  E7, 
seem  to  be  required  for  this  transformation  pro- 
cess. This  system  may  be  an  in  vitro  model  for  the 
development  of  cervical  cancer;  we  hope  to  uti- 
lize it  to  identify  important  factors  involved  in 
controlling  the  development  of  malignancy. 
These  observations  support  the  etiological  role  of 
HPV  in  the  development  of  cervical  cancer. 

Although  it  is  still  not  possible  to  propagate 
HPVs  in  tissue  culture,  we  have  made  significant 
progress  in  this  area.  We  have  duplicated  two  fea- 
tures of  a  productive  infection  in  vitro-,  the  dif- 
ferentiation-specific amplification  of  viral  ge- 
nomes and  the  expression  of  late  transcripts.  A 
block  still  remains  at  the  level  of  synthesis  of  cap- 
sid  proteins,  but  we  hope  to  overcome  this  obsta- 
cle in  the  near  future. 

To  understand  the  life  cycle  of  papillomavi- 
ruses, we  are  examining  the  factors  involved  in 
the  tissue-specific  regulation  of  viral  gene  ex- 
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pression  and  replication.  We  have  identified 
HPV- 1 8  enhancer  sequences  that  are  responsible 
for  expression  in  maturing  keratinocytes.  The 
function  of  one  of  these  sequences  depends  on 
the  action  of  a  cellular  gene  found  only  in  epithe- 
lial cells,  the  keratinocyte-stimulating  factor 
(KRF- 1 ) .  We  hope  to  understand  the  tissue  spec- 
trum of  expression  of  papillomaviruses  by  study- 
ing the  epithelial-specific  mechanisms  for  regula- 
tion of  transcription. 

In  collaboration  M^ith  Robert  Hammer  (HHMI, 
University  of  Texas  Southwestern  Medical  Center 
at  Dallas),  we  have  developed  an  animal  model 


for  papillomavirus-induced  disease.  Transgenic 
mice  have  been  constructed  that  express  only  E6- 
E7  genes  of  HPV- 18.  These  mice  develop  tumors 
of  the  urogenital  tract  :  males  develop  seminal  ves- 
icle tumors;  females  develop  what  appear  to  be 
tumors  of  the  cervix.  This  demonstrates  that  HPVs 
are  capable  of  inducing  urogenital  tumors  in  ani- 
mals. This  model  may  thus  be  useful  in  the  study 
of  virally  induced  neoplasias. 

The  study  of  HPV  will  provide  information  on 
mechanisms  of  transformation  and  tumor  progres- 
sion, as  well  as  tissue-specific  expression  and 
viral  replication. 


In  situ  hybridization  of  raft  culture  of  a  cell  line  derived  from  a  low-grade  cervical  neoplasm  and 
containing  episomal  copies  of  HPV-31b.  The  dark  spots  indicate  amplification  of  HPV  viral  ge- 
nomes in  highly  differentiated  koilocyte-like  cells,  similar  to  the  differentiation- dependent  ampli- 
fication of  HPV  DNA  that  occurs  in  vivo. 

Prom  Bedell,  M.A.,  Hudson,  f.B.,  Golub,  T.R.,  Turyk,  M.E.,  Hosken,  M.,  Wilbanks,  G.D.,  and 
Laimins,  L.A.  1991.  J  Virol  65:2254-2260. 
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ABNORMAL  lipoprotein  concentrations  in 
plasma  are  commonly  observed  in  the  rela- 
tives of  patients  with  early  coronary  disease, 
yielding  various  patterns  of  hypercholesterol- 
emia and  hypertriglyceridemia  within  families. 
Such  complex  phenotypes  are  thought  to  result 
either  from  the  variable  expression  of  a  single- 
gene  defect  or  the  independent  contribution  of 
two  or  more  genes.  The  genetic  contribution  to 
hyperlipidemia  is  further  blurred  by  the  fact  that 
hormonal  influences,  diet,  and  habitus  exen 
major  influences  on  the  regulation  of  lipid 
metabolism. 

Our  investigation  of  a  large  kindred  yielded 
preliminary  results  in  support  of  the  multiple- 
gene  hypothesis.  In  this  pedigree  one  gene  ac- 
counts for  hypercholesterolemia,  but  hypertri- 
glyceridemia depends  on  other  factors.  Among 
these,  the  gene  encoding  lipoprotein  lipase 
(LPL)  stood  as  a  prime  candidate. 

LPL  plays  a  key  role  in  the  metabolism  of  di- 
etary and  endogenous  fat.  Over  90  percent  of  di- 
etary fat  is  hydrolyzed  by  this  enzyme  in  an  initial 
step  controlling  its  delivery  to  peripheral  tissues. 
The  enzyme  is  secreted  by  mesodermal  cells  such 
as  adipocytes  and  muscle  cells,  but  acts  at  a  dis- 
tance from  its  site  of  synthesis.  After  secretion,  it 
becomes  anchored  to  the  luminal  surface  of  capil- 
laries in  extrahepatic  tissues  by  an  ionic  interac- 
tion with  heparan  sulfate. 

The  enzyme  needs  apolipoprotein  C-II  as  a  co- 
factor  to  stimulate  its  catalytic  activity.  Also,  di- 
merization  is  required.  By  binding  to  the  surface 
of  chylomicrons  and  very  low  density  lipopro- 
teins, LPL  hydrolyzes  triglycerides  of  intestinal  or 
hepatic  origin,  releasing  free  fatty  acids  for  cellu- 
lar uptake  where  they  can  be  used  as  fuel  or  re- 
esterified  for  storage.  Thus  the  enzyme  plays  a 
key  role  in  the  distribution  of  fatty  acids  among 
various  tissues. 

A  host  of  factors  regulate  production  of  LPL, 
with  stimulation  by  insulin,  glucocorticoids, 
adenosine  analogues,  gastrin,  and  pancreozymin, 
and  with  inhibition  by  catecholamines,  dibutyryl 
cAMP,  and  estrogens.  While  both  the  messenger 
RNA  and  the  LPL  gene  have  been  characterized, 


little  is  known  of  the  relationship  between  the 
enzyme's  structure  and  its  functional  domains. 
These  include  a  catalytic  site  and  sites  for  lipid 
binding,  heparin  binding,  and  cofactor  in- 
teraction. Other  functional  domains  remain 
hypothetical. 

Defective  functional  enzyme  is  the  diagnostic 
feature  of  a  rare  recessive  chylomicronemia  syn- 
drome, familial  LPL  deficiency.  This  condition  is 
characterized  by  massive  chylomicronemia  in  the 
fasting  state,  episodes  of  abdominal  pain,  recur- 
rent acute  pancreatitis,  and  eruptive  xanthomas. 
Deficiency  of  the  enzyme  can  be  demonstrated  in 
adipose  tissue  or  in  plasma  after  injected  heparin 
induces  its  release. 

The  heterozygous  state  for  LPL  deficiency,  by 
contrast,  remains  poorly  characterized.  Various 
reports  document  either  normal  lipids  or  moder- 
ate hypertriglyceridemia,  and/or  hypercholester- 
olemia, in  relatives  of  deficient  subjects.  Could 
the  heterozygous  state,  much  more  frequent  than 
the  homozygous,  account  for  some  form  of  the 
moderate  to  severe  hypertriglyceridemia  often 
noted  in  various  clinical  contexts? 

We  were  able  to  address  this  issue  by  investi- 
gating the  relatives  of  a  subject  with  classical  LPL 
deficiency.  After  identification  of  the  molecular 
defect  present  in  the  homozygous  state  in  the 
proband,  and  after  demonstration  of  its  func- 
tional significance  through  in  vitro  mutagenesis 
and  expression,  we  determined  carrier  status 
with  respect  to  this  mutation  among  126  relatives 
of  the  patient.  We  contrasted  clinical  and  bio- 
chemical parameters  collected  on  these  subjects. 

Our  analysis  revealed  that  the  heterozygous 
state  for  this  mutation  indeed  accounted  for  a 
common  form  of  hypertriglyceridemia,  accompa- 
nied by  a  sharp  reduction  of  high-density  lipo- 
protein cholesterol  and  subnormal  levels  of  low- 
density  lipoprotein  cholesterol.  These  effects, 
however,  were  manifest  only  in  subjects  over  40. 
Our  current  hypothesis  is  that  the  heterozygous 
state  imparts  a  latent,  partial  deficiency  in  the 
clearance  of  triglyceride-rich  lipoproteins, 
which  becomes  manifest  when  triggered  by  other 
factors  genetic  or  environmental. 
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As  a  consequence  of  these  findings,  LPL  has  be- 
come a  major  focus  of  research  in  our  laboratory. 
We  have  initiated  a  systematic  genetic  and  bio- 
chemical investigation  of  LPL  in  classical  defi- 
ciency states  and  in  a  variety  of  clinical  contexts 
that  feature  hypertriglyceridemia,  including  coro- 
nary heart  disease,  essential  hypertension,  dia- 
betes, acute  pancreatitis,  and  pregnancy-induced 
hypertension  of  diabetes. 

The  identification  of  LPL  mutations  could  clar- 
ify the  pathophysiologic  significance  of  hyper- 
triglyceridemia in  such  conditions.  Of  yet  greater 
interest  to  us,  identification  of  naturally  occur- 
ring mutations  that  impart  a  deficiency  of  func- 
tional enzyme  will  provide  the  basis  for  in-depth 
investigations  of  the  functional  domains  involved 
and  the  structure-function  relationships  of  LPL. 

Our  search  has  already  led  to  the  identification 
of  multiple  new  mutations  of  the  LPL  gene.  For 
each  of  those  leading  to  amino  acid  substitutions, 
in  vitro  expression  experiments  have  confirmed 
that  the  mutation  produces  immunoreactive  but 
inactive  enzyme.  Most  amino  acid  substitutions 
that  we  or  others  have  so  far  identified  cluster  in  a 
region  that  appears  to  harbor  the  catalytic  domain 
of  the  enzyme. 

This  conclusion  can  be  inferred  from  our  re- 
cent report  of  the  three-dimensional  structure  of 
a  related  lipolytic  enzyme,  pancreatic  lipase.  We 
predicted  the  identity  of  the  three  key  amino 


acids  defining  the  catalytic  triad  of  LPL.  System- 
atic amino  acid  substitutions  were  generated  by 
in  vitro  mutagenesis  at  one  site  and  expressed  in 
cultured  cells,  in  order  to  probe  what  aspect  of 
the  structure  was  altered  by  a  naturally  occurring 
variant.  All  substitutions  led  to  the  recovery  of  an 
inactive  enzyme,  suggesting  that  none  could  sub- 
stitute for  a  glycine  strategically  located  with  re- 
spect to  the  catalytic  site. 

In  further  experiments  in  progress,  we  are 
probing  other  putative  functional  domains  of  the 
enzyme,  particularly  those  regions  that  may  be 
involved  either  with  cofactor  interaction  or  with 
heparan  sulfate. 

We  have  extended  our  investigation  of  molecu- 
lar variation  at  the  LPL  locus  by  searching  system- 
atically for  mutations  of  the  gene  among  more 
than  1 30  unrelated  individuals  presenting  a  com- 
mon form  of  hypertriglyceridemia  in  association 
with  premature  coronary  artery  disease,  essential 
hypertension,  hyperlipidemia,  or  pancreatitis  in 
pregnancy.  A  variety  of  molecular  variants  have 
been  identified  on  the  basis  of  altered  electropho- 
retic  mobilities  and  are  currently  being  charac- 
terized by  DNA  sequencing.  We  already  know 
that  a  number  of  common  silent  mutations  ac- 
count for  a  significant  proportion  of  the  variants 
observed.  However,  conclusions  with  respect  to 
the  relative  contribution  of  LPL  defects  to  the 
pathogenesis  of  hypertriglyceridemia  cannot  be 
made  at  this  stage  of  our  ongoing  study. 
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Two  roles  of  the  cellular  chaperone  protein  GRP78-BiP  in  the  endoplasmic 
reticulum  (ER).  This  protein,  a  member  of  the  stress  (heat-shock)  family,  is 
a  resident  component  of  the  ER  and  has  peptide- binding  activity.  The  top 
section  illustrates  the  specific  and  transient  association  of  GRP78-BiP  (dot- 
ted oval)  with  a  viral  glycoprotein,  hemagglutinin- neuraminidase,  during 
its  synthesis  and  folding  in  the  ER.  These  entities  provide  an  interesting 
model  system,  as  GRP78-BiP  is  known  to  be  released  from  the  glycoprotein 
at  a  point  in  time  prior  to  oligomerization.  The  bottom  section  illustrates  the 
more  stable  association  of  GRP78-BiP  with  malfolded  integral  membrane 
proteins  in  the  ER.  If  the  protein  contains  a  lesion  that  prevents  native  fold- 
ing, then  GRP78-BiP  remains  associated  with  it  and  normal  oligomeriza- 
tion does  not  occur. 
Research  of  Robert  A.  Lamb. 
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ANIMAL  viruses  provide  a  unique  tool  w^ith 
which  to  increase  understanding  of  the  com- 
plex biochemical  processes  involved  in  the  bio- 
synthesis and  maintenance  of  eukaryotic  cells. 
Our  laboratory  is  investigating  the  molecular 
structure  and  the  mechanism  of  replication  of 
two  enveloped  viruses,  influenza  virus  and  the 
paramyxovirus  SV5 . 

Influenza  virus  causes  important  diseases  in 
humans  and  animals.  It  has  tremendous  socioeco- 
nomic consequences,  because  influenza  contin- 
ues to  occur  in  regular  epidemics  and  occasional 
pandemics  and  is  a  leading  cause  of  morbidity 
and  mortality.  Paramyxoviruses  cause  many  bio- 
logically and  economically  important  diseases. 
Among  these  viruses  are  parainfluenza  types  1-5, 
mumps,  measles,  canine  distemper,  Newcastle 
disease  of  chickens,  and  rinderpest  of  cattle,  as 
well  as  SV5. 

We  are  elucidating  the  wide  range  of  mecha- 
nisms that  these  RNA  viruses  use  to  maximize  the 
encoded  proteins  in  their  compact  genomes.  We 
have  identified  overlapping  reading  frames, 
splicing  of  mRNAs,  the  use  of  bicistronic  mRNAs, 
transcriptional  stuttering  to  add  nontemplated 
nucleotides  to  an  RNA  transcript  (yielding  a  sepa- 
rate mRNA) ,  and  a  coupled  stop-start  translation 
of  tandem  cistrons. 

Influenza  virus  and  paramyxoviruses  (SV5) 
were  selected  for  study  not  only  because  of  their 
importance  as  the  causative  agents  of  major  dis- 
eases but  also  because  they  provide  excellent 
models  for  examining  various  properties  of  inte- 
gral membrane  proteins.  Since  these  membrane 
proteins  constitute  the  viruses'  major  antigenic 
determinants,  knowledge  about  the  structure  of 
these  proteins  should  enhance  our  understanding 
of  their  ability  to  act  as  immunological  targets, 
thus  aiding  in  developing  rationally  designed 
therapeutic  agents  and  new  means  of  vaccination. 

We  are  investigating  the  mechanism  by  which 
integral  membrane  proteins  are  transported  to 
the  cell  surface  in  the  exocytotic  pathway  and  are 
internalized  from  the  surface  by  the  endocytotic 
pathways.  Our  model  systems  include  the  seven 
integral  membrane  proteins  encoded  by  influ- 


enza virus  and  SV5,  three  of  which  were  discov- 
ered in  our  laboratory.  These  prototype  mem- 
brane proteins,  grouped  as  types  I,  II,  and  III, 
provide  a  diverse  array  of  viral  proteins  that  span 
the  cell  membrane  once. 

Intracellular  Transport  of  Glycoproteins 

We  are  determining  how  polypeptides  are  ini- 
tially inserted  into  the  endoplasmic  reticulum 
and  what  signals  are  necessary  for  the  proteins  to 
interact  with  the  lipid  bilayer.  To  elucidate  the 
rules  for  protein  orientation  in  the  bilayer  is  a 
prime  objective.  One  of  the  major  factors  is  the 
presence  of  positively  charged  residues  flanking 
the  hydrophobic  membrane-spanning  domain  to 
retain  a  region  of  the  protein  in  the  cell  cyto- 
plasm. We  have  also  been  focusing  on  the  factors 
and  signals  needed  to  fold  the  primary  polypep- 
tide chain  once  it  has  been  translocated  across  the 
membrane  of  the  endoplasmic  reticulum. 

The  cellular  glucose-regulated  protein  GRP78- 
BiP  is  a  member  of  the  HSP70  stress  family  of 
gene  products  and  a  resident  component  of  the 
endoplasmic  reticulum,  where  it  is  thought  to 
play  a  role  in  the  folding  and  oligomerization 
of  secretory  and  membrane-bound  proteins. 
GRP78-BiP  also  binds  to  malfolded  proteins,  and 
this  may  be  one  mechanism  for  preventing  their 
intracellular  transport.  The  SV5  hemagglutinin- 
neuraminidase  (HN)  glycoprotein  during  its  fold- 
ing specifically  and  transiently  associates  with 
GRP78-BiP.  The  fact  that  this  complex  formation 
can  only  be  detected  before  oligomerization  of 
the  immature  HN  molecules  forms  the  native  tet- 
ramer  suggests  that  GRP78-BiP  acts  as  a  chaper- 
one  to  promote  correct  folding  of  the  molecule. 
Paramyxovirus  infection  of  cells  causes  a  tran- 
scriptional induction  of  GRP78-BiP  mRNA,  and 
our  studies  indicate  that  the  flux  of  HN  through 
the  endoplasmic  reticulum,  which  requires 
GRP78-BiP  molecules  for  its  maturation,  causes  a 
feedback  that  increases  GRP78-BiP  transcription. 

To  attempt  to  understand  the  nature  of  the  spec- 
ificity of  GRP78-BiP  with  a  protein,  we  con- 
structed various  altered  HN  molecules.  The  data 
indicate  that  HN  contains  more  than  one  domain 
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capable  of  interacting  with  GRP78-BiP  and  that, 
without  the  binding  of  GRP78-BiP  to  the  altered 
HN  molecules,  an  increase  in  the  accumulation 
of  GRP78-BiP  mRNA  does  not  occur.  Thus  the 
transduction  of  the  signal  from  the  endoplasmic 
reticulum  to  the  nucleus  minimally  requires  the 
binding  of  GRPTS-BiP  to  protein. 

Internalization  and  Degradation 
of  Glycoproteins 

The  SV5  HN  glycoprotein  is  internalized  exten- 
sively from  the  virus-infected  cell  surface  and  de- 
graded in  lysosomes.  We  are  intensively  examin- 
ing the  mechanism  of  internalization.  This  is  of 
considerable  interest  because  HN  lacks  an  aro- 
matic amino  acid  in  its  cytoplasmic  tail  that  has 
been  found  necessary'  for  the  internalization  of 
several  well-characterized  receptor  molecules 
via  the  clathrin-coated  vesicle  pathway.  Examina- 
tion of  chimeric  molecules  constructed  between 
HN  and  another  t>'pe  II  integral  membrane  pro- 
tein that  is  not  internalized,  influenza  virus  neur- 
aminidase, suggests  that  the  HN  transmembrane 
domain  signals  internalization  from  the  cell  sur- 
face and  species  targeting  to  lysosomes. 

Virus  Cation  Channels 

The  influenza  virus  M2  protein  is  a  small 
(97-residue)  type  III  integral  membrane  protein 
that  forms  a  disulfide-linked  tetramer.  Circum- 
stantial evidence  based  on  the  virus's  sensitivity 
to  the  drug  amantadine  hydrochloride,  the  cou- 
pling of  antiviral  action  to  the  M2  transmembrane 
domain,  and  the  premature  acid-induced  confor- 
mational change  in  the  viral  hemagglutinin  in  the 
presence  of  the  drug  has  led  to  the  suggestion 
that  M2  is  an  ion  channel  that  alters  the  intracellu- 
lar pH  of  the  trans-Golgi  network.  Our  direct  ex- 
periments (in  collaboration  with  Lawrence 
Pinto)  of  injecting  M2  mRNA  into  Xenopus  oo- 
cytes and  using  a  two-electrode  voltage-clamp  ap- 
paratus to  measure  surface  currents  indicate  that, 
in  the  presence  of  M2,  an  amantadine  hydrochlo- 


ride-sensitive  cation  channel  conductance  can 
be  measured  that  has  the  characteristics  of  an  H 
current,  an  electrical  response  previously  ob- 
served in  heart  and  neuronal  cells. 

We  are  beginning  a  structure-function  analysis 
of  this  cation-specific  channel  to  identify  the  resi- 
dues involved  in  drug  sensitivity  and  cation  speci- 
ficity. We  are  also  investigating  the  possibility 
that  the  influenza  virus  NB  glycoprotein,  the 
paramyxovirus  SH  protein,  and  the  human  immu- 
nodeficiency virus  type  I  vpu  integral  membrane 
proteins,  all  of  which  have  M2-like  structures, 
have  cation  channel  activity. 

Virus  Fusion  Activity 

We  are  also  interested  in  the  manner  in  which 
the  F  protein  of  paramyxoviruses  causes  cell  fu- 
sion. Considerable  evidence  has  implicated  the 
hydrophobic  amino  terminus  of  the  Fj  subunit 
(FRED)  as  being  directly  involved  in  membrane 
fusion,  and  we  have  shown  that  the  FRED  can  in- 
teract stably  with  a  lipid  bilayer.  Comparison  of 
the  amino  acid  sequences  of  many  paramyxovi- 
ruses has  revealed  a  high  degree  of  amino  acid 
identity  in  the  FRED.  To  investigate  the  role  of  the 
conserved  residues  in  mediating  cell  fusion,  we 
have  constructed  many  mutant  F  genes  in  which 
the  invariant  residue  of  FRED  has  been  changed, 
in  most  cases  for  a  similar  residue. 

Unexpectedly,  in  view  of  the  remarkable 
amino  acid  conservation  between  paramyxovi- 
ruses, all  of  the  mutant  fusion  proteins  that  are 
expressed  at  the  cell  surface  are  capable  of  caus- 
ing cell  fusion,  which  suggests  that  the  residues 
serve  some  other  function  of  F  protein  activity. 
Interestingly,  when  the  highly  conserved  glycine 
residues  in  the  FRED  are  changed  to  alanine  resi- 
dues— a  change  that  can  be  predicted  to  make 
the  FRED  more  a-helical  in  nature — the  specific 
activity  of  the  F  protein  in  causing  cell-cell  fusion 
increases  by  several  orders  of  magnitude.  Thus 
we  have  engineered  an  even  more  potent  fuso- 
genic  protein  than  the  natural  F. 
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THYROID  hormone  is  critically  important  in 
the  development  and  regulation  of  metabolic 
processes.  In  humans  the  hormone  is  especially 
necessary  for  normal  development  of  the  central 
nervous  system.  Its  importance  is  exemplified  by 
the  irreversible  mental  retardation  that  ensues  if 
congenital  hypothyroidism  is  not  recognized  and 
treated  within  the  first  few  months  of  life.  In  in- 
fancy and  childhood,  thyroid  hormone  is  abso- 
lutely required  for  normal  growth. 

Our  laboratory  is  interested  in  learning  how 
this  pluripotent  hormone  produces  its  effects. 
The  synthesis  of  thyroid  hormone,  thyroxine,  is  a 
tightly  regulated  process  requiring  oxidation  of 
iodine  and  its  incorporation  into  thyroglobulin,  a 
thyroxine  precursor.  This  protein  is  present 
in  large  quantities  in  the  thyroid  gland,  and 
under  regulation  by  pituitary  thyroid-stimulat- 
ing hormone,  thyroxine  is  released  into  the 
bloodstream. 

The  thyroxine  molecule  has  four  iodine  atoms 
distributed  over  its  two-ring  structure.  In  this 
form,  the  hormone  is  relatively  inactive.  To  be- 
come fully  potent,  thyroid  hormone  must  be 
deiodinated  to  form  3,5,3 -triiodothyronine.  Dur- 
ing this  process,  a  specific  atom  of  iodine  is  lost 
from  the  thyroxine  molecule,  and  it  is  therefore 
termed  T3. 

The  process  of  thyroid  hormone  activation  is 
thus  one  of  great  general  interest.  The  enzymes 
that  carry  out  this  reaction  are  termed  deiodin- 
ases,  since  they  cause  the  removal  of  an  iodine 
atom.  There  are  two  major  deiodinases  in  the 
body.  One  is  present  in  the  liver  and  kidney  and 
acts  on  thyroxine  to  form  T3,  which  then  reenters 
the  bloodstream.  By  this  route  T,  is  carried  to 
other  organs  in  the  body,  such  as  the  skeletal 
muscle  and  heart,  which  do  not  have  the  capacity 
to  produce  it.  The  enzyme  requires  an  intracellu- 
lar cofactor  for  its  function,  which  in  turn  re- 
quires adequate  nutrition  and  the  absence  of 
stress  for  its  production.  In  persons  who  have 
fasted  or  become  ill,  conversion  of  thyroxine  to 
T3  by  this  enzyme  is  markedly  slowed,  allowing 
thyroid  hormone  to  fall  to  conserve  energy. 


A  major  focus  of  our  laboratory  has  been  to  iso- 
late and  clone  this  deiodinase.  The  purification 
of  this  membrane-bound  protein  is  made  difficult 
by  its  insolubility  in  water  and  its  inactivation  by 
oxygen.  To  circumvent  these  difficulties,  we 
have  used  a  sensitive  measurement  for  the  en- 
zyme's activity.  We  quantitate  the  release  of  radio- 
active iodine  from  an  artificially  synthesized  ana- 
logue of  thyroxine.  This  analogue  is  an  even 
better  substrate  for  the  enzyme's  deiodination 
process  than  is  thyroxine  itself. 

Having  a  sensitive  assay  has  allowed  us  to  de- 
vise a  strategy  for  cloning  the  enzyme  without 
purifying  it.  For  this  purpose  we  have  adopted 
the  technique  called  expression  cloning.  In  this 
process,  one  tries  to  identify  sequences  of  DNA 
coding  for  a  protein  by  measuring  the  capacity  of 
the  DNA  to  program  a  cell  to  make  the  protein. 
The  strategy  is  labor  intensive,  but  it  assures  an 
accurate  sequence  if  successful.  One  common 
approach  to  expression  cloning  is  to  employ  the 
egg  (oocyte)  of  the  African  clawed  toad  {Xeno- 
pus)  as  a  highly  efficient  factory  for  making  pro- 
teins from  the  messenger  RNA  (mRNA)  encoded 
by  the  DNA  of  interest.  If  this  system  is  to  be  suc- 
cessful, an  active  form  of  the  protein  must  be  pro- 
duced in  the  egg  after  it  is  injected  with  the 
mRNA  from  the  tissue  that  expresses  the  enzyme. 
In  the  case  of  the  deiodinase,  injection  of  mRNA 
into  Xenopus  oocytes  led  to  the  appearance  of 
the  deiodinase  in  the  egg  proteins,  indicating  the 
feasibility  of  the  approach. 

To  obtain  the  optimal  starting  material  for  pre- 
paring artificial  DNA  sequences  to  code  for  the 
deiodinase,  we  analyzed  the  effect  of  altering  an 
animal's  thyroid  hormone  status  on  the  activity  of 
this  enzyme.  If  an  animal  is  made  hyperthyroid — 
i.e.,  given  large  quantities  of  thyroid  hormone — 
the  activity  of  the  enzyme  is  increased  and  the 
amount  of  mRNA  coding  for  the  enzyme  in  liver 
and  kidney  is  increased.  Conversely,  when  an  ani- 
mal's capacity  to  synthesize  thyroid  hormone  is 
blocked,  the  mRNA  levels  for  the  enzyme  fall. 
Thus  we  have  chosen  hyperthyroid  animals  to  use 
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as  the  starting  source  for  preparing  a  DNA  library 
for  this  purpose. 

The  preparation  of  cDNA  from  the  liver  of  the 
hyperthyroid  rat  was  size  selected  because  stud- 
ies showed  that  it  was  approximately  2,000  nu- 
cleotides in  length.  Many  cDNA  clones  of  this  size 
were  separated  into  pools  of  approximately 
2^000  each.  Then  the  normal  process  that  is 
carried  out  in  the  cell  was  reproduced  in  the  test 
tube.  Messenger  RNA  was  synthesized  from  each 
of  the  pools  of  these  cDNA  clones,  injected  into 
Xenopus  oocytes,  and  the  oocytes  were  analyzed 
three  days  later  for  the  deiodinase  activity. 

One  pool  produced  a  significant  quantity  of 
deiodinase  and  was  therefore  subdivided  further. 
Pools  of  approximately  100  cDNAs  were  again 
studied  by  transcribing  mRNA,  and  this  was  in- 
jected in  turn  into  the  Xenopus.  By  this  tech- 
nique a  positive  pool  of  about  100  of  these 
clones  was  isolated.  These  were  diluted  and 
grown  up  as  single  colonies  in  a  matrix  arrange- 
ment. The  RNA  was  prepared  from  pools  of  these 
clones  from  rows  and  columns,  and  an  intersec- 
tion was  identified  that  corresponded  to  the 
clone  that  coded  for  the  deiodinase.  As  antici- 
pated, the  clone  was  approximately  2,100  nu- 
cleotides in  base  pairs  in  length.  Surprisingly, 
there  was  a  group  of  three  nucleotides,  TGA  (re- 
ferred to  as  a  codon),  in  the  clone  that  corre- 
sponded to  the  signal  for  the  protein  synthesis 
machinery  to  stop.  This  codon,  however,  did  not 
stop  the  synthesis  of  protein  for  this  particular 
mRNA.  We  were  able  to  show  that  in  the  case  of 
this  protein  the  stop  codon  TGA  (UGA  in  the 
mRNA)  coded  for  an  extremely  rare  amino  acid, 
selenocysteine.  This  amino  acid  is  similar  to  the 
more  common  amino  acid  cysteine,  except  that 
selenium  replaces  the  sulfur  atom.  Selenium  is  in 
the  same  category  in  the  periodic  table  of  chemi- 


cal elements  as  is  sulfur,  but  its  chemistry  differs 
from  that  of  sulfur  in  a  number  of  important  re- 
spects. Only  one  other  enzyme  previously  identi- 
fied in  animals  and  plants  contains  a  selenocys- 
teine. The  role  of  this  enzyme — glutathione 
peroxidase — is  to  reduce  the  peroxide  that  accu- 
mulates in  various  cells  and  damages  cell 
membranes. 

In  the  deiodinase,  the  selenium  atom  donates 
the  electron  that  permits  the  removal  of  iodine 
from  the  thyroxine  molecule  to  activate  it.  The 
chemical  nature  of  the  reaction — reduction  of 
substrate  and  oxidation  of  the  enzyme — is  well 
known  in  cellular  biochemistry.  We  were  able  to 
prove  that  this  amino  acid  containing  selenium  is 
essential  for  the  normal  deiodination  process, 
which  explains  the  recent  observation  that  a  defi- 
ciency of  selenium  (which  is  present  as  a  trace 
element  in  the  eanh's  crust)  causes  hypothyroid- 
ism in  experimental  animals.  We  conclude  that 
selenium  is  required  for  normal  thyroid  hormone 
action. 

Our  current  research  is  directed  at  understand- 
ing why  this  particular  mRNA  is  able  to  permit 
the  suppression  of  the  normal  stop  codon  and 
therefore  produce  this  unique  enzyme.  We  are 
also  investigating  the  biochemistry  of  the  normal 
and  mutated  deiodinase  molecule,  so  that  we  may 
understand  more  about  its  physiological  function 
and  deiodination  mechanism. 

Our  laboratory  continues  to  investigate  the 
function  of  the  nuclear  protein  to  which  T,  binds 
and  the  DNA  sequences  that  recognize  it.  With 
these  two  eflforts,  we  hope  to  have  a  complete 
picture  of  how  thyroid  hormone  produces  its  ef- 
fects, beginning  with  the  prohormone  thyroxine 
and  ending  with  the  actual  molecular  mechanism 
by  which  thyroid  hormone-responsive  genes  are 
regulated. 
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THE  growth  of  cells  within  an  organism  is  far 
too  delicate  and  important  a  process  to  be  left 
to  chance.  Rather,  as  with  all  biologic  processes, 
it  is  subject  to  a  stringent  set  of  rules  that  are 
programmed  into  the  genetic  makeup  of  the  or- 
ganism. Genes  form  the  basis  for  controlling 
growth  within  an  organism,  setting  the  parame- 
ters that  allow  the  liver  to  take  the  shape  it  does 
or  the  kidney  to  assume  its  particular  size  and 
function.  Genes  establish  the  rules  that  ensure 
that  an  organ  grows  in  an  orderly  fashion  and 
reaches  a  prescribed  and  limited  size.  Growth 
can  thus  proceed  so  far  but  no  farther,  attaining  a 
programmed  equilibrium  compatible  with  life. 

Cancer  as  a  Disease  of  Genes 

Cancer  is  a  profound  disorder  of  cell  growth. 
The  delicate  balance  established  by  a  genetically 
encoded  program  is  overturned.  Instead  of  reach- 
ing an  equilibrium,  a  cancer  cell  no  longer  re- 
sponds to  signals  that  would  limit  its  ability  to 
divide.  It  is  out  of  control,  and  its  unlimited 
growth  has  profoundly  dangerous  consequences 
for  the  organism. 

Over  the  past  decade  or  so,  it  has  become  in- 
creasingly clear  that  many  cancers  can  be  ac- 
counted for,  at  least  in  part,  by  damage  occurring 
to  genes  that  encode  the  rules  for  control  of  cell 
growth.  Genetic  damage,  or  mutation,  can  inacti- 
vate a  gene  or  cause  it  to  function  at  the  wrong 
time  or  at  the  wrong  place  or,  indeed,  even  cause 
it  to  make  the  wrong  product.  The  set  of  genes 
whose  damage  (or  mutation)  can  give  rise  to 
cancer  are  often  just  those  genes  that  normally 
regulate  cell  growth.  Geneticists  refer  to  the  dam- 
aged genes  that  contribute  to  the  development  of 
malignancy  as  oncogenes  (from  the  Greek  ovKoa, 
or  tumor) . 

Transgenic  Mice  and  the  Genetic  Basis 
of  Cancer 

For  some  time  my  colleagues  and  I  have  been 
interested  in  genes  that  control  cell  growth  and, 
in  particular,  the  control  processes  that  operate 
in  the  living  organism.  Our  work  has  been  consid- 
erably advanced  by  the  technique  of  introducing 


active  oncogenes  into  the  hereditary  makeup  of 
special  strains  of  laboratory  mice.  These  "trans- 
genic" mice  carry  oncogenes  created  in  the  labo- 
ratory, pass  these  cancer-causing  genes  on  to  off- 
spring, and  therewith  transmit  a  strong  tendency 
to  develop  cancer.  Thus,  in  many  ways,  trans- 
genic mice  become  useful  models  of  human  ma- 
lignancy. For  example,  we  have  designed  specific 
mice  that  develop  cancer  of  the  breast  and  others 
that  develop  cancer  of  the  blood  cells — specific 
leukemias  and  lymphomas.  These  experiments 
have  taught  us  that  some  cancers  can  be  caused  by 
certain  specific  oncogenes  and  that  many,  but  not 
necessarily  all,  cancers  are  the  result  of  a  collabo- 
ration between  two  or  more  oncogenes.  This  sug- 
gests that  cancer  is  often  a  "multihit"  process,  a 
process  that  requires  several  activating  events. 

Host  Defenses  Against  Cancer 

Although  transgenic  mice  are  very  useful  in  ana- 
lyzing the  action  of  oncogenes,  they  are  also  use- 
ful in  exploring  the  host  defense  mechanism  that 
can  be  mobilized  to  prevent  the  development  and 
spread  of  cancer.  The  immune  system  is  one  of 
the  organism's  chief  instruments  against  the 
spread  of  infectious  disease  and  for  the  rejection 
of  foreign  tissues.  For  example,  the  body's  im- 
mune system  must  be  neutralized  to  accomplish 
effective  heart  or  kidney  transplants.  The  role 
that  the  immune  system  plays  in  tumor  rejection 
is  at  best  poorly  understood,  but  important  recent 
discoveries  in  the  field  of  immunology  indicate 
that  the  immune  response  is  regulated  by  an  array 
of  hormone-like  agents  called  lymphokines. 
These  are  released  by  cells  of  the  immune  system 
to  influence  the  growth  and  development  of 
other  cells.  They  can  act  as  growth  factors  signal- 
ing target  cells  to  divide,  or  they  can  trigger  ge- 
netic programs  that  cause  a  target  cell  to  change, 
mature,  or  even  assume  a  new  function. 

We  are  particularly  interested  in  how  lympho- 
kines influence  the  host's  response  to  cancer.  In 
the  course  of  this  work,  we  have  focused  on  the 
action  of  two  particularly  interesting  lympho- 
kines, IL-4  and  IL-7.  (These  biologic  response- 
modifying  agents  are  frequently  given  the  name 
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IL,  stemming  from  a  time  when  they  were  called 
"interleukins,"  or  agents  that  mediated  signals 
between  white  blood  cells,  or  leukocytes.)  IL-4 
has  several  cell-signaling  functions  that  have 
been  recognized  by  studies  in  tissue  culture.  Al- 
though its  precise  role  in  the  body  has  not  been 
proved,  IL-4  is  suspected  of  playing  a  role  in  mod- 
ulating the  immune  response.  IL-7  is  thought  to 
have  a  major  role  in  the  orderly  development  of 
antibody-producing  cells,  the  so-called  B  cells. 

To  learn  what  role,  if  any,  these  interleukins 
play  in  modulating  the  body's  response  to  cancer, 
we  devised  a  powerful  test  for  their  function  as 
antitumor  agents.  We  planted  the  gene  for  IL-4  in 
a  tumor  cell  that  allowed  its  high-level  produc- 
tion. The  IL-4-producing  tumor  grew  well  in  a 
tissue  culture  dish,  suggesting  that  IL-4  did  not 
directly  block  growth  of  the  tumor  cell.  How- 
ever, when  the  IL-4-producing  tumor  was  trans- 
planted to  a  host  mouse,  the  tumor  growth  was 
completely  blocked.  Evidently  IL-4  was  eliciting 
a  host  defense  mechanism  that  prevented  tumor 
growth.  We  have  extended  this  observation  to 
many  different  types  of  tumor  cells  under  many 
conditions  and  have  obtained  similarly  dramatic 
results. 

Further  studies  of  the  antitumor  action  of  IL-4 
have  allowed  us  to  identify  two  particular  cell 
types  that  may  be  mediating  this  antitumor  effect. 
One  of  these  is  the  eosinophil,  a  cell  that  is  in- 
volved in  many  allergic  responses.  The  second 
candidate  mediator  cell  is  the  macrophage,  a  scav- 
enger cell  concerned  with  many  cell-killing 
functions.  Our  most  recent  work  has  focused  on 
the  particularly  esoteric  eosinophil,  a  cell 
thought  to  be  involved  in  defense  against  para- 
sitic diseases  and  in  certain  allergic  disorders. 
Studies  using  specific  reagents  now  convince  us 
that  this  peculiar  cell  is  the  major  mediator  of  the 
antitumor  effect. 

We  would  also  like  to  know  whether  a  small 
increase  in  the  production  of  IL-4  can  influence 


an  organism's  ability  to  develop  or  defend  itself 
against  malignancy.  Accordingly,  we  made  trans- 
genic mice  that  carry  extra  copies  of  the  IL-4  gene 
to  ensure  that  their  lymphocytes  produce  large 
amounts  of  this  agent.  Creating  these  mice 
proved  quite  difficult,  since  high  concentrations 
of  IL-4  have  an  extremely  disruptive  influence  on 
the  murine  immune  system.  We  solved  this  prob- 
lem by  creating  a  graded  set  of  transgenic  mice 
that  yielded  reduced  levels  of  IL-4.  By  means  of 
special  regulatory  gene  signals,  we  were  able  to 
create  mice  that  produced  IL-4  in  minimally 
disruptive  amounts.  From  these  animals  we  have 
learned  that  IL-4  has  a  profound  influence  on  the 
immune  cells  that  develop  in  the  thymus  gland,  a 
key  organ  in  the  immune  system.  But  we  have  also 
seen  that  this  lymphokine  can  induce  a  serious 
allergic  inflammatory  response,  suggesting  that 
we  will  have  to  deal  with  this  side  effect  before 
considering  the  therapeutic  uses  of  this  agent. 

We  have  expanded  these  studies  to  view  the 
actions  of  other  lymphokines  that,  theoretically, 
might  influence  tumor  occurrence.  Our  prelimi- 
nary studies  indicate  that  IL-7  may  provoke  a  tu- 
mor-inducing response,  clearly  indicating  that 
different  growth  signals  can  have  dramatically 
different  effects  in  the  organism. 

Finally,  although  this  brief  account  cannot 
cover  the  entire  range  of  our  studies,  we  have 
turned  to  a  third  element  (the  so-called  kit  li- 
gand),  which  has  a  profound  influence  on  the 
development  of  cells  during  the  formation  of  the 
embryo.  This  factor  influences  the  development 
of  cells  that  contribute  to  the  formation  of  blood, 
of  pigment-producing  cells,  and  of  eggs  and 
sperm.  In  other  words,  if  a  mouse  cannot  make 
this  element,  it  is  sterile,  albino,  and  has  a  pro- 
found anemia.  Because  we  feel  that  the  develop- 
mental pathway  a  cell  follows  strongly  influences 
its  propensity  to  undergo  a  malignant  change,  we 
are  studying  this  factor  to  learn  how  it  influences 
this  important  process  in  the  mouse. 
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OUR  laboratory  has  focused  on  molecular  ge- 
netic investigations  of  the  enzyme  methyl- 
malonyl  CoA  mutase  (MCM)  and  its  deficiency 
state  in  humans,  methylmalonic  aciduria.  This 
enzyme  is  required  for  the  metabolism  of  propio- 
nate, which  can  be  produced  by  catabolism  of 
certain  amino  acids  or  absorbed  from  fermenta- 
tion products  in  the  intestines. 

We  have  cloned  genes  for  human  and  mouse 
MCM,  identified  mutations  that  give  rise  to  nota- 
ble phenotypes  of  methylmalonic  aciduria,  and 
demonstrated  correction  of  the  metabolic  defect 
in  cultured  cells  from  methylmalonic  aciduria  pa- 
tients by  gene  transfer  of  a  normal  MCM  gene. 
This  last  experiment  is  prescient  of  somatic  gene 
therapy  in  which  patients  with  inborn  errors  of 
metabolism  might  be  treated  by  introducing  a 
normal  gene  into  somatic  cells  to  perform  the 
function  of  the  inherited  mutant  genes. 

Successful  somatic  gene  therapy  involves  more 
than  introducing  a  normal  gene  into  genetically 
defective  cells.  It  will  be  necessary  to  achieve 
proper  compartmentalization  and  regulation  of 
the  recombinant  enzyme,  substrates,  cofactors, 
and  reaction  products;  to  determine  which  or- 
gans and  somatic  cells  represent  the  best  possible 
targets  for  gene  transfer,  how  many  cells  need  to 
be  transformed,  and  how  the  metabolic  capacity 
and  phenotypic  effect  of  reconstituted  cells  may 
be  maximized;  and  to  address  a  variety  of  clinical 
issues  to  ensure  that  clinical  trials  embody  the 
greatest  potential  benefit  and  entail  the  least  pos- 
sible risk.  It  is  also  necessary  to  adhere  to  the 
regulatory  and  review  process  that  has  been  es- 
tablished to  monitor  human  gene  therapy. 

Our  studies  are  directed  at  attaining  an  under- 
standing of  the  structure  and  function  of  MCM;  of 
its  role  in  homeostasis,  nutrition,  and  pathology; 
and  of  the  multifaceted  issues  required  to  pro- 
pose gene  therapy  for  methylmalonic  aciduria.  In 
the  past  year  this  has  involved  basic  research  con- 
cerning MCM  and  gene  transfer  as  well  as  the  es- 
tablishment of  a  clinical  foundation  for  gene  ther- 
apy trials. 

Role  of  Methylmalonyl  CoA  Mutase  in 
Health  and  Disease 

The  clinical  expression  of  MCM  deficiency 


presents  several  enigmas.  Why  is  this  disorder 
lethal?  Why  are  the  symptoms  intermittent?  Why 
are  some  individuals  with  persistent  methylma- 
lonic aciduria  clinically  normal?  We  have  de- 
scribed a  variety  of  mutations  in  the  gene  for 
MCM.  Some  give  rise  to  completely  inactive  en- 
zymes, others  to  enzymes  that  show  limited  activ- 
ity in  the  presence  of  vastly  excessive  cofactor, 
and  an  interesting  enzyme  that  is  nonfunctional 
in  most  cells  but  active  in  heterologous  combina- 
tion with  some  other  mutant  enzymes.  We  will 
study  whether  the  residual  activity  expressed  by 
mutant  genes  correlates  with  the  different  pheno- 
typic forms  of  methylmalonic  aciduria  or 
whether  other  factors  are  involved  in  this 
pleomorphism. 

We  are  studying  propionate  metabolism  in 
various  normal  and  genetically  defective  cells  to 
investigate  the  role  of  MCM  in  homeostasis.  Our 
studies  suggest  that  under  normal  conditions  the 
rate  of  propionate  metabolism  is  not  limited  by 
the  amount  of  the  enzyme.  Barely  detectable  lev- 
els of  MCM  can  sustain  maximal  levels  of  propio- 
nate metabolism,  and  the  addition  of  more  en- 
zyme to  normal  cells  has  no  effect  on  this 
metabolic  activity. 

Interestingly,  the  level  of  propionate  metabo- 
lism in  cultures  containing  few  cells  with  recon- 
stituted enzyme  is  similar  to  that  of  a  population 
of  completely  normal  cells,  indicating  exchange 
of  intermediate  metabolites  between  cells.  The 
exchange  appears  to  require  cell  contact;  it  does 
not  occur  between  cells  separated  by  a  fluid 
phase.  This  suggests  that  gene  therapy  targeting  a 
small  population  of  cells  may  have  a  significant 
phenotypic  impact,  though  therapy  may  have  to 
be  targeted  to  organs  damaged  by  the  accumula- 
tion of  organic  acids,  particularly  the  liver,  rather 
than  more  accessible  sites  such  as  bone  marrow. 

We  have  also  observed  that  the  ability  of  a  colonic 
epithelial  cell  line  (CACO-2)  to  metabolize  propio- 
nate increases  dramatically  as  the  cell  differentiates 
in  culture.  This  suggests  that  propionate  metabolism 
may  be  a  differentiated  function  of  intestinal  epithe- 
lium. There  are  several  reasons  to  think  that  the  in- 
testine may  play  a  critical  role  in  propionate  homeo- 
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stasis  and  that  the  fermentation  and  absorption  of 
propionate  may  be  an  important  factor  regulating 
organic  acid  homeostasis. 

Basic  Methods  for  Gene  Transfer 

We  have  focused  on  the  liver  and  the  intestines 
as  targets  for  somatic  gene  therapy.  In  an  ongoing 
collaboration  with  Savio  Woo  (HHMI,  Baylor  Col- 
lege of  Medicine) ,  we  have  been  involved  in  de- 
veloping a  scheme  for  hepatic  gene  therapy  that 
involves  harvesting  and  cultivating  hepatocytes, 
transducing  these  cells  with  recombinant  retro- 
viral vectors,  and  returning  them  to  the  host  via 
hepatocellular  transplantation. 

In  collaboration  with  Susan  Henning  (Depart- 
ment of  Pediatrics,  Baylor  College  of  Medicine), 
we  have  developed  methods  for  transducing 
genes  into  intestinal  epithelium.  This  involves  in- 
stilling recombinant  retroviruses  into  isolated 
intestinal  segments  in  the  presence  of  agents  to 
increase  infectivity,  reduce  mucus,  and  increase 
the  number  of  dividing  crypt  cells.  These  experi- 
ments demonstrate  that  it  is  possible  to  transduce 
crypt  cells,  though  considerable  work  remains  to 
achieve  more  efficient  transduction. 

Clinical  Foundation  of  Somatic 
Gene  Therapy 

We  have  also  begun  to  establish  a  clinical  foun- 


dation for  somatic  gene  therapy  by  developing  a 
clinical  trial  of  hepatocellular  transplantation  for 
acute  hepatic  failure  in  children.  We  will  use  he- 
patic cells  transduced  with  recombinant  retrovi- 
rus expressing  a  marker  gene  (in  collaboration 
with  Savio  Woo  and  others  at  Baylor  College  of 
Medicine).  Hepatocellular  transplantation,  an  es- 
sential of  proposed  schemes  for  hepatic  gene 
therapy,  has  never  been  attempted  in  humans.  We 
are  establishing  surgical  and  clinical  methods  for 
such  transplantation.  Use  of  marker  genes  will 
facilitate  assessment  of  cellular  engraftment  and 
of  techniques  for  transducing  recombinant  genes 
into  hepatic  cells. 

The  development  of  this  protocol  has  focused 
our  attention  on  the  myriad  clinical  issues  in- 
volved in  human  trials,  including  proper  patient 
selection,  obtaining  meaningful  informed  con- 
sent, ensuring  adequate  follow-up,  adopting 
methods  that  are  approved  for  clinical  use  and 
meet  the  standards  of  quality  control  required  for 
clinical  practice,  satisfying  guidelines  for  con- 
tainment, and  involving  various  health  profes- 
sionals and  the  public  in  this  exciting  therapeutic 
frontier.  Many  of  these  issues  have  become  re- 
search projects  in  themselves  as  we  have  tried  to 
establish  a  formal  basis  for  clinical  trials  of  gene 
therapy. 
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OUR  research  program  is  concerned  with  the 
molecular  properties  and  regulatory  mecha- 
nisms that  control  the  function  of  plasma  mem- 
brane receptors  for  hormones  and  drugs  under 
normal  and  pathological  circumstances.  Recep- 
tors are  the  cellular  macromolecules  with  which 
biologically  active  substances  (e.g.,  hormones, 
drugs,  neurotransmitters,  growth  factors,  viruses, 
lipoproteins)  initially  interact.  Such  receptors 
perform  two  essential  functions:  1)  They  receive 
or  bind  these  biologically  active  substances  at  the 
surface  of  the  cell.  2)  They  transmit  the  sub- 
stance's message  into  the  cell,  thus  influencing 
its  metabolic  activity  and  function. 

We  have  utilized  the  receptors  for  epinephrine 
(adrenaline)  and  related  compounds,  which  are 
generally  termed  adrenergic  receptors,  as  models 
for  the  study  of  receptors.  Such  receptors  are 
found  throughout  the  brain,  heart,  smooth  mus- 
cle cells,  and  most  other  cells  of  the  human  organ- 
ism. There  are  several  distinct  subtypes  of  adren- 
ergic receptors:  aj,  a-^^,  and  /32-  These 
receptors  interact  not  only  with  endogenous  epi- 
nephrine and  norepinephrine  but  with  a  variety 
of  clinically  important  drugs. 

We  have  studied  these  particular  receptors  for 
several  reasons:  they  are  more  widely  distributed 
than  perhaps  any  other  type  of  receptor,  they  are 
exemplary  of  each  of  the  major  biochemical 
pathways  by  which  receptors  are  known  to  signal 
to  the  interior  of  the  cell,  and  they  are  clinically 
and  therapeutically  significant.  They  mediate 
physiological  responses  as  diverse  as  changes  in 
blood  pressure,  changes  in  heart  rate  and  contrac- 
tility, and  alterations  in  the  metabolism  of  glu- 
cose. Drugs  that  interact  with  them  are  among  the 
most  effective  agents  used  to  treat  various  forms 
of  heart  disease,  hypertension,  asthma,  shock, 
and  depression.  Research  on  these  "exemplary" 
receptors  has  important  implications  for  under- 
standing hormone  and  drug  receptor  interactions 
generally  and  the  mechanisms  by  which  they  are 
regulated. 

Our  research  is  focused  on  several  intimately 
related  goals.  First,  we  wish  to  understand,  in  de- 


tailed molecular  terms,  the  biochemical  nature 
of  the  receptors.  This  is  being  accomplished  by 
the  application  of  recently  developed  recombi- 
nant DNA  or  gene-cloning  techniques.  These  per- 
mit isolation  of  the  genes  for  the  receptors, 
which  in  turn  permits  their  complete  primary 
amino  acid  sequences  to  be  deduced. 

We  have  recently  isolated  the  genes  for  all  of 
the  known  adrenergic  receptors  {a^,  a^,  jSj,  182). 
as  well  as  a  number  of  closely  related  receptors, 
and  determined  their  complete  sequences.  Re- 
markably, the  structures  of  these  receptors  are 
similar  to  each  other  and  to  that  of  the  visual  light 
receptor,  rhodopsin.  This  insight  is  helping  to 
clarify  the  general  mechanisms  by  which  signals 
as  divergent  as  a  photon  of  light  and  a  drug  mole- 
cule like  epinephrine  elicit  their  characteristic 
biochemical  and  physiologic  responses.  By  vary- 
ing the  structure  of  the  gene  and  hence  the  recep- 
tor protein  that  it  produces,  we  can  define  which 
structural  features  of  the  receptor  molecule  de- 
termine its  characteristic  functions. 

An  unexpected  result  of  the  cloning  of  the  four 
main  types  of  adrenergic  receptor  genes  was  the 
discovery  of  the  genes  for  several  novel  subtypes 
of  adrenergic  receptors  not  previously  known  to 
exist.  Knowledge  of  these  new  receptors  opens 
the  way  for  the  development  of  drugs  with 
greater  selectivity  and  fewer  side  effects.  Such 
drugs  might  conceivably  have  applications  in  the 
treatment  of  such  disparate  diseases  as  hyperten- 
sion, asthma,  diabetes,  and  prostatism,  or  even  in 
novel  anesthetic  agents. 

A  second  current  research  goal  is  to  elucidate 
the  patterns  of  receptor  regulation.  One  of  the 
most  important  insights  to  come  from  our  studies 
of  receptors  is  that  their  properties  are  not  fixed. 
Rather,  the  properties  of  the  receptors  are  in- 
fluenced by  the  hormones  and  drugs  with  which 
they  interact,  as  well  as  by  a  variety  of  disease 
states. 

There  are  important  clinical  implications  of 
the  ever-changing  nature  of  the  receptors.  For  ex- 
ample, this  provides  a  basis  for  beginning  to  un- 
derstand the  phenomenon  of  drug  tolerance  or 
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desensitization — the  diminishing  effect  of  drugs 
over  time.  This  phenomenon  markedly  compro- 
mises the  therapeutic  efl&cacy  of  epinephrine  and 
many  other  drugs.  When  drugs  Hke  epinephrine 
combine  with  their  receptors,  they  not  only  stim- 
ulate them  but  also  produce  changes  that  impair 
their  function,  thus  leading  to  desensitization. 
These  changes  involve  an  actual  loss  of  receptors 
from  the  cell  surface  (they  move  inside  the  cell 
where  they  cannot  function)  and  a  chemical 
change  of  those  receptors  remaining  at  the  cell 
surface  so  that  they  function  less  effectively.  With 
fewer  functioning  receptors  present  at  their  sur- 
face, cells  are  less  able  to  respond  to  drugs  or 
hormones.  A  wide  variety  of  other  circumstances 
that  regulate  receptor  properties  were  also  iden- 
tified, including  aging,  congestive  heart  failure, 
(S-blocker  therapy,  thyroid  hormones,  steroid 
hormones,  and  neoplasia. 

Our  recent  research  is  increasing  our  under- 
standing, in  molecular  terms,  of  how  the  recep- 
tors become  functionally  desensitized.  We  have 


recently  discovered  a  new  enzyme,  /^-adrenergic 
receptor  kinase  (/3ARK),  which  modifies  the 
structure  of  the  receptor  by  introducing  a  phos- 
phate group.  This  modification  may  provide  the 
basis  for  receptor  desensitization.  We  are 
currently  studying  how  such  phosphorylation  of 
the  receptors  by  /3ARK  and  related  enzymes  modi- 
fies their  function. 

The  implications  of  such  fundamental  research 
on  receptors  for  clinical  medicine  are  profound. 
Elucidation  of  the  detailed  structure  of  the  recep- 
tors will  allow  the  precise  design  of  more  potent 
and  specific  drugs.  Unraveling  of  the  molecular 
basis  of  desensitization  will  allow  the  develop- 
ment of  strategies  for  interdicting  the  basic  reac- 
tions that  lead  to  loss  of  hormone  and  drug  effect. 
An  example  is  the  design  of  specific  enzyme  in- 
hibitors for  /3ARK  that  could  block  the  reactions 
leading  to  desensitization.  Successful  conclusion 
of  such  research  may  lead  to  methods  for  greatly 
prolonging  and  augmenting  the  therapeutic  ac- 
tions of  diverse  types  of  drugs. 


Opposite:  Desensitization  of  the  ^-adrenergic  receptor.  Many  pharmacological  agents  lose  their 
effectiveness  upon  repeated  or  prolonged  administration.  The  molecular  mechanisms  responsible 
for  this  "desensitization"  may  be  analogous  to  those  that  underlie  the  loss  of  responsiveness 
observed  in  cell  culture  systems  to  hormones  that  activate  ^-adrenergic  receptor  molecules.  Acti- 
vation of  these  receptors  promotes  their  rapid  association  with  the  a-subunit  of  the  stimulatory  G 
protein  (a^.  Depending  on  the  concentration  of  the  activating  hormone,  other  processes  are  also 
triggered  that  eventually  disrupt  this  association,  including  phosphorylation  of  the  receptor  mole- 
cule by  the  c AMP- dependent  protein  kinase  (PKA )  and  the  ^-adrenergic  receptor  kinase  (^ARK). 
Disruption  of  receptor- a ^  coupling  is  further  ensured  or  maintained  by  interaction  of  the  phos- 
phorylated  receptor  with  the  recently  identified  cytosolic  protein  ^-arrestin. 

From  Hausdorff,  W.P.,  Caron,  M.G.,  and  Lefkowitz,  R.f.  1990.  FASEBJ  4:2881-2889. 
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OUR  laboratory  is  interested  in  the  genetic 
mechanisms  that  control  the  establishment 
of  polarity  during  embryogenesis.  In  Drosophila, 
basic  information  about  the  "coordinates"  of  the 
embryo  is  supplied  to  the  egg  cell  during  its  matu- 
ration in  the  mother.  Mutations  in  maternal  genes 
have  led  to  the  identification  of  a  small  number  of 
genes  that  are  required  for  the  establishment  of 
dorsal-ventral  and  anterior-posterior  polarity  in 
the  embryo.  In  these  mutants  the  lack  of  a  partic- 
ular maternal  gene  product  leads  to  a  lethal  phe- 
notype  in  the  embryo. 

Three  independent  morphogenetic  systems  are 
required  for  the  establishment  of  pattern  along 
the  anterior-posterior  axis.  The  anterior  system 
controls  the  development  of  head  and  thorax,  the 
posterior  system  determines  the  abdominal  re- 
gion, and  the  terminal  system  is  required  for  the 
most  anterior  and  posterior  structures.  The  ante- 
rior and  posterior  systems  act  through  factors  that 
are  localized  to  the  respective  poles.  For  the  ante- 
rior group  of  genes,  the  localized  factor  is  the 
mRNA  product  of  the  gene  bicoid.  The  other  ante- 
rior genes  are  involved  in  the  localization  of  bi- 
coid RNA.  Localization  of  the  bicoid  RNA  is  neces- 
sary for  the  formation  of  a  concentration  gradient 
of  bicoid  protein.  This  gradient  along  the  ante- 
rior-posterior axis  determines  thresholds  for  the 
expression  of  zygotic  target  genes.  Thus  high  lev- 
els of  bicoid  promote  head  formation,  while 
lower  levels  promote  thorax  formation. 

We  have  concentrated  on  dissecting  the  path- 
way that  leads  to  normal  development  of  the  pos- 
terior region  of  the  embryo.  Nine  maternal  genes 
have  been  identified  that  set  the  basic  posterior 
pattern.  These  genes,  referred  to  as  the  posterior 
group,  share  the  abdominal  phenotype:  homozy- 
gous mutant  females  produce  offspring  that  lack 
abdominal  segmentation.  Only  the  nanos  gene, 
however,  encodes  the  localized  signal  required 
for  the  development  of  the  abdomen,  while  seven 
genes — oskar,  vasa,  tudor,  valois,  staufen, 
cappuccino,  and  spire — are  required  for  the  lo- 
calization of  nanos. 
These  genes,  in  addition  to  the  abdominal  phe- 


notype, show  an  effect  on  germ  cell  formation: 
mutant  embryos  do  not  form  a  specialized  poste- 
rior pole  plasm  that  normally  contains  the  polar 
granules,  and  hence  these  embryos  lack  pole 
cells,  the  precursors  of  the  germline  cells  in  Dro- 
sophila. One  gene,  pumilio,  is  required  for  the 
distribution  of  the  nanos  protein.  Our  molecular 
analysis  of  the  posterior  group  is  aimed  at  deter- 
mining how  nanos  and  various  other  RNA  species 
become  localized  to  the  posterior  pole  and  how 
the  pole  plasm,  composed  of  RNA  and  protein,  is 
assembled. 

Synthesis  and  Function  of  the 
Abdominal  Signal 

Through  genetic  experiments  as  well  as  cyto- 
plasmic transfers  between  wild-type  and  mutant 
embryos,  we  concluded  that  the  specialized  cyto- 
plasmic zone  at  the  posterior  pole  is  the  source  of 
an  abdominal  signal  and  that  embryos  derived 
from  females  mutant  for  any  one  of  the  posterior 
group  genes  lack  a  common  signal  in  the  abdomi- 
nal region.  Depending  on  the  gene  affected,  mu- 
tant embryos  are  deficient  in  the  synthesis,  stor- 
age, or  transport  of  the  signal. 

The  gene  nanos  is  central  to  the  genetic  path- 
way leading  to  abdomen  formation.  We  have  iso- 
lated and  cloned  the  gene  and  have  shown  that  it 
encodes  the  posterior  signal.  During  oogenesis, 
nanos  mRNA  is  transcribed  and  becomes  local- 
ized to  the  posterior  pole  of  the  oocyte.  The  cen- 
tral role  of  nanos  for  posterior  pattern  formation 
is  demonstrated  by  the  finding  that  nanos  tran- 
script synthesized  in  vitro  from  a  nanos  cDNA 
rescues  the  abdominal  segmentation  phenotype 
not  only  of  nanos  mutant  embryos  but  also  of  all 
posterior  group  mutants  so  far  tested.  Since  seven 
of  the  nine  mutants  (except  nanos  and  pumilio) 
also  lack  pole  plasm  and  fail  to  localize  nanos 
RNA,  we  can  conclude  that  in  pole  plasm-defec- 
tive mutants  the  abdominal  segmentation  defect 
is  a  consequence  of  a  lack  of  localized  nanos  ac- 
tivity ■  In  future  experiments  we  will  determine 
cis-acting  sequences  within  the  nanos  mRNA, 
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and  the  trans-acting  factors  required  for  nanos 
RNA  localization. 

We  are  just  beginning  to  understand  the  pro- 
cess through  which  the  nanos  protein  affects  the 
development  of  abdominal  structures.  The  gene 
appears  to  inhibit  the  activity  of  another  mater- 
nally derived  protein,  encoded  by  the  hunch- 
back gene.  Maternally  derived  hunchback  pro- 
tein seems  to  act  as  a  repressor  molecule;  it 
blocks  the  transcription  of  genes  within  the  em- 
bryo. The  nanos  gene  acts  negatively  on  the 
mRNA  stability  and/or  translation  of  the  mater- 
nally provided  product  of  hunchback,  such  that 
the  maternal  hunchback  transcript  and  protein 
product  are  absent  from  the  posterior  of  the  em- 
bryo. Thus,  by  interfering  with  hunchback  activ- 
ity, nanos  indirectly  promotes  activation  of  the 
first  tier  of  embryonic  genes  required  for  abdo- 
men formation. 

Distribution  of  the  Posterior  Signal 

The  pumilio  gene  is  required  for  the  distribu- 
tion of  the  abdominal  signal,  encoded  by  the 
na«05  gene,  during  early  embryogenesis.  It  there- 
fore provides  an  excellent  opportunity  to  study 
the  redistribution  of  localized  cytoplasmic  deter- 
minants. Initial  cytoplasmic  transplantation  ex- 
periments indicated  that  in  the  absence  of  the 
maternally  provided  pumilio  gene  product,  the 
abdominal  signal  nanos  is  properly  localized  to 
the  posterior  pole  but  does  not  reach  its  target, 
the  prospective  abdominal  region.  In  situ  hybrid- 
ization studies  have  demonstrated  that  nanos 
mRNA  is  localized  to  the  posterior  pole  plasm  in 
wild-type  embryos  and  that  its  distribution  is  un- 
affected by  a  pumilio  mutation.  This  suggests  that 
the  pumilio  gene  affects  the  distribution  of  the 
nanos  protein. 

We  have  cloned  the  pumilio  gene  and  have 
shown  that  it  encodes  several  transcripts  that  are 
present  throughout  development.  In  early  em- 
bryos, pumilio  transcripts  are  localized  to  the 
posterior  pole.  This  observation  is  consistent 
with  the  possible  interaction  between  pumilio 
and  nanos  gene  products.  The  localization  of  the 
transcript  is  disrupted  by  the  same  set  of  mutants 
that  disrupt  nanos  localization,  and  it  is  possible 


that  the  same  mechanism  of  RNA  localization  acts 
on  both  RNAs. 

Assembly  of  the  Pole  Plasm 

The  remaining  seven  genes  in  the  posterior 
group  are  associated  with  pole  plasm  defects.  We 
are  interested  in  their  role  in  pattern  formation 
and  thus  in  nanos  RNA  localization.  We  are  also 
very  excited  about  the  potential  for  learning 
more  about  the  origin  of  the  germline. 

During  the  first  hour  of  embryogenesis,  the 
pole  plasm  appears  as  a  distinct  clear  zone  at  the 
posterior  end  of  the  newly  fertilized  egg.  Closer 
examination  reveals  that  the  zone  actually  con- 
tains specialized  cytoplasm  packed  with  mito- 
chondria and  numerous  donut-shaped  organelles 
that  do  not  occur  anywhere  else  in  the  embryo. 
These  organelles,  called  polar  granules,  must  be 
present  for  the  formation  of  pole  cells,  the  pre- 
cursors of  the  germ  cells.  Polar  granules  were 
first  recognized  in  Drosophila  embryos  30  years 
ago,  but  until  recently  little  was  known  about 
their  biochemical  structure  or  function.  Now  the 
link  between  defects  in  abdomen  formation  and 
the  absence  of  pole  cells  has  provided  a  new  ave- 
nue for  research. 

Our  hypothesis  is  that  the  seven  remaining  pos- 
terior group  genes  do  not  have  a  direct  role  in 
transmission  of  the  abdominal  signal.  Instead, 
they  provide  the  structural  framework  that  local- 
izes nanos  mRNA  in  the  pole  plasm.  This  frame- 
work may  be  the  polar  granule  or  components 
thereof.  We  have  begun  extensive  studies  of  the 
pole  plasm  in  mutant  and  normal  embryos,  using 
probes  designed  to  recognize  mRNA  and  proteins 
derived  from  the  posterior  group  genes. 

We  are  optimistic  that  new  information  about 
the  assembly  and  composition  of  the  pole  plasm 
and  polar  granules  will  lead  to  greater  under- 
standing of  their  various  functions  in  the  early 
embryo.  Structures  similar  to  the  polar  granules 
in  Drosophila  have  been  observed  in  association 
with  germ  plasm  in  many  different  invertebrate 
and  vertebrate  animals.  Thus  the  Drosophila 
model  could  reveal  basic  concepts  underlying 
the  establishment  of  germline  tissues  in  all 
species. 
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Developmental  regulation  of  cTnC  gene  expression  in  vitro.  A:  Differentiation  of  C2C12  myo- 
blasts into  myotubes  induced  by  serum  deprivation.  Cultures  of  C2C12  myoblasts  are  shown  on 
the  left.  After  48  h  of  serum  deprivation,  more  than  95  percent  of  the  myoblasts  have  differen- 
tiated into  myotubes  (right panel).  Magnification,  X  54. 

B:  Northern  blot  analysis  ofRNA  samples  isolated  from  murine  BALB/c  cardiac  tissue  (marked 
HeartJ,  undifferentiated  C2C12  myoblasts  (C2C12  Myoblast),  and  differentiated  C2C12  myo- 
tubes (C2C12  Myotube)  hybridized  to  the  murine  cTnC  cDNA  probe.  The  bottom  panel  shows  the 
same  filters  hybridized  to  a  glyceraldehyde-  3-phosphate  dehydrogenase  (GPD)  probe. 

From  Parmacek,  M.S.,  and  Leiden,  J. M.  1989.  J  Biol  Chem  264:13217-13225. 
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THE  processes  of  cellular  differentiation  and 
activation  are  accompanied  by  complex  and 
precisely  orchestrated  changes  in  gene  expres- 
sion. Abnormalities  in  the  expression  patterns  of 
these  differentiation-  and  activation-specific 
genes  may  be  involved  in  the  etiology  of  a  num- 
ber of  pathologic  states,  including  autoimmune 
disease  and  malignancy.  My  laboratory  is  studying 
gene  regulation  during  T  lymphocyte  and  muscle 
cell  differentiation,  in  order  to  understand  bet- 
ter the  molecular  mechanisms  that  regulate 
gene  expression  during  normal  and  pathologic 
development. 

Regulation  of  Human  T  Cell  Receptor  (TCR) 
Gene  Expression  During  T  Cell  Development 

Human  T  cells  recognize  foreign  antigens,  such 
as  virus-infected  cells  and  tumor  cells,  via  spe- 
cific cell-surface  TCR  molecules.  T  cells  can  be 
divided  into  two  subsets,  based  on  their  expres- 
sion of  two  distinct  types  of  antigen  receptor  mol- 
ecules. The  majority  of  circulating,  peripheral 
blood  T  cells  (including  all  helper  and  cytotoxic 
T  cells)  express  the  TCR  a/^  heterodimer;  a  small 
but  distinct  T  cell  subset  of  unknown  function 
expresses  the  7/6  TCR.  These  a//?  and  7/6  T  cells 
appear  to  develop  as  separate  lineages  during 
thymic  ontogeny.  During  the  past  several  years, 
my  laboratory  has  been  interested  in  the  mole- 
cular mechanisms  that  regulate  the  expression 
of  these  TCR  genes  during  T  lymphocyte 
development. 

In  an  initial  set  of  studies,  we  identified  the 
transcriptional  enhancer  elements  that  control 
the  expression  of  the  TCR  a  and  |S  genes.  These 
two  enhancers  were  shown  to  be  required  for  the 
expression  of  the  TCR  a  and  /3  genes  in  T  cells  and 
to  function  equally  well  in  both  mouse  and  hu- 
man cells.  Despite  these  similarities,  they  display 
a  number  of  distinct  molecular  properties.  For 
example,  the  TCR  a  enhancer  functions  only  in 
a/18  T  cells  and  is  equally  active  on  a  number  of 
different  promoters;  the  TCR  ^  enhancer  is  quite 
active  in  both  a/^  and  7/6  T  cells,  displays  low- 
level  activity  in  B  cells,  but  is  inactive  in  nonlym- 
phoid  cells.  The  identification  and  localization  of 


the  human  TCR  enhancers  led  us  to  propose  that 
certain  T  cell  tumors  previously  shown  to  contain 
chromosomal  translocations  into  the  human  TCR 
a  and  jS  loci  might  be  caused  by  the  apposition  of 
the  TCR  gene  enhancers  with  translocated 
proto-oncogenes . 

Our  more  recent  studies  have  focused  on  iden- 
tifying and  characterizing  the  enhancer  DNA  se- 
quences and  the  nuclear  proteins  bound  by  these 
sequences  that  are  responsible  for  controlling 
TCR  a  and  ^  gene  expression.  These  experiments 
have  demonstrated  that  both  enhancers  contain 
4-5  different  binding  sites  for  nuclear  proteins. 
At  least  two  of  these  sites  in  each  are  required  for 
enhancer  activity.  Several  of  the  nuclear  protein- 
binding  sites  in  each  enhancer  correspond  to  pre- 
viously defined  enhancer  motifs,  while  others 
represent  novel  sequence  elements.  At  least  one 
site  in  each  enhancer  was  shown  to  bind  T  cell- 
specific  nuclear  proteins.  Both  enhancers  were 
shown  to  contain  a  nuclear  protein-binding  site 
that  was  identical  to  the  previously  described 
cAMP  response  element  (CRE) . 

To  clarify  the  function  of  the  CRE  in  regulating 
T  cell  gene  expression,  we  have  cloned  two  novel 
CRE-binding  proteins,  CREB-2  and  CREB-3, 
which  specifically  bind  to  the  TCR  a  CRE  and  to 
CREs  from  several  other  eukaryotic  promoters. 
Both  of  these  proteins  contain  similar  basic  DNA- 
binding  domains  and  a  leucine  zipper  region  that 
allows  them  to  form  dimeric  complexes  both 
with  themselves  and,  potentially,  with  other 
members  of  the  CREB  protein  family. 

More  recently  we  have  shown  that  ets- 1 ,  a  pre- 
viously described  human  proto-oncogene,  is  actu- 
ally a  DNA-binding  protein  that  specifically  recog- 
nizes one  of  the  nuclear  protein-binding  sites  in 
the  TCR  a  enhancer.  These  studies  have  helped  to 
define  the  Ets  proteins  as  transcriptional  regula- 
tors that  may  play  an  important  role  in  control- 
ling T  cell  gene  expression.  Finally,  in  collabora- 
tion with  Stuart  Orkin  (HHMI,  the  Children's 
Hospital,  Boston)  we  have  identified  a  new  T 
cell-specific  transcription  factor  called  hGATA-3 
that  binds  to  a  third  nuclear  protein-binding  site 
within  the  TCR  a  enhancer.  Ongoing  studies  are 
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designed  to  determine  the  role  of  each  of  these 
transcription  factors  in  normal  T  cell  develop- 
ment and  activation,  as  well  as  in  T  cell  tumors. 

Molecular  Characterization  of  the  Cardiac 
and  Skeletal  Muscle  Troponin  C  Genes 

^  Normal  heart  muscle  contraction  depends  on 
the  phenomenon  of  excitation-contraction  cou- 
pling— i.e.,  the  transmission  of  the  electrical  de- 
polarization signal  into  the  biochemical  events 
involved  in  muscle  cell  contraction.  Cardiac  tro- 
ponin C  (cTnC)  is  the  calcium-binding  subunit 
of  the  muscle  fiber  that  regulates  excitation-con- 
traction coupling.  Cardiac  or  skeletal  muscle  con- 
traction is  initiated  by  electrical  depolarization 
of  the  muscle  cell  membrane,  resulting  in  a  dra- 
matic rise  in  intracellular  calcium,  which  in  turn 
binds  to  the  troponin  C  molecule,  causing  a  con- 
formational change  in  the  structure  of  the  poly- 
peptide. This  signal  is  transduced  across  the 
muscle  fiber,  facilitating  the  formation  of  actin- 
myosin  cross  bridges,  and  ultimately  resulting  in 
the  generation  of  tension  in  the  muscle  fiber. 

Cardiac  and  skeletal  muscle  each  express  a  dif- 
ferent form  of  troponin  C.  These  two  forms  dis- 
play markedly  different  biophysical  properties 
that,  in  part,  account  for  the  different  properties 
of  cardiac  and  skeletal  muscle  contraction.  To 
clarify  the  function  and  regulation  of  the  cTnC 
and  skeletal  muscle  troponin  C  (sTnC)  mole- 
cules, we  have  recently  cloned  cDNAs  and  genes 
that  encode  both  of  these  proteins.  Structural 
studies  have  revealed  that  these  genes  belong  to  a 
common  multigene  family  that  contains  at  least 
8-10  closely  related  members.  Although  the  two 
proteins  are  highly  related,  we  have  identified 
three  areas  that  display  significant  structural  di- 
versity and  that  may  account  for  the  functional 


differences.  Studies  of  the  expression  of  the  two 
genes  have  shown  that  the  cTnC  gene  is  ex- 
pressed only  in  embryonic  (not  adult)  fast  skele- 
tal muscle  and  in  adult  cardiac  and  slow  skeletal 
muscle,  while  the  fast  sTnC  gene  is  expressed 
only  in  embryonic  and  adult  fast  skeletal  muscle. 
Although  neither  gene  is  expressed  in  cultured 
myoblasts  (muscle  cell  progenitors),  the  expres- 
sion of  both  is  markedly  induced  following  chem- 
ical differentiation  of  these  myoblasts  into  embry- 
onic myotubes. 

Recently  we  have  focused  on  identifying  the 
genetic  elements  that  regulate  the  tissue-specific 
expression  of  the  two  TnC  genes  during  muscle 
cell  development.  These  studies  have  dem- 
onstrated that  the  cTnC  gene  contains  two  dis- 
tinct tissue-specific  transcriptional  enhancer  ele- 
ments: 1)  a  first-intron  skeletal  muscle-specific 
enhancer  that  is  required  for  expression  of  the 
cTnC  gene  in  embryonic  skeletal  myotubes  and 
2)  an  upstream  cardiac-specific  enhancer  that  is 
required  for  high-level  cTnC  gene  expression  in 
cardiac  myocytes.  Interestingly,  the  sTnC  gene 
also  contains  a  first-intron  enhancer  that  is  active 
in  skeletal  but  not  cardiac  muscle  cells.  Ongoing 
studies  are  designed  to  dissect  the  molecular  re- 
quirements for  cTnC  and  sTnC  enhancer  function 
and  to  clone  and  characterize  the  enhancer- 
binding  proteins  that  control  the  expression  of 
these  three  enhancers. 

More  recently  we  have  developed  a  method  for 
introducing  foreign  genes  directly  into  heart 
muscle  cells  in  living  animals.  These  inserted 
genes  are  expressed  in  the  heart  muscle  for  as 
long  as  six  months.  This  simple  technique,  which 
involves  injection  of  recombinant  genes  directly 
into  the  beating  hearts  of  rats  and  mice,  holds 
promise  for  the  treatment  of  a  number  of  human 
acquired  and  inherited  cardiovascular  diseases. 
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IN  human  beings,  olfaction  is  primarily  a  he- 
donistic sense;  among  other  things,  it  provides 
most  of  the  enjoyment  derived  from  eating.  For 
other  animals,  however,  olfaction  is  important 
for  finding  mates  and  food  and  avoiding  enemies 
and  noxious  compounds.  To  study  the  mecha- 
nisms behind  olfaction,  we  are  using  as  a  model 
system  the  moth  Manduca  sexta. 

Male  moths  locate  females  by  their  keen  ability 
to  detect  sex  pheromones  released  by  females  up 
to  several  miles  distant.  The  "nose"  of  a  moth  is 
its  antennae.  The  antennae  are  covered  with  fine 
hair-like  processes  called  sensilla.  The  surface  of 
a  sensillum  is  composed  of  chitin  perforated  by 
numerous  small  pores,  which  open  into  tubules 
leading  into  a  receptor  lymph.  Inside  each  sensil- 
lum and  surrounded  by  the  receptor  lymph  are 
the  cilia  from  olfactory  receptor  neurons. 

For  an  air-breathing  animal  to  detect  an  odor, 
three  events  must  take  place.  First,  the  volatile 
chemical  in  question,  which  is  often  quite  hydro- 
phobic, must  cross  an  aqueous-based  medium — 
in  the  case  of  moths,  the  receptor  lymph — to 
reach  the  cilia.  A  perfect  example  of  such  an 
odorant  is  bombykal,  a  l6-chain  hydrocarbon 
with  an  aldehyde  group  at  one  end,  which  is  a  sex 
pheromone  for  Manduca.  Second,  the  chemical 
must  interact  with  its  receptor.  Third,  the  chemi- 
cal, or  at  least  its  action,  must  be  terminated.  This 
last  point  is  particularly  important  in  the  case  of 
pheromones,  where  real-life  working  concentra- 
tions are  subpicomolar,  necessitating  that  noise 
be  kept  to  a  minimum. 

The  receptor  lymph  is  inundated  with  a  15- 
kDa  protein  called  pheromone-binding  protein 
(PBP).  It  is  widely  hypothesized  that  PBP  is  in- 
volved in  the  translocation  of  pheromone  from 
the  pore  tubules  to  the  membranes  of  the  cilia. 
We  have  been  characterizing  this  binding  protein 
in  Manduca  and  are  currently  working  on  its  de- 
velopmental regulation.  To  facilitate  these  stud- 
ies we  have  cloned  and  sequenced  the  cDNA  en- 
coding it. 

If  PBP  is  responsible  for  solubilizing  and  trans- 
porting pheromone  to  pheromone  receptors, 


how  do  general  odorants  get  to  their  receptors? 
We  have  found  that  the  olfactory  sensilla  for  de- 
tecting general  odors  have  two  proteins  that  are 
related  to  PBP.  These  proteins,  which  are  ex- 
pressed in  both  males  and  females,  may  be  gen- 
eral odorant  carriers.  Recently  we  cloned  and  se- 
quenced cDNA  coding  for  both  of  these  proteins, 
which  have  been  named  GOBPs  (general  odor- 
ant-binding  proteins). 

Odor  inactivation  could  be  the  result  of  any  of 
several  processes,  ranging  from  destruction  of  the 
chemical  in  question,  once  it  has  interacted  with 
a  receptor,  to  desensitization  of  the  receptor.  A 
developmentally  regulated  antenna-specific  al- 
dehyde oxidase  (AOX)  has  been  identified.  This 
1 50-kDa  enzyme  has  a  strong  preference  for  bom- 
bykal as  a  substrate  as  opposed  to  other  alde- 
hydes. As  a  result  of  the  AOX,  the  half-life  of  a 
molecule  of  pheromone,  once  it  enters  a  sensil- 
lum, is  less  than  1  millisecond. 

A  Tissue-Specific  snRNP  Protein 

Messenger  RNA  splicing  is  a  fundamental  pro- 
cess in  eukaryotic  cells.  The  precise  removal  of 
intervening  sequences  (introns)  and  the  rejoin- 
ing of  exons  to  form  an  mRNA  encoding  a  specific 
protein  are  in  some  ways  analogous  to  making  a 
movie.  First  many  scenes  are  taken.  The  primary 
RNA  transcript  made  from  a  DNA  template  is  akin 
to  this  unprocessed  film.  Like  the  film,  RNA  is 
processed  in  several  ways.  For  example,  a  modi- 
fied nucleotide  cap  structure  is  placed  at  the  be- 
ginning of  the  RNA  transcript — just  as  a  title  and 
headings  are  placed  at  the  beginning  of  the  film. 
Also,  a  strip  of  nucleotides,  the  poly(A)  tail,  is 
added  to  the  3'  end  of  the  RNA — as  credits  are 
added  at  the  end  of  the  film.  One  of  the  most 
interesting  and  basic  things  that  happens  to  an 
RNA  transcript  during  its  maturation  is  the  exci- 
sion of  introns  and  the  connection  of  exons. 
Again,  as  in  movie  making,  the  film  that  has  taken 
many  hours  to  shoot  must  be  edited  so  that  only 
the  most  important  two  hours  remain.  The  cellu- 
lar components  that  perform  these  splicing  reac- 
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tions  are  called  snRNPs  (small  nuclear  ribonu- 
cleoproteins;  see  the  report  in  this  volume  by 
Joan  Steitz,  HHMI,  Yale  University).  As  might  be 
expected  for  such  a  fundamental  process  as  splic- 
ing, the  individual  components  of  the  snRNPs, 
their  snRNAs  and  proteins,  are  generally  present 
in  all  tissues. 

Usually  when  snRNPs  splice  a  given  pre-mRNA, 
there  is  only  one  possible  final  product.  How- 
ever, this  is  not  always  the  case,  particularly  with 
regard  to  RNA  processing  in  the  central  nervous 
system  (CNS).  Imagine  a  movie  with  more  than 
one  potential  ending,  or  even  alternative  scenes 
in  the  middle,  so  that  individual  viewers  could 
make  their  own  choices.  Alternative  mRNA  splic- 
ing is  a  precise  process  by  which  different  cells. 


starting  with  the  same  primary  RNA  transcript, 
make  related,  but  distinct,  mature  mRNAs.  At 
least  one  snRNP  protein  is  apparently  present  in  a 
tissue-specific  manner,  occurring  mainly  in  brain 
and  to  a  much  lesser  extent  in  other  tissues  such 
as  heart,  adrenal  gland,  olfactory  epithelium,  and 
pigment  cells.  This  protein,  called  N,  has  been 
identified  and  cloned  in  collaboration  with  Susan 
Amara  (HHMI,  Yale  University).  Although  N  is 
expressed  predominately  in  the  CNS,  it  is  only 
present  in  neurons  and  not  in  the  more  abundant 
nonneuronal  cells,  such  as  glia.  Because  of  the 
restricted  tissue  and  cell  expression  of  N,  the  ge- 
nomic sequence  controlling  its  expression  is  of 
interest.  Recently  this  sequence  has  been  ob- 
tained and  is  now  being  investigated. 


284 


Structural  Determinants  of  a-Globin 
Gene  Expression 


i 


Stephen  A.  Liebhaber,  M.D. — Associate  Investigator 

Dr.  Liebhaber  is  also  Associate  Professor  of  Human  Genetics  and  Medicine  (Hematology )  at  the  University 
of  Pennsylvania  School  of  Medicine.  He  received  his  B.A.  degree  in  chemistry  from  Brandeis  University 
and  his  M.D.  degree  from  Yale  University.  He  took  clinical  training  in  internal  medicine,  hematology,  and 
molecular  biology  at  four  other  universities.  As  a  postdoctoral  fellow  with  David  Schlessinger  at  Wash- 
ington University,  Dr.  Liebhaber  examined  ribosomal  RNA  processing,  and  with  Y.  W.  Kan  at  the  Univer- 
sity of  California,  San  Francisco,  he  studied  human  globin  gene  expression  and  genetic  defects  in  a-thal- 
assemia.  Before  moving  to  Philadelphia,  he  was  a  faculty  member  of  the  Department  of  Medicine  at  UCSP. 


EXPRESSION  of  a  eukaryotic  gene  is  a  multi- 
step  process.  Specific  regions  of  chromatin 
must  be  activated  and  transcribed,  transcripts  pro- 
cessed, and  mature  mRNA  exported  to  the  cyto- 
plasm for  translation  into  protein.  Gene  expres- 
sion can  be  controlled  at  any  or  all  of  these  steps. 
Our  laboratory  has  largely  concentrated  on  study- 
ing the  expression  of  the  human  globin  genes. 
These  genes  encode  hemoglobin,  the  major  red 
cell  protein  responsible  for  oxygen  transport 
from  the  lungs  to  peripheral  tissues.  Since  the 
hemoglobin  molecule,  a2/^2'  is  composed  of  an 
equal  number  of  a-  and  |8-globin  chains,  normal 
synthesis  demands  balanced  expression  of  both 
sets  of  genes.  Defects  in  either  set  result  in  an 
imbalance  of  expression  and  consequent  anemia: 
a-  or  i8-thalassemia.  Thalassemias  result  from  al- 
most 100  different  mutations  in  the  globin  genes, 
affecting  the  health  of  millions  worldwide. 

Certain  characteristics  of  globin  gene  expres- 
sion make  it  particularly  interesting  for  study. 
The  extremely  high  level  of  globin  mRNA  in  the 
differentiating  red  cell  (over  95  percent  of  total 
cellular  mRNA)  has  no  equal  in  any  other  cell 
type.  This  abundance  reflects  both  high  levels  of 
synthesis  (transcription)  and  an  unusual  stability 
of  the  mature  globin  mRNA. 

Another  interesting  characteristic  is  that  the  a- 
and  /3-globin  gene  clusters  follow  an  orderly  se- 
quence of  expression  during  embryologic  devel- 
opment. This  results  in  a  well-defined  switch 
from  embryonic  to  adult  globin  gene  expression 
during  development  of  the  fetus.  The  switching 
results  in  the  synthesis  of  successive  hemoglo- 
bins with  oxygen  afiinities  that  match  changes  in 
the  uterine  environment.  The  active  transcrip- 
tion, unusual  mRNA  stability,  and  clearly  defined 
pattern  of  developmental  switching  are  areas  of 
special  focus  in  our  laboratory. 

The  loss  of  a-globin  gene  expression  observed 
in  a-thalassemia  usually  results  from  deletion  or 
abnormal  structure  of  one  or  more  of  the  a-globin 
genes.  We  have  recently  studied  a  series  of  pa- 
tients with  a  rather  unusual  form  of  a-thalassemia 
in  that  their  silenced  a-globin  genes  have  entirely 


normal  structures.  When  these  genes  are  isolated 
and  reintroduced  into  cells,  they  function  as  well 
as  normal  ones.  The  cause  of  the  abnormality  in 
these  patients  appears  to  lie  in  a  region  of  DNA 
outside  the  genes  themselves. 

Through  extensive  DNA  mapping,  we  have  de- 
tected large  deletions  outside  the  a-globin  clus- 
ter in  three  a-thalassemic  individuals.  One  of 
these  ectopic  deletions  is  separated  from  the  a- 
globin  gene  by  at  least  30,000  bases.  By  compar- 
ing the  maps  of  each  deletion,  a  region  of  com- 
mon overlap  is  noted.  Since  a  similar  set  of  signals 
critical  for  transcriptional  activation  has  been 
identified  adjacent  to  the  |S-globin  gene  cluster, 
one  can  speculate  that  such  signals  serve  coordi- 
nately  to  activate  the  expression  of  both  the  a- 
and  |S-globin  clusters  in  the  red  cell. 

The  human  a-globin  gene  cluster  contains  a  f- 
globin  gene  expressed  specifically  in  the  embryo 
and  two  a-globin  genes,  al  and  a2,  expressed  in 
the  fetus  and  the  adult.  The  switch  from  embry- 
onic ^globin  to  adult  a-globin  occurs  at  the  end 
of  embryonic  development  (7-8  weeks  of  gesta- 
tion). This  critical  developmental  switch,  which 
occurs  widely  in  mammals,  presents  a  well- 
defined  model  system  for  studying  developmen- 
tal control  of  gene  expression. 

To  establish  a  system  in  which  to  study  switch- 
ing within  the  human  a-globin  gene  cluster,  we 
have  introduced  the  human  and  a-globin  genes 
into  fertilized  mouse  eggs  to  generate  transgenic 
mice.  The  red  cells  of  these  mice  appropriately 
express  the  human  transgenes  during  develop- 
ment. In  the  embryonic  period,  there  is  parallel 
expression  of  the  human  and  mouse  f-globin 
genes,  and  by  day  12  of  development,  parallel 
expression  of  the  a-globin  genes.  These  data  sug- 
gest that  1 )  the  human  transgenes  contain  the  nec- 
essary information  for  appropriate  developmen- 
tal control  and  2)  the  factors  responsible  for 
developmental  switching  in  the  red  cell  have 
been  sufficiently  conserved  during  evolution  to 
substitute  in  the  control  over  the  human  trans- 
genes. By  generating  transgenic  mice  that  carry 
human  and  a-globin  genes  with  specific  alter- 
ations, and  by  studying  their  pattern  of  develop- 
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mental  expression,  it  should  be  possible  to  arrive 
at  a  definition  of  the  signal  (s)  critical  to  globin 
gene  switching. 

Selective  stabilization  of  globin  mRNA  is  essen- 
tial to  its  accumulation  in  terminally  differentiat- 
ing red  cells  to  levels  of  95  percent  of  total  cellu- 
4ar  mRNA.  We  are  attempting  to  define  the 
structural  basis  for  the  stability  of  a-globin  mRNA 
through  investigation  of  an  a-thalassemia  muta- 
tion in  which  this  stability  is  lost:  a  Constant 
Spring  (aCS).  This  mutation,  the  most  common 
cause  of  nondeletional  a-thalassemia  in  Southeast 
Asia,  is  a  codon  substitution  (CAA  for  UAA)  at  the 
termination  of  the  normal  a 2 -globin  gene.  As  a 
result  of  this  single-base  substitution,  the  ribo- 
some  translates  into  the  normally  untranslated  3' 
region  and  destabilizes  the  mRNA. 

To  study  the  basis  for  this  destabilization  in  de- 


tail, we  have  established  an  experimental  system 
that  reproduces  the  selective  instability  of  aCS 
mRNA  in  tissue  culture  cells.  Remarkably,  we 
find  that  the  mRNA  is  unstable  when  the  gene  is 
expressed  in  an  erythroid  cell  line,  but  normally 
stable  when  expressed  in  nonerythroid  cell  lines. 
This  suggests  that  the  stability  of  globin  mRNA 
may  depend  upon  interaction  with  one  or  more 
erythroid-specific  factors. 

By  making  specific  alterations  in  the  structure 
of  the  a-globin  genes  prior  to  expression,  it  has 
been  possible  to  demonstrate  that  the  region 
within  the  a-globin  mRNA  critical  to  its  stability 
is  located  in  a  segment  of  the  3'-nontranslated  re- 
gion just  past  the  translation  termination  codon. 
In  future  experiments  the  critical  cellular  factors 
or  structures  mediating  this  response,  and  the 
mechanism  involved,  will  be  further  defined. 
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THE  causes  of  heat-induced  lethality  and  the 
mechanisms  employed  by  cells  to  protect 
themselves  from  heat  damage  are  poorly  under- 
stood. Over  the  past  decade,  a  great  deal  of  re- 
search has  focused  on  a  small  group  of  highly 
conserved  proteins,  the  heat-shock  proteins 
(HSPs).  These  proteins  are  induced  in  response 
to  temperature  elevation  and  a  wide  variety  of 
other  stresses.  This  remarkable  response  is  the 
most  highly  conserved  genetic  induction  known, 
which  underscores  its  fundamental  importance 
in  biology.  Archaeobacteria,  eubacteria,  plants, 
and  animals  all  produce  similar  proteins.  Several 
of  these  proteins  show  very  high  levels  of  conser- 
vation, with  40-50  percent  amino  acid  identity 
between  the  proteins  of  human  cells  and  bacte- 
rial cells. 

Much  indirect  evidence  suggests  that  the  HSPs 
are  instrumental  in  protecting  cells  and  organ- 
isms from  lethality  at  extreme  temperatures.  For 
example,  pretreatments  at  moderately  elevated 
temperatures,  which  induce  the  synthesis  of 
HSPs,  result  in  tolerance  to  much  higher  tempera- 
tures. The  kinetics  of  HSP  induction  closely  paral- 
lel the  kinetics  of  thermotolerance  induction, 
and  the  decay  of  HSPs  from  the  cell  closely  paral- 
lels the  decline  in  thermotolerance.  Moreover, 
HSP  functions  are  not  limited  to  conditions  of 
high  temperature.  They  are  induced  by  many 
other  stresses,  including  anoxia  and  reoxygena- 
tion,  heavy  metal  ions,  ethanol,  and  inhibitors  of 
respiration  or  oxidative  phosphorylation.  In  gen- 
eral, exposure  to  one  form  of  stress  provides  toler- 
ance to  another,  suggesting  that  the  proteins  have 
broad  protective  roles.  Studies  of  the  heat-shock 
response  are,  therefore,  of  practical  interest. 
(The  development  of  crop  plants  with  extended 
heat  tolerance,  for  example,  would  be  of  enor- 
mous benefit  to  agricultural  productivity.) 

HSPs  are  of  particular  interest  to  human  biol- 
ogy and  medicine,  for  four  reasons.  1)  Studies  of 
cultured  cells  in  vitro  and  of  tumors  in  vivo  dem- 
onstrate that  many  cancer  cells  are  more  readily 
killed  by  heat  than  are  untransformed  cells.  For 
this  reason,  hyperthermia,  in  conjunction  with 
radiation  and  chemotherapy,  is  emerging  as  an 


important  new  tool  in  cancer  therapy.  2)  High 
temperatures  are  associated  with  a  number  of  de- 
velopmental anomalies  in  a  wide  variety  of  plants 
and  animals,  including  spina  bifida  in  humans.  In 
those  organisms  that  have  been  subjected  to  ex- 
perimental manipulation,  mild  preheat  treat- 
ments, which  induce  the  HSPs,  provide  protec- 
tion. 3)  The  induction  of  HSPs  is  associated  with 
a  variety  of  human  pathological  states,  including 
strokes,  heart  attacks,  and  kidney  disease.  Interest 
in  the  proteins  includes  both  their  putative  pro- 
tective functions  in  affected  tissues  and  the  possi- 
bility of  quantifying  them  as  disease  markers.  4) 
The  proteins  interact  with  and  potentiate  the 
function  of  many  other  proteins  in  the  cell. 

As  important  as  these  pragmatic  concerns  may 
be,  most  studies  of  the  heat-shock  response,  in- 
cluding those  in  my  laboratory,  have  been  moti- 
vated by  other  considerations.  Initially  the  re- 
sponse was  exploited  as  a  model  system  to  study 
the  cellular  mechanisms  involved  in  regulating 
protein  synthesis.  Because  induction  of  the  pro- 
teins is  required  for  survival,  a  number  of  sophis- 
ticated mechanisms  are  employed  to  ensure  that 
the  proteins  will  be  produced  as  rapidly  as  possi- 
ble after  exposure  to  stress.  Thus  studies  of  the 
response  have  provided  fundamental  insights  on 
the  nature  of  nuclear  and  cytoplasmic  regulation 
in  both  eukaryotes  and  prokaryotes. 

The  recent  discovery  that  the  HSPs,  or  their 
close  relatives,  play  vital  roles  in  the  cell  at  nor- 
mal temperatures  has  opened  up  a  whole  new 
field  of  investigation.  The  specific  molecular 
functions  of  the  HSPs  are  only  beginning  to  be 
elucidated,  but  they  appear  to  play  a  role  in  a 
remarkable  number  of  basic  cellular  processes, 
among  them  secretion,  signal  transduction,  and 
ribosome  assembly.  Determining  the  roles  these 
proteins  play  in  these  processes  will  provide  fun- 
damental insights  in  cell  biology. 

We  are  investigating  the  regulation  and  the 
function  of  the  HSPs.  Our  research  focuses  on  the 
yeast  Saccharomyces  cerevisiae  and  the  fruit  fly 
Drosophila  melanogaster,  because  techniques 
of  genetic  manipulation  and  molecular  analysis 
are  so  advanced  in  them.  For  the  past  few  years 
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our  investigations  of  the  regulation  of  the  re- 
sponse have  concentrated  on  post-transcriptional 
mechanisms  that  are  employed  to  maximize  the 
synthesis  of  the  HSPs  during  heat  shock  or  to  shut 
off  the  synthesis  of  HSPs  after  heat  shock.  Tom 
McGarry  and  Bob  Petersen  have  found  that  heat- 
shock  mRNAs  in  Drosophila  cells  are  preferen- 
tially translated  during  heat  shock  by  virtue  of 
sequences  in  their  5'-untranslated  leaders.  They 
are  preferentially  repressed  after  heat  shock 
(during  recovery)  by  virtue  of  sequences  in  their 
3'-untranslated  tails,  sequences  that  selectively 
target  them  for  destruction.  The  latter  sequences 
are  shared  by  many  non-heat-shock  messages, 
which  have  the  common  property  of  being  rap- 
idly degraded  at  normal  temperatures.  Heat- 
shock  regulation  takes  advantage  of  this  common 
pathway  to  control  HSP  expression.  The  mecha- 
nism is  inactivated  during  heat  shock  and  re- 
stored during  recovery. 

Joseph  Yost  demonstrated  that  heat  shock 
blocks  the  processing  of  messenger  RNA  precur- 
sors, explaining  the  puzzling  observation  that 
heat-shock  genes  generally  do  not  have  interven- 
ing sequences.  In  this  line  of  research  our  studies 
of  HSP  function  and  regulation  overlap.  Mild  pre- 
heat treatments,  which  induce  the  synthesis  of 
HSPs,  protect  RNA  processing  from  disruption 
during  heat  shock.  Examination  of  yeast  cells 
carrying  mutations  in  various  HSP  genes  sug- 
gests that  both  the  HSP  104  and  the  HSPIO  genes 
play  a  role  in  protecting  processing  at  high 
temperatures. 

To  investigate  the  function  of  the  HSPs,  we 
have  created  a  series  of  mutations  in  the  genes  of 
both  yeast  and  Drosophila.  Kathy  Borkovich 
found  that  hsp83  is  essential  for  growth  at  all  tem- 
peratures in  yeast  cells,  but  that  it  is  required  at 
higher  concentrations  for  growth  at  higher  tem- 
peratures. Thus  induction  of  this  protein  is  re- 
quired for  cells  to  grow  at  the  upper  end  of  their 
normal  temperature  range.  We  believe  the  pro- 
tein is  needed  to  regulate  the  activity  of  a  wide 
variety  of  other  cellular  proteins  and  that  it  is 
needed  at  higher  concentrations  at  high  tempera- 
tures to  drive  the  equilibrium  of  these  interac- 
tions toward  complex  formation.  Recently,  in  col- 
laboration with  Keith  Yamamoto's  laboratory,  we 


have  demonstrated  that  hsp82  interacts  with  the 
steroid  hormone  family  of  receptors  and  helps 
these  proteins  fold  into  an  active  conformation. 

Yolanda  Sanchez  created  mutations  in  the 
HSP  104  gene  of  yeast.  The  mutations  have  no  ef- 
fect on  growth  at  high  or  low  temperatures.  How- 
ever, the  cells  are  unable  to  acquire  tolerance  to 
extreme  temperatures  when  given  a  mild  preheat 
treatment.  Thus  this  mutation  confirms  the  long- 
standing assumption  that  HSPs  play  a  vital  role  in 
establishing  thermotolerance.  More  recently,  we 
have  found  that  this  protein  is  highly  conserved 
in  mammals  and  in  prokaryotic  cells.  Moreover,  it 
appears  to  provide  protection  from  many  other 
forms  of  stress,  such  as  exposure  to  ethanol  and 
sodium  arsenite. 

In  Drosophila  our  mutational  analysis  has  con- 
centrated on  hsp70.  Jan  Rossi  found  that  varying 
the  level  of  hsp70  expression  in  Drosophila  cells 
varies  the  rate  at  which  cells  recover  from  heat 
shock.  Jonathan  Solomon  found  that  expressing 
hsp70  from  independently  regulated  promoters, 
in  the  absence  of  heat  shock,  helps  cells  to  sur- 
vive extreme  temperatures  but  inhibits  their 
growth.  This  particular  protein  thus  helps  to  pro- 
tect cells  from  the  ravages  of  extreme  tempera- 
tures but  is  disadvantageous  at  normal  tempera- 
tures. Further  experiments  will  focus  on 
examining  the  specific  cellular  and  developmen- 
tal processes  in  which  hsp70  plays  a  role. 

Finally,  Kent  Golic  has  taken  advantage  of  the 
heat-shock  response  to  devise  an  inducible  sys- 
tem for  site-specific  recombination  in  Drosoph- 
ila. He  placed  the  FLP  recombinase  gene  of  yeast 
under  the  control  of  the  hsp70  regulatory  se- 
quences and  transformed  Drosophila  embryos 
with  this  construct.  He  transformed  other  em- 
bryos with  a  recombinase  target,  consisting  of  an 
eye  color  gene  flanked  by  target  sequences  for 
FLP  recombination.  When  the  two  strains  were 
crossed  and  their  progeny  were  heat-shocked,  the 
eye  color  gene  was  excised  in  both  somatic  and 
germline  tissues,  with  the  frequency  of  excision 
depending  on  the  severity  of  the  heat  shock.  This 
system  is  being  used  in  our  laboratory  to  vary  the 
dosage  of  hsp  mutants,  but  it  has  many  other  po- 
tential applications  in  Drosophila  and  other 
organisms. 
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MATURATION  of  T  lymphocytes  involves  the 
interaction  of  surface  proteins  on  these 
cells  with  components  of  the  thymic  microen- 
vironment.  Thymocyte  precursors  (immature 
thymus-generated  cells)  bearing  appropriately 
rearranged  and  assembled  T  cell  receptors  are  se- 
lected through  this  process  of  cell-cell  interac- 
tion to  mature  and  migrate  to  peripheral  lym- 
phoid organs.  Selection  involves  elimination  of 
self-reactive  T  cells  and  preferential  outgrowth  of 
cells  capable  of  reacting  to  foreign  antigen  com- 
plexed  to  major  histocompatibility  complex 
(MHC)  molecules. 

Aside  from  the  T  cell  receptors,  two  other  cell 
surface  glycoproteins,  CD4  and  CDS,  play  central 
roles  in  this  selection  process.  Both  CD4  and  CDS 
molecules  are  expressed  on  immature  thymo- 
cytes, but  upon  maturation  the  gene  for  one  or 
the  other  is  shut  oflf.  Cells  that  have  receptors  for 
class  I  MHC  molecules  continue  to  express  CDS, 
but  shut  off  CD4;  cells  with  receptors  for  class  II 
MHC  molecules  express  CD4  and  shut  off  CDS. 
Our  laboratory  is  studying  the  mechanism  of  regu- 
lation of  the  CD4  and  CDS  genes  as  well  as  the 
molecular  basis  of  the  cell-cell  interaction  result- 
ing in  the  selection  of  cells  having  appropriate 
specificity. 

We  have  previously  used  a  cell-cell  adhesion 
assay  to  demonstrate  that  the  CDS  molecule  binds 
to  the  membrane-proximal  domain  of  the  class  I 
molecule.  Since  the  T  cell  receptor  binds  to  an- 
other region  of  this  molecule,  it  is  likely  that  the 
MHC  molecule  serves  as  a  bridge  to  bring  the  re- 
ceptor complex  and  CDS  into  close  contact. 

We  have  now  tested  the  hypothesis  that  bind- 
ing of  CDS  to  class  I  MHC  during  thymic  differen- 
tiation is  required  for  the  maturation  of  class  I- 
restricted  T  cells.  Transgenic  mice  that  express  a 
mutant  form  of  MHC-I  that  cannot  bind  CDS,  but 
can  interact  with  T  cell  receptors,  were  shown  to 
be  defective  in  developing  a  mature  T  cell  reper- 
toire. Analysis  of  T  cells  in  these  mice  provides 
evidence  that  CDS-MHC  binding  is  required  both 
for  intrathymic  deletion  of  self-reactive  T  cells 
and  for  positive  selection  of  useful  T  cells. 


To  study  the  roles  of  CD4  and  CDS  in  greater 
detail,  we  are  preparing  animals  that  are  defec- 
tive in  the  expression  of  these  molecules. 
Namely,  we  have  utilized  gene-targeting  technol- 
ogy to  mutate  the  CD4  gene  in  embryonal  stem 
cells  and  have  injected  these  cells  into  mouse 
blastocysts,  generating  chimeric  animals.  We  are 
attempting  to  propagate  the  mutant  gene  in  the 
mouse  germline,  and  the  immune  system  of  mice 
lacking  CD4  expression  will  be  studied.  We  have 
also  prepared  transgenic  mice  expressing  mutant 
forms  of  CD4  that  are  predicted  to  be  defective  in 
signal-transducing  functions,  and  these  will  be 
used  to  analyze  CD4  function  in  the  absence  of 
endogenous  CD4  expression. 

The  CD4  and  CDS  glycoproteins  have  been 
shown  to  function  importantly  in  the  activation 
of  peripheral  T  lymphocytes.  For  example,  artifi- 
cial crosslinking  of  CD4  or  CDS  to  the  T  cell  re- 
ceptor complex  results  in  T  cell  activation.  More- 
over, T  cells  that  have  lost  expression  of  either 
CD4  or  CDS  but  continue  to  express  the  T  cell 
receptor  are  usually  no  longer  responsive  to 
antigen. 

We  are  performing  a  variety  of  structure/func- 
tion  studies  to  determine  the  mechanism  through 
which  CD4  and  CDS  facilitate  signal  transduction 
in  T  cells.  It  is  known  that  CD4  and  CDS  are  asso- 
ciated with  a  cytoplasmic  tyrosine  kinase,  p56'''*, 
a  member  of  the  src  kinase  family.  The  sites  of 
interaction  between  these  molecules  map  to  cys- 
teine-containing  regions  in  the  cytoplasmic 
domains  of  CD4  and  CDS  and  in  the  unique 
amino-terminal  domain  of  p56'''*.  We  have  dem- 
onstrated that  only  CD4  molecules  that  can  asso- 
ciate with  the  kinase  are  functional  in  antigen- 
specific  T  cell  hybridomas,  which  normally 
produce  interleukin-2  upon  stimulation  with  the 
appropriate  antigen.  Since  interaction  of  CD4 
with  the  Ick  kinase  is  essential  for  T  cell  activa- 
tion, we  are  now  beginning  to  characterize  the 
proteins  that  serve  as  substrates  for  tyrosine 
phosphorylation . 

The  CD4  glycoprotein  is  doubly  important  be- 
cause it  is  the  receptor  for  the  human  immunode- 
ficiency virus  (HIV) .  We  have  completed  a  muta- 


289 


T  Cell  Surface  Glycoproteins  in  Development  and  Viral  Infections 


genesis  analysis  to  identify  the  region  of  CD4  that 
binds  to  the  viral  envelope  glycoprotein.  This  in- 
formation, coupled  to  the  recent  elucidation  of 
CD4's  three-dimensional  structure,  may  facilitate 
the  development  of  agents  that  block  binding. 

Although  human  cells  expressing  the  CD4  mol- 
ecule are  susceptible  to  HIV  infection,  murine 
cells  expressing  CD4  bind  virus  but  are  resistant 
to  infection.  This  resistance,  we  have  found,  is 
due  to  the  inability  of  the  viral  envelope  to  fuse  to 
the  murine  plasma  membrane.  It  is  likely  that  a 
human-specific  plasma  membrane  molecule 
other  than  CD4  is  required  for  fusion  of  virus  to 
target  cells.  To  identify  such  a  molecule  and  to 
isolate  its  gene,  we  have  packaged  a  selectable 
marker  within  the  HFV  particle.  Such  particles 
bind  to  CD4-bearing  human  cells,  are  internal- 
ized, and  are  processed  to  generate  a  proviral 
DNA  that  is  integrated  in  the  host  genome.  The 
cells  are  then  grown  in  selection  media. 

These  virus  particles  do  not  infect  CD4-bearing 
murine  cells.  But  if  the  particles  are  prepared 
with  an  envelope  glycoprotein  that  can  normally 
interact  with  murine  cells,  the  selectable  HIV  ge- 
nome is  integrated  in  the  cells.  Efforts  are  under 
way  to  transfer  human  genes  into  the  murine  cells 
to  permit  cell  survival  after  infection  with  the 


selectable  virus.  It  is  hoped  that  identification  of 
a  second  molecule  involved  in  HIV  entry  may 
lead  to  the  design  of  novel  agents  to  interfere 
with  the  virus's  spread. 

We  have  utilized  this  system  to  demonstrate 
that  the  HIV  envelope  glycoprotein  can  be  re- 
placed by  that  of  the  other  pathogenic  human  ret- 
rovirus, human  T  cell  leukemia  virus  (HTLV),  to 
form  HIV  (HTLV)  pseudotypes.  HTLV  causes  T 
cell  leukemias  and  lymphomas  and  myelopa- 
thies. Individuals  infected  with  both  HFV  and 
HTLV  have  more  rapid  progression  of  HFV  disease 
than  those  infected  with  HIV  alone.  Since  both 
viruses  infect  T  lymphocytes,  it  is  likely  that 
mixed  viral  particles  can  form  in  vivo.  We  have 
demonstrated  that  mixed  particles  (pseudo- 
types)  that  form  in  vitro  have  an  expanded  host 
range — i.e.,  HIV  particles,  endowed  with  the 
HTLV-I  envelope  glycoprotein,  can  readily  infect 
CD4-deficient  cells.  Thus  mixed  particles  may 
have  an  important  role  in  HFV  pathogenesis,  par- 
ticularly in  infection  of  cells  that  do  not  have  HIV 
receptors,  such  as  cells  of  the  central  nervous 
system. 

In  addition  to  investigating  HIV  infection,  we 
are  using  the  hybrid  particles  to  study  the  yet  un- 
characterized  HTLV  receptor. 
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THE  immune  system  is  involved  intimately  in 
our  body's  defense  against  invading  microor- 
ganisms and  tumors.  In  addition,  it  plays  a  central 
role  in  organ  graft  rejection  and  autoimmune  dis- 
eases such  as  systemic  lupus  erythematosus.  Its 
critical  role  in  maintaining  health  is  well  typified 
in  the  acquired  immune  deficiency  syndrome 
(AIDS) ,  in  which  destruction  of  a  specific  portion 
of  the  immune  system  results  in  a  fatal  disease. 

Our  investigation  is  focused  on  the  molecular 
mechanisms  that  allow  normal  and  abnormal  de- 
velopment of  the  antigen-specific  T  cells.  T  cells 
are  those  lymphocytes  (a  type  of  white  cell)  that 
depend  on  the  presence  of  the  thymus  gland  for 
maturation.  An  antigen  may  be  viewed  as  any 
marker  that  these  cells  recognize.  T  cells  are 
thought  to  play  a  central  role  in  the  regulation  of 
the  immune  response.  T  cells  recognize  antigens 
by  means  of  a  cell  surface  structure  called  the  T 
cell  receptor  (TCR) .  The  genes  that  are  responsi- 
ble for  the  expression  of  the  TCR  undergo  DNA 
gene  rearrangement  and  gene  activation  specifi- 
cally in  the  thymus  during  the  individual's  early 
development.  Once  the  TCR  is  expressed  as  pro- 
tein, it  is  the  interaction  of  the  TCR  with  its  anti- 
gen that  triggers  the  activation  of  T  cells,  result- 
ing in  an  immune  response.  The  ultimate  result  of 
such  a  response  may  be  either  defense  against  in- 
vading organisms  or  tissue  destruction,  as  seen 
in  transplantation  rejection  and  autoimmune 
phenomena. 

During  the  past  few  years,  my  laboratory  has 
concentrated  on  identifying  the  genetic  elements 
that  encode  the  TCR  genes.  Recently  we  have 
shifted  our  efforts  to  study  the  function  of  T  cells 
in  the  intact  animal.  Two  important  questions 
were  addressed.  1)  Why  are  we  tolerant  of  our 
own  tissues  and  organs?  2)  Why  are  transplanted 
organs  rejected  readily  (unless  they  are  carefully 
cross-matched)?  We  have  used  both  recombinant 


DNA  technology  and  our  ability  to  create  trans- 
genic mice  (mice  with  cloned  genes  incorpo- 
rated in  their  own  chromosomes)  to  study  these 
questions.  Two  kinds  of  transgenic  mice  have 
been  created.  One  kind  bears  transgenic  TCR 
genes;  the  other  has  transgenic  major  histocom- 
patibility complex  (MHC)  genes  (a  marker  that 
distinguishes  us  individually  during  transplanta- 
tion). By  introducing  these  genes  back  into  the 
mouse  itself  and  into  the  mouse  germline,  we  can 
determine  how  normal  T  cells  develop  by  study- 
ing how  the  TCR  and  MHC  interact  during 
development. 

This  strategy  has  been  very  successful.  By  creat- 
ing mice  of  appropriate  genetic  background,  we 
discovered  that  T  cells  that  are  self-recognizing 
and  hence  self-reactive  are  deleted  in  the  thymus 
during  development.  This  implies  that  part  of 
self-tolerance  is  accomplished  by  physical  elimi- 
nation of  self-reactive  T  cells.  Using  mice  con- 
taining transgenic  MHC,  we  showed  a  second 
mechanism  of  self-tolerance  that  does  not  involve 
physical  deletion.  In  this  case,  self-reactive  cells 
were  not  physically  eliminated  but  are  function- 
ally paralyzed.  These  studies  allow  us  to  lay  the 
foundation  to  study  how  T  cells  acquire 
self-tolerance. 

Most  recently,  we  have  developed  a  new  gener- 
ation of  TCR  transgenic  mice  in  which  we  can 
induce  massive  cell  death  when  the  appropriate 
antigen  is  administered  in  vivo.  These  studies 
prove  that  clonal  elimination  of  self-reactive 
cells  is  indeed  mediated  by  programmed  cell 
death.  These  new  approaches  should  shed  insight 
into  the  development  of  self-tolerance  and  the 
mechanism  of  programmed  cell  death  during  de- 
velopment. Since  distinguishing  what  is  self  and 
nonself  is  a  central  problem  in  immunology,  we 
hope  these  studies  will  increase  understanding 
of  transplantation  rejection  and  autoimmune 
phenomena. 
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THE  primary  long-range  goal  of  our  research  is 
to  understand  the  functions  of  molecules 
called  oligosaccharides  that  are  found  on  the  sur- 
face of  mammalian  cells  and  to  explain  how  the 
cells  regulate  expression  of  these  molecules.  Oli- 
gosaccharides consist  of  many  different  sugar 
structures  linked  together  in  complex  linear  and 
branching  arrays.  Quantitative  and  structural 
changes  in  such  molecules  have  been  show^n  to 
correlate  with  morphologic  changes  that  occur 
during  the  embryonic  development  of  animals 
and  in  association  with  neoplastic  transforma- 
tion. These  and  other  observations  suggest  that 
cell  surface  oligosaccharides  may  function  as  in- 
formation bearers  in  mediating  interactions  be- 
tween cells  during  the  developmental  process. 

Mammalian  cells,  in  constructing  these  mole- 
cules, use  special  proteins  called  glycosyltrans- 
ferase  enzymes.  With  few  exceptions,  a  unique 
glycosyltransferase  is  responsible  for  the  synthe- 
sis of  each  linkage  between  the  sugar  molecules 
in  an  oligosaccharide.  The  enormous  number  of 
diflf^erent  oligosaccharides  dictates  that  many  dif- 
ferent glycosyltransferases  will  enter  the  con- 
struction of  the  complex  cell  surface  carbohy- 
drates on  any  particular  cell  or  tissue. 

In  many  instances,  changes  in  cell  surface  car- 
bohydrate structure  observed  during  differentia- 
tion or  in  association  with  malignant  transforma- 
tion have  been  shown  to  correlate  with  changes 
in  the  glycosyltransferase  repertoire.  The  mecha- 
nisms by  which  cells  coordinate  and  regulate  the 
expression  of  these  enzymes,  and  thus  the  ex- 
pression of  oligosaccharide  structures  at  the  cell 
surface,  are  unknown.  During  the  past  few  years, 
the  main  focus  of  our  work  has  been  in  establish- 
ing systems  that  will  allow  molecular  analysis  of 
the  mammalian  genes  responsible  for  glycosyl- 
transferase synthesis. 

The  human  ABO,  H,  and  Lewis  blood  group  an- 
tigens are  actually  cell  surface  oligosaccharides. 
The  determinant  genes  encode  particular  glyco- 
syltransferases that  are  able  to  construct  the 
"blood  group"  molecules.  These  glycosyltrans- 
ferases provide  convenient  genetic  and  biochemi- 
cal models  for  studying  the  processes  that  allow 


mammalian  cells  to  regulate  cell  surface  oligosac- 
charide expression. 

The  blood  group  antigens  are  not  restricted  in 
their  expression  to  blood  cells.  They  are  found  on 
a  number  of  other  tissues  in  the  body,  suggesting 
that  tissue-specific  mechanisms  regulate  their  ex- 
pression. Moreover,  their  expression  changes 
during  human  embryonic  development  and  is  of- 
ten altered  in  malignancy. 

Our  initial  efforts  have  focused  on  developing 
systems  to  isolate  glycosyltransferase  genes  with- 
out the  benefit  of  purified  enzyme  protein.  In 
one  instance,  we  isolated  the  gene  for  human  H 
blood  group  a(l,2)fucosyltransferase,  using  a 
scheme  in  which  the  gene  was  first  transferred 
into  a  cultured  mouse  cell  line  by  a  process 
called  transfection.  We  were  then  able  to  use  mo- 
lecular cloning  procedures  to  retrieve  the  human 
gene  from  the  mouse  genome. 

We  have  also  used  a  similar  technical  approach 
to  isolate  other  glycosyltransferase  genes.  This 
scheme  involves  transfer  of  cloned  segments  of 
human  genes  back  and  forth  between  animal  cells 
and  the  bacterial  host  cells  used  for  molecular 
cloning  purposes.  With  this  approach  an  a (1,3/ 
1 ,4)fucosyl transferase  gene,  for  example,  was 
isolated  that  represents  the  human  Lewis  blood 
group  locus.  We  have  recently  used  the  H  and 
Lewis  gene  segments  and  cross-hybridization  pro- 
cedures to  isolate  still  other  glycosyltransferase 
genes  with  related  or  unique  structural  and  func- 
tional properties. 

The  cloned  gene  segments  in  each  case  repre- 
sent tools  for  investigating  the  function  and  regu- 
lation of  cell  surface  oligosaccharides.  For  exam- 
ple, we  recently  used  these  gene  segments  to 
identify  specific  cell  surface  oligosaccharide  mol- 
ecules that  play  pivotal  functional  roles  in  the 
inflammatory  process.  One  of  the  primary  events 
in  inflammatory  conditions  involves  a  process 
whereby  circulating  white  cells  leave  the  interior 
of  blood  vessels  and  become  localized  in  inflam- 
matory foci  outside  the  vascular  system.  This  pro- 
cess begins  when  the  endothelial  cells  lining  the 
blood  vessels  become  "activated"  by  substances 
that  accompany  an  incipient  inflammatory  condi- 
tion. Circulating  white  cells  known  as  neutro- 


295 


Molecular  Genetics  of  Mammalian  Glycosyltransferases 


phils  adhere  tightly  to  activated  endotheUum,  in- 
sinuate themselves  into  the  endothelial  cell  pave- 
ment lining  the  blood  vessel,  and  ultimately 
come  to  occupy  areas  outside  the  vascular  tree. 

Neutrophil  adhesion  to  activated  endothelium 
is  mediated  in  part  by  a  protein  known  as  endothe- 
lial leukocyte  adhesion  molecule  I,  or  ELAM-I, 
that  is  found  on  the  surface  of  activated  endothe- 
lial cells.  Structural  features  exhibited  by  ELAM-I 
suggested  that  it  might  interact  with  an  oligosac- 
charide molecule  specific  to  the  surface  of  neu- 
trophils, but  this  hypothesis  had  not  been  con- 
firmed, nor  had  the  nature  of  such  molecules 
been  defined.  By  transfecting  different  glycosyl- 
transferase  gene  segments  into  cultured  mamma- 
lian host  cells,  we  were  able  to  recapitulate  the 
biosynthesis  of  several  distinct  sets  of  cell  surface 
oligosaccharide  molecules.  We  found  that  one 
set  allowed  transfected  cells  to  adhere  to  ELAM-I. 

We  further  demonstrated  that  the  oligosaccha- 
ride molecules  were  one  or  more  members  of  a 
family  of  oligosaccharides  containing  sialic  acid 
and  fucose  and  were  represented  by  a  molecule 


known  as  the  sialyl  Lewis  X  determinant.  Subse- 
quent work  by  our  own  group  and  others  has  con- 
firmed that  these  and  other  glycosyltransferase 
genes  play  important  regulatory  roles  in  the  bio- 
synthesis of  oligosaccharides  recognized  by 
ELAM-I. 

Recently  it  has  also  been  shown  that  oligosac- 
charides related  or  identical  to  the  sialyl  Lewis  X 
determinant  may  function  in  adhesive  interac- 
tions exhibited  by  two  other  cell  surface  pro- 
teins, called  GMP-140  and  Mell4/Leu8.  These 
latter  proteins,  structurally  related  to  ELAM-I, 
also  function  to  mediate  leukocyte-endothelial 
cell  adhesion  in  inflammation  and  lymphocyte 
recirculation. 

Circumstantial  evidence  gathered  by  other  in- 
vestigators suggests  that  oligosaccharides  are  im- 
portantly involved  in  cell  adhesion  during  mam- 
malian embryogenesis.  We  are  directing  current 
efforts  toward  exploring  this  hypothesis  and 
characterizing  the  genes  that  determine  these  in- 
teractions, through  genetic  manipulation  of  the 
murine  genome. 
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AN  emerging  theme  in  vertebrate  develop- 
ment is  that  temporally  and  spatially  regu- 
lated signals  between  cell  populations  direct  pat- 
tern formation.  The  process  by  which  such 
signals  effect  differentiation  of  a  tissue  is  called 
induction.  A  major  goal  in  our  laboratory  is  to 
understand  what  these  inductive  signals  are  and 
how  their  activities  are  orchestrated. 

Recently  two  major  classes  of  genes  have  been 
found  to  have  an  important  role  in  vertebrate  de- 
velopment. Homeobox  genes,  the  first  of  these 
classes,  control  the  identity  of  individual  body 
parts.  They  have  in  common  a  180-base  pair  se- 
quence element,  the  homeodomain.  This  genetic 
element,  highly  conserved  in  evolution,  has  been 
shown  to  confer  the  sequence-specific  binding  of 
the  homeoprotein  to  DNA,  with  the  remainder  of 
the  protein  functioning  as  a  transcriptional  acti- 
vator or  repressor.  Moreover,  in  the  examples  an- 
alyzed, the  homeodomain  alone  can  interact  with 
DNA.  Although  the  target  genes  for  homeobox 
proteins  are  known  in  some  cases,  particularly  in 
Drosophila,  and  include  other  homeotic  genes, 
they  are  not  well  identified  or  understood  in 
mammals. 

From  work  in  both  Xenopus  and  Drosophila,  it 
has  become  clear  that  certain  growth  factors  con- 
stitute a  second  class  of  gene  products  important 
in  morphogenesis.  One  hypothesis  is  that  individ- 
ual homeobox  genes,  acting  alone  or  in  concert, 
also  regulate,  perhaps  directly,  the  function  of 
individual  growth-controlling  peptides.  It  is  al- 
ready known  that  members  of  a  family  of  peptide 
growth  factors,  called  transforming  growth  fac- 
tor-|8s  (TGF-i8s) ,  can  influence  the  expression  of 
certain  homeobox  genes.  A  possibility  that  indi- 
vidual homeobox  genes  might  also  regulate  TGF- 
i8s  or  other  growth  factors  is  suggested  by  coinci- 
dental patterns  of  expression  in  developing 
vertebrate  embryos,  as  well  as  from  genetic  ex- 
periments in  Drosophila.  This  raises  the  general 
question,  What  are  the  transcriptional  targets  of 
individual  homeobox-containing  genes  during 
vertebrate  embryogenesis?  We  have  taken  a 
multidisciplinary  approach  to  this  question, 
with  both  general  and  specific  avenues  of 
investigation. 


Homeobox  Genes  in  the  Developing 
Mouse  Heart 

The  embryonic  development  of  the  vertebrate 
heart  consists  of  an  interaction  between  two  con- 
centric layers  of  epithelium — an  outer  epithelial 
layer,  or  myocardium,  and  an  inner  epithelial 
layer,  or  endocardium.  The  outer  epithelial  layer 
has  been  shown  to  elaborate  a  soluble  factor  that, 
in  concert  with  TGF-/31,  triggers  the  transforma- 
tion of  the  inner  epithelium  into  an  undifferen- 
tiated mesenchyme.  This  subsequently  differen- 
tiates into  the  connective  tissue  of  the  valves  of 
the  adult  heart. 

In  the  mouse  embryo,  TGF-^1  is  expressed  in 
the  atrioventricular  canal,  the  site  of  valve  forma- 
tion, at  the  time  of  differentiation.  Interestingly, 
a  homeobox  gene  of  the  msh  class,  Hox-  7.1,  ap- 
pears to  share  during  development  a  common 
pattern  of  expression  with  TGF-|8 1  in  the  atrioven- 
tricular canal.  We  have  cloned  the  TGF-/?!  pro- 
moter and  regulatory  region  and  are  testing  its 
ability  to  interact  with  recombinant  Hox-  7. 1  ho- 
meobox proteins.  We  also  plan  to  determine  the 
sequence  of  DNA  responsible  for  binding  Hox- 
7. 1,  with  a  view  to  comparing  the  binding  sites  of 
msh  class  homeoboxes  with  those  of  the  Anten- 
napedia  class,  which  are  already  known. 

Using  a  degenerate  set  of  primers  engineered  to 
the  most  highly  conserved  regions  of  the  homeo- 
box, we  have  also  amplified  and  analyzed  addi- 
tional homeobox  sequences  from  reverse- 
transcribed  embryonic  cardiac  RNA  from  day  10 
of  mouse  embryogenesis. 

Homeobox  Gene  Expression  in  Developing 
Embryonic  Mouse  Kidney 

A  second  system  studied  in  our  laboratory  is  the 
developing  metanephros,  or  kidney.  Approxi- 
mately 20  different  homeobox  genes  have  been 
identified  in  the  day- 15  kidney  of  the  embryonic 
mouse.  These  genes  have  been  divided  into  three 
major  groups:  1)  genes  identical  to  known  ho- 
meobox sequences,  2)  genes  that  are  clearly  dif- 
ferent from  any  known  sequences,  and  3)  genes 
that  are  very  closely  related  to  known  sequences, 
but  still  different.  Immediate  goals  for  the  ho- 
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meobox  sequences  that  are  clearly  novel  are  to 
obtain  cDNA  clones,  to  map  them  in  the  mouse 
genome,  and  to  investigate  their  expression  pat- 
terns by  in  situ  hybridization  in  the  embryonic 
kidney.  A  longer-term  goal  is  to  identify  the  target 
genes  that  these  homeoboxes  interact  with  in 
mouse  kidney  development. 

Approaches  to  Identifying  Target  Genes  of 
Homeobox  Proteins 

Since  vertebrate  development  appears  to  pro- 
ceed by  a  sequential  hierarchy  of  transcriptional 
activation  (and  repression) ,  and  since  homeobox 
genes  are  apparently  important  in  mediating  such 
events,  we  have  sought  to  develop  a  technique 
for  identifying  genes  that  homeobox  genes  di- 
rectly regulate.  One  technique  being  explored 
makes  use  of  a  yeast  expression  system  in  which 
an  individual  yeast  cell  contains  an  expression 
plasmid  for  the  homeoprotein  whose  binding  site 
is  being  sought,  fused  to  Gal4,  a  strong  transcrip- 
tional activator.  This  homeobox-Gal4  fusion  pro- 
tein is  placed  under  the  control  of  an  inducible 


promoter,  so  that  the  synthesis  of  the  protein  can 
be  turned  on  or  off. 

In  yeast,  such  a  system  has  been  shown  capa- 
ble of  activating  a  target  plasmid  containing  a 
sequence-specific  binding  site  upstream  of  a  pro- 
moter controlling  the  expression  of  the  bacterial 
gene  lacZ,  which  thus  serves  as  a  reporter  gene. 
Conveniently,  /«cZ  encodes  an  enzyme,  jS-galac- 
tosidase,  that  will  cleave  a  fluorescent  substrate, 
resulting  in  a  fluorescent  yeast  cell.  Such  cells 
can  be  separated  with  the  fluorescence-activated 
cell  sorter  (FAGS).  We  hope  to  use  this  system  to 
remove  background  from  subgenomic  libraries 
and  then,  after  induction  of  homeobox  expres- 
sion, to  sort  cells  containing  specific  homeobox 
binding  sites. 

A  second  general  approach  under  consider- 
ation is  the  addition  of  labeled  or  retrievable  ho- 
meodomains  to  chromatin,  isolated  from  the  de- 
velopmental stage  and  tissue  of  interest.  This  is 
patterned  after  an  analogous  approach  used  in 
Drosophila.  However,  a  current  limitation  is  that 
high-titer  monoclonal  antibodies  are  not  avail- 
able for  many  mouse  homeobox  genes. 
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TWO  events  mark  the  reproductive  life  of  a 
cell:  replication  of  the  DNA  and  its  distribu- 
tion to  daughter  cells  at  mitosis.  Because  of  the 
central  importance  of  cell  reproduction  to  or- 
dered cell  growth,  cells  have  evolved  rigorous 
controls  to  ensure  that  both  processes  are  carried 
out  with  high  fidelity  and  at  the  appropriate  time. 
My  laboratory  is  interested  in  understanding  the 
nature  and  regulation  of  these  controls  with  re- 
spect to  how  a  cell  commits  itself  to  replicate  its 
DNA  and  how  it  knows  when  to  divide.  The  deci- 
sion to  get  ready  for  DNA  synthesis  is  made  in  the 
Gi  period  (after  mitosis  but  before  DNA  synthesis 
or  S  phase),  and  the  decision  to  begin  cell  divi- 
sion (M  phase)  is  made  in  G2  (after  S  phase  is 
complete) .  There  is  abundant  evidence  that  these 
decisions  are  made  at  checkpoints  or  restriction 
points  in  the  cycle.  The  nature  of  these  Gi  and  G2 
decision-making  periods  in  the  cell  cycle  under- 
lies fundamental  processes  operative  in  early  em- 
bryonic development  and  in  malignancy. 

G2      M  Regulation 

In  the  last  three  years  great  strides  have  been 
made  in  understanding  the  checkpoint  governing 
entry  into  mitosis.  For  many  years  it  had  been 
known  that  mitosis  is  dominant  over  interphase. 
That  is,  the  signal  that  tells  a  cell  to  go  into  mito- 
sis, if  inappropriately  expressed  earlier  in  the  cy- 
cle, can  cause  premature  mitosis  and  cell  death. 
Our  laboratory  developed  an  interphase  cell  ex- 
tract from  frog  eggs  in  which  addition  of  mitotic 
signals  caused  synthetic  nuclei  in  the  extract  to 
enter  mitosis  in  vitro.  We  then  purified  the  mito- 
sis-signaling enzyme  (called  M-phase  factor  or 
MPF)  and  found  that  it  was  composed  of  a  protein 
kinase  complexed  to  a  G2  cyclin.  Kinases  have 
the  ability  to  attach  a  phosphate  group  to  many 
different  cellular  proteins,  modifying  their  func- 
tion and  causing  profound  changes  in  cellular 
biochemistry.  The  protein  kinase  subunit  of  the 
enzyme  that  catalyzes  mitosis  was  identified  as  a 
vertebrate  homologue  of  the  cdc2  gene,  which 


had  been  genetically  implicated  in  the  control  of 
mitosis  by  the  study  of  certain  mutants  in  yeast. 

G2  cyclins  are  proteins  that  accumulate  during 
interphase,  reach  high  levels  in  late  G2  phase,  and 
are  then  degraded  near  the  metaphase/anaphase 
transition  in  mitosis.  This  degradation  is  required 
for  cells  to  complete  mitosis  successfully  and 
enter  Gi.  In  most  cells  there  are  two  classes  of  G2 
cyclins,  termed  A  and  B  cyclins,  that  differ  in  se- 
quence similarity  and  have  different  kinetics  of 
accumulation  and  degradation.  Both  bind  cdc2 
kinase,  but  A-type  complexes  are  activated  much 
earlier  in  the  cell  cycle  than  B-type  complexes. 
To  study  the  differences  in  A-  and  B-type  cyclins, 
we  have  expressed  the  frog  cyclin  genes  in  insect 
cells.  This  allows  us  to  make  large  quantities  of 
functionally  active  frog  cyclins,  which  are  able  to 
drive  synthetic  nuclei  into  mitosis  in  extracts 
after  complexing  with  cdc2  kinase. 

We  are  interested  in  the  mechanism  of  activa- 
tion of  MPF  in  oocytes  during  the  cell  cycles  of 
meiosis  I  and  II.  In  these  cycles  the  synthesis  of 
proteins  other  than  cyclin  is  required  for  activa- 
tion of  MPF.  One  protein  required  for  meiosis  I 
and  II  is  the  product  of  the  mos  proto-oncogene. 
Proto-oncogenes  are  the  normal  cellular  counter- 
part of  mutated  oncogenes  found  in  cancer  cells, 
suggesting  that  they  act  by  perturbing  normal  cel- 
lular pathways.  In  general,  very  little  is  known 
about  how  proto-oncogenes  work,  but  the  spe- 
cific involvement  of  mos  in  cell  cycle  control  is 
the  clearest  example  of  a  specific  function  for  any 
proto-oncogene  in  a  defined  cellular  process. 
The  mos  gene  encodes  a  serine/threonine  pro- 
tein kinase,  indicating  that  a  substrate  for  phos- 
phorylation by  mos  exists  that  can  lead  to  activa- 
tion of  MPF  as  well  as  stabilization  of  cyclin  in 
meiosis  II.  This  year  we  discovered  that  B-type 
Xenopus  cyclins  were  mos  phosphorylation  sub- 
strates and  that  the  affected  sites  were  distinct 
from  those  phosphorylated  by  cdc2  kinase  itself. 
As  soon  as  the  exact  site  of  mos  phosphorylation 
in  cyclin  is  determined,  we  can  ablate  the  site(s) 
and  evaluate  effects  on  cyclin  function  and  cell 
cycle  control. 
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Reentry  of  Quiescent  Cells  Into 
the  Cell  Cycle 

A  vast  fraction  of  the  body's  cells  do  not  ac- 
tively participate  in  the  cell  cycle.  Instead,  they 
are  in  a  quiescent  state  awaiting  a  signal  to 
reenter  the  cycle.  This  signal  is  often  in  the  form 
of  a  small  protein  termed  a  growth  factor.  Some 
forms  of  cancer  appear  to  arise  because  cells 
reenter  the  cycle  in  the  absence  of  a  growth  fac- 
tor, leading  to  uncontrolled  cell  growth.  Some 
cells  enter  quiescence  in  the  G2  phase.  However, 
most  quiescent  cells  waiting  for  proliferative  sig- 
nals are  in  a  specialized  state  of  the  Gj  phase  of 
the  cell  cycle  known  as  Gq.  Many  normal  cells 
often  enter  this  state  when  deprived  of  growth 
factors.  It  is  evident  that  the  transition  into  and 
out  of  the  Go  state  is  related  to  the  general  ques- 
tion of  how  a  cell  decides  it  is  time  to  prepare  for 
DNA  replication.  Available  evidence  suggests  that 
once  a  cell  is  committed  to  replicate  its  DNA,  it 
almost  invariably  progresses  on  through  the  cell 
cycle  until  the  G2  M  checkpoint  controls  be- 
come activated. 

Many  of  the  growth  factors  stimulate  quiescent 
cells  to  reenter  the  cell  cycle  by  binding  to  spe- 
cific receptors  on  the  cell  surface.  These  recep- 
tors are  actually  enzymes  expressing  a  protein  ki- 
nase activity  that  transfers  phosphate  groups  to 
tyrosine  residues  on  substrate  proteins.  Many  on- 
cogenes that  cause  cancer  turn  out  to  be  mutant 
versions  of  these  growth  factor  tyrosine  kinases 
that  signal  the  cell  to  reenter  the  cycle  in  an  un- 
controlled fashion.  We  have  undertaken  an  analy- 
sis of  the  steps  in  signaling  by  tyrosine  kinases, 
termed  the  signal  transduction  pathway,  in  order 
to  gain  a  better  understanding  of  how  to  control 
inappropriate  signaling.  Our  approach  has  been 


to  characterize  biochemically  an  event  rapidly 
stimulated  by  activated  tyrosine  kinase  receptors 
and  then  work  backward  up  the  signaling  path- 
way to  define  the  signal  transduction  mechanism. 

Over  the  past  five  years  we  have  identified  in 
ribosomes  a  protein  called  S6  that  is  phosphory- 
lated  on  serine  residues  very  rapidly  after  activa- 
tion of  tyrosine  kinase  receptors.  This  means  the 
signal  transduction  pathway  involves  receptor  ty- 
rosine kinase  activation  of  a  serine  kinase.  We 
have  shown  that  the  activation  does  not  occur  by 
direct  phosphorylation  of  the  S6  kinase  on  tyro- 
sine residues,  indicating  that  there  are  interme- 
diary molecules  that  participate  in  the  signaling 
mechanism.  Instead,  the  S6  kinase  is  activated  by 
direct  phosphorylation  on  serine  and  threonine 
residues  by  another  serine/threonine  protein  ki- 
nase called  microtubule-associated  protein  2  ki- 
nase or  MAP  kinase.  Xenopus  MAP  kinase  has 
been  purified  this  year  and  found  to  be  itself 
phosphorylated  on  both  threonine  and  tyrosine 
residues  by  additional  upstream  protein  kinases. 
This  means  we  are  getting  very  close  to  the  direct 
tyrosine  phosphorylation  events  stimulated  by  a 
tyrosine  kinase  receptor. 

In  both  G,  phase  after  the  Gq  transition  and  in 
G2-arrested  cells,  cdc2  kinase  becomes  activated 
after  reentry  into  the  cell  cycle  and  may  also  expe- 
rience an  increase  in  its  rate  of  synthesis,  indicat- 
ing that  it  works  at  both  checkpoints  in  the  cycle. 
In  the  Gi  phase,  cdc2  kinase  is  complexed  with  a 
different  group  of  proteins,  known  as  Gj  cyclins, 
whose  synthesis  increases  in  late  Gj.  AfuU  under- 
standing of  cell  cycle  control  will  come  when  the 
entire  sequence  of  events  from  tyrosine  kinase 
activation  to  cdc2  and  Gj  cyclin  synthesis  is 
elucidated. 


298 


The  Role  of  T  Cells  in  Health  and  Sickness 


Philippa  Marrack,  Ph.D. — Investigator 

Dr.  Marrack  is  also  a  member  of  the  Department  of  Medicine  at  the  National  Jewish  Center  for  Immunol- 
ogy and  Respiratory  Medicine  and  Professor  of  Biochemistry,  Biophysics  and  Genetics,  of  Microbiology 
and  Immunology,  and  of  Medicine  at  the  University  of  Colorado  Health  Sciences  Center,  Denver.  She  was 
educated  at  the  University  of  Cambridge,  England,  and  took  her  Ph.D.  in  biological  sciences  at  the  MRC 
laboratory  for  Molecular  Biology  in  Cambridge.  She  came  to  the  United  States  to  do  postdoctoral  work 
with  Richard  Dutton  at  the  University  of  California,  San  Diego.  From  there  she  moved  to  the  University 
of  Rochester  and  then,  after  seven  years,  to  her  present  position.  Dr.  Marrack  was  awarded  the  Wellcome 
Foundation  Prize  by  the  Royal  Society  and  is  a  member  of  the  National  Academy  of  Sciences. 


T cells  are  absolutely  essential  to  the  health  of 
higher  vertebrates.  They  patrol  the  body,  rec- 
ognize some  invaders  such  as  bacteria  and  vi- 
ruses, and  orchestrate  their  destruction.  The  im- 
portance of  these  cells  is  tragically  revealed  by 
the  devastating  effects  of  AIDS,  in  which  a  certain 
type  of  T  cell  is  inexorably  destroyed. 

T  cells  recognize  targets,  using  receptors  on 
their  surfaces.  Each  T  cell  has  about  20,000  iden- 
tical receptors,  and  those  on  any  given  T  cell 
differ  from  those  on  other  T  cells.  Thus  a  human 
being  has  about  a  million  million  T  cells,  all  bear- 
ing different  receptors  to  help  in  defense.  When 
an  organism  invades  the  body,  a  wide  array  of  T 
cell  receptors  is  available  to  bind  to  fragments  of 
the  organism,  called  antigens.  By  chance  the  re- 
ceptors on  some  of  the  T  cells  will  be  able  to  bind 
to  these  antigens.  These  T  cells  bearing  bound 
receptors  are  stimulated  to  multiply  and  destroy. 

How  does  the  body  develop  so  many  different  T 
cell  receptors?  This  is  achieved  by  combining  in 
the  receptor  one  each  of  several  elements  for  en- 
gagement. T  cell  receptors  are  made  up  of  five 
variable  elements:  Va,  Ja,  V/?,  D/S,  and  J/3.  Each  T 
cell  picks  for  its  receptor  1  Va  out  of  about  50 
available  Va's.  Likewise  the  cell  chooses  1  Ja 
from  a  collection  of  about  50  Ja's  and  1  V/3  from 
about  50  V|8's.  D/?  and  J|8  are  chosen  from  pools  of 
about  6  D|8's  and  13  J|8's,  respectively.  By  com- 
bining these  different  components  in  different 
ways,  the  total  T  cell  pool  manages  to  express 
many  receptors  and  hence  is  able  to  bind  many 
foreign  organisms. 

One  of  the  major  theoretical  problems  in  im- 
munology has  always  been  the  matter  of  toler- 
ance to  self.  If  our  T  cells  can  recognize  and  de- 
stroy most  foreign  material — including,  for 
example,  skin  grafts  from  some  close  relatives — 
how  is  it  that  our  T  cells  do  not  attack  and  destroy 
M5?  Recently  we  have  shown  that  tolerance  to  self 
is  created  by  at  least  two  different  means.  The 
first  involves  T  cell  death.  As  T  cells  are  develop- 
ing they  go  through  a  stage  at  which  the  binding 
of  material  to  their  receptors  causes  them  to  die. 
Developing  T  cells  are  almost  bound  to  be  ex- 


posed to  self;  therefore  cells  that  are  potentially 
self-reactive  will  die  at  this  stage.  At  a  later  stage 
in  the  life  history  of  T  cells,  another  mechanism 
comes  into  play,  and  T  cells  that  can  react  with 
self  appear  to  be  inactivated. 

We  do  not  yet  understand  why  T  cells  that  can 
react  with  self  are  killed  in  some  cases  and  inacti- 
vated in  others.  More  importantly,  perhaps,  we 
do  not  understand  why  T  cells  that  can  react  with 
self  sometimes  escape  both  these  processes  and 
emerge  later  to  cause  autoimmune  diseases. 
There  is  good  evidence,  for  example,  that  juve- 
nile diabetes  occurs  because  T  cells  in  some  chil- 
dren are  able  to  bind  to  material  made  by  the 
pancreas.  The  T  cells  in  question  exercise  this 
ability  and  destroy  the  pancreatic  cells  that  se- 
crete insulin. 

In  order  to  study  this  problem,  we  have  re- 
cently started  to  work  on  T  cells  in  humans,  in 
collaboration  with  Brian  Kotzin,  a  rheumatolo- 
gist  at  the  National  Jewish  Center.  Even  though  T 
cell  receptors  comprise  five  variable  compo- 
nents, Va,  VjS,  etc.,  we  found  to  our  surprise  that 
in  mice  we  could  understand  a  lot  about  T  cell 
specificity,  responses,  and  autoimmunity  by 
studying  just  one  part  of  the  T  cell  receptor,  V/?. 
For  this  reason  we  have  begun  our  clinical  studies 
by  developing  tools  to  explore  V(8  use  by  T  cells. 

At  first  we  developed  a  molecular  biological 
method — quantitative  polymerase  chain  reac- 
tion— to  measure  what  proportion  of  T  cells  in 
different  individuals  bore  a  particular  V/?.  Using 
this  method,  we  showed  that  patients  with  toxic 
shock  syndrome  have  very  large  numbers  of  T 
cells  bearing  V/32.  This  occurs  because  the  bacte- 
ria that  cause  toxic  shock,  Staphylococcus  au- 
reus, produce  a  toxin  that  stimulates  human  T 
cells  bearing  V(82  to  divide.  We  believe,  in  fact, 
that  the  disease  toxic  shock  is  caused  by  massive 
replication  of  V(82-bearing  cells  and  their 
copious  secretion  of  lymphokines,  hormone-like 
materials  that  are  useful  to  the  body  in  small 
quantities  but  which  at  high  concentrations 
cause  shock  and  death. 
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At  the  moment  we  are  applying  these  same 
methods  to  another  human  disease,  rheumatoid 
arthritis.  Although  the  reasons  some  persons  de- 
velop this  disease  are  not  fully  understood,  there 
are  many  indications  that  rheumatoid  arthritis  is 
triggered  by  a  T  cell  malfunction.  We  have  exam- 
ined T  cells  from  the  rheumatic  joints  and  blood 
of  patients  with  this  disease  and  have  found  that  T 
cells  bearing  V^l  4  are  selectively  deficient  in  the 
blood.  Moreover,  there  are  indications  that  such 
T  cells  may  accumulate  in  the  diseased  joints. 


Now  we  are  testing  the  ideas  that  rheumatoid  ar- 
thritis is  caused  by  malfunction  in  T  cells  bearing 
this  V/3  and  that  it  may  have  been  started  by  a 
toxin-like  substance  that  stimulates  T  cells  bear- 
ing V^l4. 

The  quantitative  polymerase  chain  reaction 
used  in  these  experiments  is  a  cumbersome  and 
difficult  technique.  With  a  view  to  making  exper- 
iments easier,  we  are  developing  a  collection  of 
monoclonal  antibodies  specific  for  different  hu- 
man T  cell  V|8s. 
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THE  purpose  of  our  research  is  to  elucidate  the 
mechanism  of  action  of  transforming  growth 
factor-^S  (TGF-/3)  and  other  factors.  Our  apprecia- 
tion of  the  complexity  of  the  TGF-/?  system  has 
escalated  dramatically  in  the  last  five  years.  TGF- 
(8  is  now  known  to  represent  a  vast  family  of  para- 
crine polypeptides  thought  to  be  involved  in 
many  processes  of  tissue  development,  morpho- 
genesis, recycling,  and  repair.  This  family  in- 
cludes homodimeric  and  heterodimeric  products 
of  more  than  20  genes.  The  distribution  of  TGF-/3- 
related  factors  is  widespread  in  organisms  from 
fruit  flies  to  humans,  and  their  evolutionary  con- 
servation is  unusually  strict.  They  are  actively  ex- 
pressed throughout  embryo  development  and 
into  adulthood,  in  discrete  regions  of  many  tis- 
sues with  characteristic  temporal  patterns. 

TGF-(8  exists  in  multiple  isoforms,  including 
TGF-j8l ,  -(82,  and  -|83  in  mammals.  These  isoforms 
show  many  similarities  in  their  biological  proper- 
ties, but  also  some  important  differences.  TGF-j8 
is  exceptional  in  its  multifunctional  nature,  act- 
ing, for  example,  as  a  paracrine  inhibitor  of  cell 
proliferation  and  a  regulator  of  cell  differentia- 
tion, cell  adhesion,  and  extracellular  matrix  de- 
position. Defects  in  TGF-/?  activity  as  a  growth 
suppressor  may  lead  to  oncogenesis,  and  its  ex- 
cessive activity  to  fibrotic  disorders. 

Mechanisms  of  TGF-/?  Action 

One  of  our  goals  is  to  identify,  clone,  and  char- 
acterize the  membrane  receptors  that  mediate 
TGF-/3  action.  To  this  end,  we  have  identified 
membrane  proteins  that  bind  TGF-(S  with  high  af- 
finity. These  include  TGF-/?  receptor  components 
I  and  II,  which  are  distinct  cell-surface  glycopro- 
teins of  53  and  70  kDa,  respectively.  They  are 
expressed  ubiquitously,  but  in  low  numbers,  in 
most  normal  and  transformed  cells,  with  the  nota- 
ble exception  of  human  retinoblastoma  cells. 
These  receptors  markedly  discriminate  between 
the  various  TGF-/3  isoforms.  A  third  TGF-(8-bind- 
ing  component,  betaglycan,  has  an  unusual  struc- 
ture; it  is  a  membrane-anchored  proteoglycan. 

Which  of  these  binding  components  mediates 
TGF-j8  action?  The  resolution  of  this  question 


made  use  of  one  of  the  most  remarkable  effects  of 
TGF-/3,  inhibition  of  cell  growth.  A  panel  of  mu- 
tant cell  clones  resistant  to  the  growth  inhibitory 
action  of  TGF-/3  was  isolated  by  selection  of  chem- 
ically mutagenized  cells  that  would  grow  in  the 
continuous  presence  of  TGF-/3.  Characterization 
of  these  mutants  showed  that  the  mutations  af- 
fected the  expression  of  TGF-/?  receptors  I  and  II 
but  not  betaglycan.  Moreover,  somatic  cell  hy- 
brids between  cell  mutant  clones  demonstrated 
that  all  TGF-j8  responses  were  regained  when  ex- 
pression of  receptors  I  and  II  was  restored  by 
complementation  between  different  cell  mu- 
tants. This  evidence  linked  receptors  I  and  II  to 
mediation  of  the  pleiotropic  action  of  TGF-/3  and 
indicated  that  both  receptor  components  cooper- 
ate to  mediate  the  multiple  effects  of  TGF-/?.  We 
are  developing  methodology  to  use  the  TGF-^S  re- 
ceptor-defective cells  to  clone  receptor  cDNAs. 

Progress  has  also  been  made  in  the  study  of  be- 
taglycan. This  integral  membrane  proteoglycan 
contains  heparan/chondroitin  sulfate  glycosami- 
noglycan  (GAG)  chains  attached  to  a  110-  to 
1 30-kDa  core  glycoprotein.  The  GAGs  are  not  re- 
quired for  TGF-^  binding  or  expression  of  beta- 
glycan on  the  cell  surface;  the  TGF-;8-binding  site 
resides  in  the  core  protein.  Soluble  forms  of  be- 
taglycan lacking  membrane  anchor  are  released 
into  the  medium  and  can  be  found  in  extracellu- 
lar matrices. 

Betaglycan  is  intriguing  as  a  novel  example  of 
regulatory  molecules  involved  in  the  biology  of 
growth  factors.  It  is  widely  distributed  in  tissues, 
but  absent  from  certain  cell  types  that,  neverthe- 
less, respond  to  TGF-/?.  No  evidence  is  presently 
available  for  a  direct  role  of  betaglycan  in  signal 
transduction.  It  may  be  involved,  however,  in  li- 
gand  presentation  to  TGF-/3  receptors  I  and  II  and 
could  function  as  a  pericellular  reservoir  of 
bioactive  TGF-jS.  Like  other  membrane  proteogly- 
cans, betaglycan  may  also  participate  in  cell  ad- 
hesion and  recognition.  We  have  purified  it  to 
homogeneity  in  order  to  investigate  it  further. 

Recent  progress  linked  growth  arrest  by  TGF-/? 
in  late  Gl  to  its  ability  to  control  the  phosphory- 
lation state  of  an  intracellular  growth  suppressor. 
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RB,  the  nuclear  phosphoprotein  product  of  the 
retinoblastoma  susceptibility  gene.  TGF-/?  arrests 
cell  growth  at  precisely  the  cell  cycle  point 
where  RB  phosphorylation  is  due  to  take  place. 
This  observation  led  to  the  finding  that  TGF-/3  in- 
hibits RB  phosphorylation,  retaining  this  protein 
in  its  presumed  growth  suppressor  state. 

Expression  of  the  SV40  transforming  protein, 
large  T  antigen,  which  binds  unphosphorylated 
RB  and  disrupts  its  growth-suppressive  function, 
prevented  growth  inhibition  by  TGF-/3  without 
affecting  its  inhibition  of  RB  phosphorylation  or 
other  effects.  Thus  TGF-|S's  inhibition  of  RB  phos- 
phorylation is  not  an  irrelevant  event  but,  possi- 
bly, one  that  mediates  cell  cycle  arrest  by  this 
factor.  These  results  represent  a  case  in  which  a 
prototypic  intracellular  growth  suppressor  (RB) 
participates  in  the  mechanism  of  action  of  a 
growth-inhibitory  paracrine  agent  (TGF-|S) . 

The  mechanisms  involved  in  growth  suppres- 
sion by  TGF-/3  might  also  relate  to  those  that  lead 
to  its  control  of  cell  differentiation.  This  effect 
was  best  studied  with  skeletal  muscle  myoblasts. 
TGF-18  inhibited  myoblast  differentiation  in  part 
by  interfering  with  the  expression  of  myogenin,  a 
differentiation-determining  gene.  Other  studies 
showed  that  TGF-jS  is  a  major  regulator  of  the 
cell  adhesion  apparatus — cell  adhesion  recep- 
tors and  extracellular  matrix  protein  expression 
— in  many  cell  types.  These  effects  could  in  turn 
explain  the  ability  of  TGF-)8  to  affect  the  morphol- 
ogy and  phenotype  of  cells  and,  most  impor- 
tantly, explain  its  presumed  roles  in  tissue  mor- 
phogenesis and  fibrotic  disorders.  Effects  of 
TGF-|S  on  gene  expression  might  be  mediated  by 
its  ability  to  control  the  levels  or  activity  of  cer- 
tain transcription  factors,  a  possibility  that  is  the 
focus  of  current  studies. 

Cell-Cell  Communication  by 
Membrane-anchored  Growth  Factors 

Our  research  interests  extend  also  to  the  area  of 


growth  factors  that  are  synthesized  as  part  of 
transmembrane  protein  precursors.  The  model 
system  in  our  studies  is  transforming  growth  fac- 
tor-a  (TGF-a).  This  is  a  50-amino  acid  polypep- 
tide initially  identified  in  culture  fluids  of  trans- 
formed cells,  hence  its  name.  It  is  not  related  to 
TGF-/5  but  shares  structural  homology  with 
members  of  the  epidermal  growth  factor  (EGF) 
family.  TGF-a  is  most  prevalent  in  tumor-derived 
cells  and  causes  a  high  incidence  of  mammary 
and  liver  neoplasias  in  transgenic  mice  that 
chronically  overexpress  it.  The  factor  stimulates 
cell  growth  through  the  EGF  membrane  receptor, 
which  is  a  ligand-activated  tyrosine  kinase. 

Paracrine  growth  factors  and  polypeptide  hor- 
mones are  generally  synthesized  as  larger  soluble 
precursors  that  are  cleaved  by  specific  peptidases 
to  release  the  bioactive  domain.  In  contrast  to 
this,  a  remarkable  feature  of  TGF-a,  shared  with 
the  other  members  of  its  family,  is  its  generation 
by  cleavage  of  a  membrane-anchored  precursor, 
pro-TGF-a.  This  is  a  1 60-amino  acid  polypeptide 
with  the  TGF-a  sequence  in  the  extracellular  do- 
main. The  proteolytic  process  that  releases  ma- 
ture TGF-a  from  its  precursor  is  inefficient  in 
most  cell  types.  Consequently,  substantial  levels 
of  pro-TGF-a  can  accumulate  on  the  cell  surface. 

Membrane  pro-TGF-a  can  bind  EGF  receptors 
on  the  surface  of  adjacent  cells.  This  interaction 
leads  to  signal  transduction  and  DNA  replication. 
Pro-TGF-a  also  functions  as  a  mediator  of  cell- 
cell  adhesion.  Thus  a  membrane-anchored 
growth  factor  and  its  receptor  can  function  simul- 
taneously as  mediators  of  cell-cell  adhesion  and 
as  initiators  of  mitogenic  stimulation  by  cell-cell 
contact.  The  membrane-anchored  forms  could 
be  important  in  tissue  development  processes 
whose  guidance  depends  on  discrete  cell-cell  in- 
teractions incompatible  with  the  diffusible  na- 
ture of  soluble  factors.  Given  their  structure,  the 
membrane  forms  might  function  as  signaling  re- 
ceptors. This  possibility  is  under  study. 
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OUR  laboratory'  uses  x-ray  crystallography,  in 
concert  with  other  techniques,  to  address 
some  of  the  fundamental  problems  in  biology: 
How  do  proteins  spontaneously  fold  into  their 
biologically  active  three-dimensional  configura- 
tions? What  determines  the  stability  of  these 
folded  proteins,  and  can  stability  be  improved? 
How  do  proteins  interact  with  each  other?  How 
do  proteins  interact  with  DNA?  How  do  enzymes 
act  as  catalysts? 

The  Protein  Folding  Problem 

An  area  of  long-standing  interest  is  the  so-called 
protein  folding  problem.  How  does  a  newly  syn- 
thesized, extended  peptide  chain  "know"  how  to 
fold  spontaneously  into  its  active  three-dimen- 
sional shape? 

Although  it  has  long  been  recognized  that  the 
amino  acid  sequence  of  a  protein  determines  its 
three-dimensional  structure,  recent  work  from 
several  laboratories  has  made  it  clear  that  certain 
amino  acids  are  more  important  than  others  in  the 
folding  process.  At  some  positions,  typically  the 
solvent-exposed  mobile  sites  in  the  folded  pro- 
tein, amino  acids  can  be  interchanged  at  random 
with  little  apparent  effect  on  folding  or  stability. 
These  amino  acids  seem  to  be  unimportant  in 
protein  folding.  On  the  other  hand,  interchange 
of  amino  acids  in  buried  or  rigid  parts  of  a  folded 
protein  can  destabilize  it,  suggesting  that  the 
amino  acids  at  these  positions  are  important  in 
determining  the  folded  conformation. 

To  try  to  simplify  the  complexity  of  the  folding 
problem,  an  attempt  has  been  made  to  replace 
some  of  the  "nonessential"  amino  acids  with  ala- 
nine in  phage  T4  lysozyme.  Such  a  "polyalanine" 
protein  would,  in  principle,  truncate  all  nones- 
sential side  chains  and  permit  focus  on  those 
parts  of  the  amino  acid  sequence  that  are  critical 
for  the  folding  process. 

As  a  first  step,  a  series  of  alanines  was  intro- 
duced within  the  a-helix  that  includes  residues 
1 26- 1 34  of  T4  lysozyme.  The  somewhat  surpris- 
ing result  was  that  alanines  were  not  only  toler- 
ated at  most  positions  in  the  a-helix;  they  actually 
increased  the  protein's  stability.  This  indicates 


that  alanine  is  a  strongly  helix-favoring  residue.  It 
also  suggests  that  the  replacement  of  solvent- 
exposed  residues  of  a-helices  with  alanines 
might  provide  a  means  of  increasing  the  stability 
of  other  proteins.  Finally,  the  fact  that  a  series  of 
alanines  can  be  introduced  into  T4  lysozyme  con- 
firms that  this  might  be  a  way  to  simplify  the  pro- 
tein folding  problem. 

Engineering  Proteins  of  Enhanced  Stability 

We  are  using  the  lysozyme  from  bacteriophage 
T4  to  define  the  contributions  that  different  types 
of  interaction  (hydrogen  bonds,  hydrophobic  in- 
teractions, salt  bridges,  etc.)  make  to  the  stability 
of  proteins.  Much  of  our  emphasis  during  the  past 
year  has  been  on  gaining  a  better  understanding 
of  electrostatic  interactions.  It  is  known  that  they 
can  be  strong  in  some  cases  but  seemingly  weak 
in  others.  To  try  to  explain  this,  we  first  investi- 
gated the  effect  of  long-range  electrostatic  inter- 
actions by  using  genetic  engineering  to  change 
the  charge  of  a  number  of  groups  on  the  surface 
of  phage  T4  lysozyme.  We  were  able  to  reduce 
the  overall  charge  on  the  protein  from  +9  units  to 
+  1  unit.  Nevertheless,  there  was  almost  no 
change  in  protein  stability,  indicating  that  the  in- 
teraction between  the  different  charged  groups  is 
very  weak. 

Next  we  examined  short-range  electrostatic  in- 
teractions by  introducing  negatively  charged 
groups  adjacent  to  positively  charged  amino 
acids  on  the  surface  of  the  protein.  Again,  there 
was  almost  no  change  in  the  protein  stability. 

Finally,  we  introduced  charged  groups  next  to 
the  ends  of  a-helices.  In  this  case  an  increase  in 
stability  was  consistently  observed,  indicating 
that  there  are  favorable  electrostatic  interactions 
between  charged  amino  acids  and  the  charges  on 
the  ends  of  the  a-helices.  This  is  the  so-called 
"helix  dipole"  effect. 

It  is  at  first  surprising  that  electrostatic  interac- 
tions between  charged  groups  on  the  surface  of  a 
protein  tend  to  be  weak,  whereas  interactions 
with  the  a-helix  dipole  tend  to  be  much  stronger. 
The  reason  for  the  difference,  we  believe,  is  that 
charged  groups  on  the  surface  of  a  protein  tend  to 
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be  very  mobile,  whereas  the  a-helix  dipoles  are 
locaHzed  during  the  folding  of  the  protein. 

The  effectiveness  of  the  a-helix  dipole  in  con- 
tributing to  protein  stability  is  a  verification  of 
our  earlier  results.  Such  studies  are  indicating 
ways  in  which  the  stabilities  of  proteins  might  be 
improved  by  genetic  engineering.  We  hope  to  be 
able  eventually  to  stabilize  biological  molecules 
used  in  medicine,  such  as  vaccines,  and  to  im- 
prove enzymes  that  are  used  in  the  pharmaceuti- 
cal and  food  industries. 

Receptor-Ligand  Interaction 

To  develop  an  understanding  of  the  mode  of 
action  of  growth  factors  and  their  interactions 
with  their  receptors,  we  have  crystallized  and  de- 
termined the  high-resolution  structure  of  human 
fibroblast  growth  factor.  The  structure  was  found 
to  be  very  similar  to  that  of  interleukin-ljS.  It 
seems  clear  that  many  growth  factors  have  similar 
overall  structures,  but  the  exact  relationship  of 
these  factors  in  the  vicinity  of  their  receptor- 
binding  regions  remains  to  be  clarified. 

Protein-DNA  Interaction 

We  have  been  interested  for  some  time  in  the 
interaction  between  proteins  and  nucleic  acids. 
In  1981  we  determined  the  structure  of  the  Cro 
repressor  protein  of  bacteriophage  X  (bacteria- 
infecting  virus) .  Cro  has  served  as  one  of  the  pro- 
totypical examples  of  a  DNA-interacting  protein. 
This  small  dimeric  protein  recognizes  and  binds 
to  a  specific  set  of  17  base  pairs  on  the  phage 
genome. 


The  structure  of  Cro,  as  determined  crystallo- 
graphically,  suggested  that  a  characteristic  part  of 
the  protein,  now  known  as  the  helix-tum-helix 
motif,  is  especially  important  in  DNA  binding. 
The  helix-tum-helix  unit  can  be  considered  as  a 
"reading  head"  that  fits  into  the  grooves  of  the 
DNA  and  matches  the  DNA  structure  at  the  spe- 
cific recognition  site.  It  is  now  known  to  occur  in 
a  large  number  of  DNA-binding  proteins,  and  its 
functional  role  has  been  confirmed  by  structures 
of  several  DNA- protein  complexes. 

Recently  we  have  determined  the  crystal  struc- 
ture of  Cro  protein  in  complex  with  a  tight-bind- 
ing, 1 7-base  pair  DNA  operator.  We  are  improv- 
ing the  accuracy  of  the  structure  by  a  process  of 
crystallographic  refinement.  In  general  terms  the 
structure  of  the  complex  supports  the  model  for 
Cro-DNA  interaction  that  was  proposed  on  the 
basis  of  the  uncomplexed  protein. 

The  Cro  dimer,  however,  undergoes  a  substan- 
tial conformational  change  relative  to  the  un- 
complexed crystal  structure.  One  monomer  ro- 
tates relative  to  the  other  by  about  45°.  This 
supports  the  idea  that  the  Cro  dimer  is  very  flexi- 
ble in  solution.  The  DNA  maintains  an  essentially 
Watson-Crick  conformation,  but  is  bent  in  the 
middle  of  the  operator  into  the  shape  of  a  shallow 
boomerang. 

We  have  recently  obtained  high-quality  crys- 
tals of  the  biotin  repressor  from  Escherichia  coli. 
Determination  of  the  structure  of  this  more  com- 
plicated protein,  which  not  only  binds  DNA  but 
also  acts  as  an  enzyme,  is  well  under  way. 


504 


Structure  of  fibroblast  growth  factor.  Figure  is  a  stylized  computer- graphics  representation  of  the 
three-dimensional  structure  of  human  fibroblast  growth  factor.  The  core  of  the  molecule,  consist- 
ing of  a  series  of  12  extended  segments  (shown  as  arrows ),  appears  to  be  common  to  a  number  of 
growth  factors.  The  loop-like  connections  on  the  surface  include  the  receptor-binding  region  and 
the  heparin- binding  region,  which  are  seen  to  be  far  apart. 

Research  and  photograph  by  Elizabeth  Eriksson  and  Larry  Weaver  in  the  laboratory  of  Brian  W. 
Matthews. 
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FOR  the  past  several  years  my  colleagues  and  I 
have  studied  a  mammalian  transcription  fac- 
tor termed  CCAAT/enhancer-binding  protein  (C/ 
EBP) .  This  protein  is  capable  of  binding  to  DNA 
in  a  sequence-specific  manner  and  thereby  regu- 
lating gene  expression.  By  studying  the  detailed 
properties  of  C/EBP,  v^e  have  sought  to  develop  a 
better  understanding  of  how  genes  are  regulated 
in  mammalian  cells. 

These  efforts  have  been  revi^arded  by  a  surpris- 
ing discovery.  The  mechanism  by  which  C/EBP 
binds  DNA  is  common  to  that  used  by  many  other 
sequence-specific  DNA-binding  proteins,  includ- 
ing the  products  of  two  important  proto-onco- 
genes.  The  observations  that  led  to  this  discovery 
were  as  follows. 

After  sequencing  the  gene  encoding  C/EBP,  we 
fed  its  conceptually  translated  amino  acid  se- 
quence into  a  computer  database  of  known  pro- 
tein sequences.  Much  to  our  delight,  a  60-amino 
acid  segment  of  C/EBP  proved  to  be  sequence 
related  to  the  products  of  the  Fos  and  Jun  proto- 
oncogenes.  Earlier  studies  on  C/EBP  had  shown 
that  this  same  region  of  the  protein  was  responsi- 
ble for  its  ability  to  bind  DNA.  In  other  words,  the 
region  of  C/EBP  that  was  related  in  amino  acid 
sequence  to  the  FOS  and  JUN  proteins  was  its 
DNA-binding  domain. 

Next  we  began  to  focus  attention  on  the  amino 
acids  that  evolution  has  conserved  among  the 
three  proteins.  We  reasoned  that  the  related  se- 
quences might  reveal  the  underpinnings  of  a 
structural  motif  that  facilitates  specific  interac- 
tion with  DNA.  The  amino  acid  sequences  shared 
by  the  three  proteins  occurred  in  two  patches. 
One  patch  contained  a  conserved  set  of  basic 
amino  acids — arginines  and  lysines.  On  the  car- 
boxyl-terminal  side  of  this  "basic  region,"  each 
protein  exhibited  leucine  residues  spaced  every 
seven  amino  acids. 

Bill  Landschulz,  then  an  M.D./Ph.D.  student  in 
the  laboratory,  noticed  that  neither  the  basic  re- 
gion nor  leucine  repeat  region  contained  proline 
or  glycine  residues.  Since  prolines  and  glycines 
tend  to  be  incompatible  with  a-helical  structure, 
we  reasoned  that  much  of  the  DNA-binding  do- 


main of  C/EBP,  as  well  as  FOS  and  JUN,  might  be 
a-helical.  This  prediction  of  a-helical  structure 
provided  an  attractive  role  for  the  repeating  leu- 
cine residues.  Since  the  repeating  period  of  a- 
helices  is  3.5  amino  acids  per  helical  turn,  and 
since  the  leucines  were  spaced  at  a  heptad  inter- 
val, the  putative  helix  projected  a  continuous 
array  of  leucine  residues  from  one  helical  face. 

What  might  be  the  role  of  an  a-helix  that  dis- 
plays repeating  leucines  along  one  of  its  sides? 
Chemists  have  long  known  that  leucines  are  un- 
usually hydrophobic.  Rather  than  being  exposed 
to  an  aqueous  or  hydrophilic  environment,  hy- 
drophobic amino  acids  prefer  to  interact  with 
other  hydrophobic  compounds,  most  often 
within  the  internal  fold  of  a  protein.  Following 
this  lead  we  hypothesized  that  the  a-helical  re- 
gion displaying  a  repeated  array  of  leucines  might 
represent  a  dimerization  interface,  allowing  two 
polypeptide  chains  to  coalesce  along  the  helical 
face  that  contained  the  repeating  array  of  leu- 
cines. We  termed  this  hypothetical  structure  the 
"leucine  zipper." 

Elegant  experiments  by  Peter  Kim  (HHMI)  and 
his  colleagues  at  the  Whitehead  Institute  con- 
firmed the  general  tenets  of  the  zipper  hypothesis 
and  further  established  that  zippered  helices  as- 
sociate with  each  other  in  a  parallel  orientation. 

During  the  time  that  we  were  developing  the 
zipper  idea,  research  from  several  other  laborato- 
ries demonstrated  that  the  FOS  and  JUN  polypep- 
tides were  capable  of  stable  association.  Recog- 
nizing this  fact.  Bill  Landschulz,  Peter  Johnson, 
and  I  predicted  that  FOS  and  JUN  would  associate 
as  dimers  by  virtue  of  their  respective  leucine 
zippers.  Supportive  evidence  has  been  obtained 
by  Robert  Tjian  (HHMI,  University  of  California, 
Berkeley),  Edward  Ziff"  (HHMI,  New  York  Univer- 
sity Medical  School),  and  Daniel  Nathans  (HHMI, 
the  Johns  Hopkins  University  Medical  School). 

If  the  leucine  zipper  region  of  each  of  these 
proteins  is  indeed  responsible  for  allowing  pro- 
tein dimers  to  form,  what  is  the  role  of  the  basic 
region?  Knowing  that  the  substrate  for  binding  by 
these  proteins  is  DNA — a  negatively  charged 
polymer — we  reasoned  that  the  basic  region 
might  facilitate  direct  contact  with  DNA.  Such 
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speculation  provided  a  satisfying  fit  between  the 
dimeric  proteins  and  their  DNA-binding  sites. 

A  common  property  shared  by  the  binding  sites 
for  leucine  zipper  proteins  is  dyad  symmetry. 
Two  short  sequences,  related  to  each  other  as  are 
the  symmetric  halves  of  a  restriction  endonucle- 
ase  recognition  site,  are  directly  abutted.  The  ba- 
sic region  of  one  polypeptide  chain  would  fit  one 
half  of  the  dyad  symmetric  binding  site,  while 


that  of  the  other  chain  would  fit  the  other  half. 
Evidence  has  recently  been  obtained  by  a  number 
of  different  laboratories  in  support  of  this  idea. 
The  acid  test,  however,  will  come  upon  resolu- 
tion of  the  actual  molecular  structures  of  leucine 
zipper  proteins.  Paul  Sigler  (HHMI,  Yale  Univer- 
sity) and  Stephen  Harrison  (HHMI,  Harvard  Uni- 
versity) are  well  on  the  way  to  crystallizing  two 
different  leucine  zipper  proteins. 
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ION  channels  are  the  most  fundamental  ele- 
ments of  molecular  hardware  in  the  nervous 
system.  They  are  the  membrane-spanning  pro- 
teins that  directly  mediate  the  transmembrane 
ion  fluxes  giving  rise  to  the  generation,  propaga- 
tion, and  integration  of  electrical  signals  in  neu- 
rons, muscle,  and  other  electrically  active  cells. 
By  forming  aqueous  pores  right  through  the  heart 
of  the  channel  protein  (and  hence  across  the 
membrane  that  the  protein  spans),  channels  act 
as  "leakage"  pathways  for  ions  down  their  prees- 
tablished  thermodynamic  gradients.  These  pro- 
teins are  intelligent  leaks.  Channels  can  discrimi- 
nate fiercely  among  the  different  species  of 
inorganic  ions  present  in  the  aqueous  solutions 
bathing  the  cell  membrane.  They  can  also  open 
and  close  their  conduction  pores  rapidly  in  re- 
sponse to  external  signals,  such  as  binding  of  neu- 
rotransmitters or  changes  in  the  transmembrane 
electric  field. 

Work  here  is  directed  toward  questions  of  basic 
molecular  mechanisms  of  ion  channel  operation 
and  of  the  underlying  protein  structures  in- 
volved. Since  no  high-resolution  structures  have 
been  obtained  for  this  class  of  proteins  (and  since 
none  is  coming  over  the  horizon) ,  one  must  draw 
structural  inferences  from  close  examination  of 
ion  channel  function.  This  can  be  done  because 
ion  channels,  unique  among  all  classes  of  pro- 
teins, can  be  studied  at  the  single-molecule  level. 

In  this  laboratory,  heavy  use  is  made  of  the  tech- 
nique of  "single-channel  reconstitution,"  in 
which  individual  ion  channel  molecules  are  in- 
serted into  an  artificial  membrane  under  simple, 
chemically  controllable  conditions.  This  ap- 
proach has  allowed  us  to  develop  crude  physical 
pictures  of  several  ion  channels,  in  which  crucial 
dimensions  have  been  deduced:  the  conduction 
pore's  width  and  length,  the  distance  of  the  pore 
entryway  from  the  lipid  bilayer  surface,  and  the 
number  of  ions  inside  the  channel  during  the 
conduction  process.  We  are  currently  comple- 
menting these  purely  functional  and  mechanistic 
studies  with  recent  advances  in  methods  of  mem- 
brane protein  biochemistry  and  manipulation  of 
ion  channels  at  the  genetic  level. 


Use  of  Peptide  Neurotoxins  as  Probes 
of      Channel  Structure 

Charybdotoxin  (CTX)  is  a  scorpion  venom- 
derived  peptide  that  blocks  a  small  family  of 
K^-specific  channels.  Having  shown  that  it  acts  by 
physically  plugging  up  the  channel's  externally 
facing  "mouth,"  we  are  now  utilizing  the  pep- 
tide as  a  probe  of  this  important  region  of  the 
channel.  Employing  site-directed  mutagenesis 
with  the  Drosophila  Shaker  channel,  we  have 
identified  residues  that  locally  and  specifically 
alter  the  binding  of  CTX.  These  residues  are  evi- 
dently located  near  the  ion  entryway,  so  we  are 
homing  in  on  regions  of  the  protein  that  form  the 
transmembrane  pore.  Two  developments  of  the 
past  year  have  placed  us  in  position  to  use  this 
toxin  as  a  structural  probe  of  the  K"^  channel's 
outer  mouth.  First,  using  two-dimensional  nu- 
clear magnetic  resonance,  we  determined  the 
solution  structure  of  the  toxin.  Second,  we  con- 
structed a  synthetic  gene  for  CTX  and  success- 
fully overexpressed  the  fully  functional  peptide 
in  Escherichia  coli.  Using  high-level  expression, 
routine  structure  determination  of  the  toxin,  and 
site-specific  mutagenesis  of  both  toxin  and  chan- 
nel, we  are  now  attempting  to  map  the  locations 
in  the  K"*"  channel  mouth  of  residues  that  make 
direct  contact  with  residues  on  CTX. 

Purification  and  Reconstitution 
of  Cl"  Channels 

The  electric  ray  Torpedo  californica  carries  in 
its  electric  organ  a  C 1  "-specific  channel  with  an 
unusual  structural  characteristic.  The  channel  is 
built  as  a  dimeric,  or  "double-barreled,"  com- 
plex, with  two  identical  Cl~  diffusion  pathways 
in  a  single  molecular  unit.  We  have  developed  a 
functional  assay  for  this  channel  protein  in  a  solu- 
bilized  state  and  are  presently  using  it  to  perform 
conventional  purification  studies.  We  intend  to 
study  the  purified  channel  protein  as  a  supple- 
ment to  structure-function  work  at  the  cDNA 
level  on  this  recently  cloned  anion  channel. 

Structure-Function  Relations  in  a  Minimal 
K+  Channel 

We  are  beginning  a  structure-function  analysis 
on  a  K^-specific  channel  that  was  first  cloned 
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from  a  kidney  cDNA  library  and  that  shows  a  re- 
markable molecular  property:  a  very  small  polypep- 
tide of  only  130  amino  acids — some  10-fold 
smaller,  for  instance,  than  the  voltage-dependent 
Na^  channel.  We  have  constructed  and  expressed  a 
fully  synthetic  gene  for  this  "mink"  channel,  using 
the  degeneracy  of  the  genetic  code  to  build  a  large 
number  of  unique  restriction  sites  throughout  the 
coding  sequence.  Thus  positioned  to  perform  rou- 
tine cassette  mutagenesis,  we  are  initiating  a  search 
for  functional  domains  of  the  channel  and  develop- 
ing direct  tests  to  settle  whether  this  gene  does  in 
fact  code  for  an  ion  channel  at  all,  a  basic  question 
that  has  not  yet  been  answered  rigorously.  On  the 
basis  of  point  mutants  that  lead  to  changes  in  ion 
selectivity,  we  consider  that  mink  is  a  structural 
gene  for  an  ion  channel. 

High-Level  Expression  of  Ion 
Channel  Proteins 

Over  the  past  few  years,  ion  channel  genes 


have  been  cloned  at  a  rapidly  accelerating  pace, 
and  they  can  be  expressed  in  systems  that  allow 
high-sensitivity  electrophysiological  assays. 
But  channels  are  proteins,  and  it  would  be  de- 
sirable to  have  a  high-level  expression  system 
available  for  producing  milligram-scale  protein 
for  cloned  channel  genes.  We  have  found  that 
the  baculovirus-Sf9  transient  expression  system 
works  well  with  the  Shaker  K"^  channel.  By  in- 
fecting a  cell  line  derived  from  the  armyworm 
caterpillar  with  a  Shaker-recombinant  baculo- 
virus  that  normally  attacks  the  alfalfa-looper, 
we  have  shown  that  electrophysiologically  nor- 
mal currents  can  be  observed  and  that  the 
Shaker  protein  is  the  major  membrane  protein 
produced.  We  are  presently  testing  the  wider 
generality  of  this  system  and  using  this  insect- 
based  "channel  factory"  as  a  source  for  purifica- 
tion and  functional  reconstitution  of  the  Shaker 
channel  protein. 
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MY  research  concerns  the  function  and  devel- 
opment of  the  visual  system,  especially  the 
visual  areas  of  the  primate  cerebral  cortex.  My 
main  experimental  tools  are  electrophysiological 
recording  and  quantitative  analysis  of  the  visually 
evoked  activity  of  single  neurons.  We  also  draw 
importantly  from  related  work  in  visual  psycho- 
physics,  computational  modeling,  and  comple- 
mentary neuroanatomy. 

Presently  we  are  involved  in  two  broad  groups 
of  studies.  The  first  concerns  the  functional  prop- 
erties of  single  neurons  in  the  extrastriate  visual 
areas  of  the  macaque  monkey's  cerebral  cortex, 
with  special  emphasis  on  the  processing  of  infor- 
mation about  visual  motion,  space,  form,  and 
color.  The  second  group  of  studies  concerns  the 
development  of  cortical  visual  function  in  mon- 
keys and  the  way  that  development  is  affected  by 
abnormal  early  visual  experience. 

An  important  organizing  theme  derives  from 
the  discovery  of  two  functional  streams  in  the 
monkey's  geniculo-cortical  visual  pathway.  One 
stream,  the  P  system,  originates  in  the  dense  and 
numerous  PjS  ganglion  cells  of  the  retina,  contin- 
ues through  the  parvocellular  layers  of  the  lateral 
geniculate  nucleus  (LGN),  and  extends  to  layer 
4Ci8  of  the  striate  (or  primary  visual)  cortex,  VI . 
A  second  stream,  the  M  system,  originates  in  the 
large,  fast-conducting  but  relatively  sparse  Pa 
retinal  ganglion  cells,  continues  through  the 
magnocellular  layers  of  the  LGN,  and  enters  the 
striate  cortex  through  layer  4Ca. 

Signals  from  the  P  system  are  passed  preferen- 
tially into  a  set  of  cortical  areas  that  seem  to  be  of 
special  importance  for  the  processing  of  form  and 
color,  especially  visual  areas  2  and  4,  and  into  the 
inferior  temporal  cortex.  Signals  from  the  M  sys- 
tem pass  rather  selectively  into  another  set  of  cor- 
tical areas  that  seem  to  be  essential  for  the  analy- 
sis of  visual  motion  and  visual  space,  especially 
the  middle  temporal  area  (MT,  or  V5),  and  into 
the  posterior  parietal  cortex.  Our  working  hy- 
pothesis is  that  these  streams  subserve  different, 
albeit  overlapping  visual  functions,  and  also  that 
different  forms  of  developmental  visual  disorder 
may  reflect  abnormalities  primarily  affecting  one 
stream  or  the  other. 


To  study  the  functions  of  cortical  visual  areas, 
we  analyze  the  responses  evoked  in  single  neu- 
rons by  visual  stimuli  carefully  selected  to  permit 
formal  characterization  of  underlying  neuronal 
mechanisms.  The  class  of  properties  in  which  we 
are  generally  interested  concerns  the  selectivity 
with  which  neurons  respond  to  variations  along 
one  or  another  visual  dimension.  We  also  try  to 
examine  the  neuroanatomical  distribution  and 
functional  properties  of  neurons  providing  affer- 
ent signals  to  a  particular  area,  so  that  we  can 
attempt  to  understand  the  computational  trans- 
formations of  the  visual  signal  executed  by  the 
circuits  in  each  area. 

An  important  concern  is  to  establish  the  partic- 
ular dimensions  of  the  visual  stimulus  for  which 
neurons  in  that  area  show  an  invariant  selectivity 
— that  is,  for  which  their  selectivity  is  unaffected 
by  parametric  variation  in  other,  unrelated  di- 
mensions. For  example,  neurons  in  VI  have  in- 
variant selectivity  for  the  spatial,  temporal,  and 
chromatic  structure  of  visual  stimuli.  Neurons  in 
MT  transform  afferent  spatiotemporal  signals  into 
invariant  representations  of  an  object's  speed  and 
direction.  Neurons  in  V4,  on  the  other  hand,  may 
transform  simple  afferent  chromatic  signals  into 
invariant  representations  of  the  object's  surface 
properties. 

A  critical  issue  in  cortical  sensory  physiology  is 
to  relate  perceptual  experience  and  judgment  to 
the  activity  of  neurons  and  neuron  assemblies.  In 
collaboration  with  William  Newsome  at  Stanford 
University,  we  have  used  statistical  methods 
based  on  the  theory  of  signal  detection  to  com- 
pare the  performance  of  single  neurons  with  psy- 
chophysical measures  of  performance  obtained 
concurrently  from  an  awake,  behaving  monkey. 
The  goal  is  to  deduce  the  associations  between 
the  computation  of  perceptual  features  and  the 
activity  of  particular  groups  of  neurons.  The  re- 
sults suggest  that  small  groups  of  neurons  in  area 
MT  may  carry  the  signals  upon  which  behaving 
monkeys  make  judgments  of  the  motion  content 
of  visual  targets.  This  approach  allows  us  to  form 
a  common  language  in  which  to  consider  psycho- 
physical, computational,  and  neurobiological 
analyses  of  visual  cortical  function. 
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In  addition  to  its  purely  visual  functions,  the  M 
stream  provides  signals  that  drive  eye  movements 
of  pursuit,  the  slow^,  smooth  eye  movements  with 
which  primates  stabilize  on  the  retina  the  image 
of  a  moving  visual  target.  In  collaboration  with 
Stephen  Lisberger  (University  of  California,  San 
Francisco) ,  we  have  studied  several  aspects  of  the 
relationship  between  visual  and  visuomotor  pro- 
cesses. In  a  series  of  neurophysiological  studies, 
we  have  explored  the  responses  of  MT  neurons  to 
the  dynamic  motion  profiles  used  to  characterize 
pursuit,  and  have  documented  the  suitability  of 
the  motion-related  signals  in  these  neurons  for 
the  task  of  initiating  pursuit.  In  psychophysical 
work,  we  have  begun  to  explore  the  kinds  of  vi- 
sual signals  that  pass  into  the  oculomotor  system 
by  examining  the  relationship  between  the  de- 
tectability  of  particular  visual  patterns  and  the 
pursuit  eye  movements  they  elicit. 

Our  overall  ambition  for  these  studies  is  to 
"turn  the  sensory-motor  corner"  and  relate  the 
particulars  of  visual  processing  to  the  higher 
mechanisms  that  produce  voluntary  motor  com- 
mands. To  this  end,  we  are  developing  computa- 
tional models  designed  to  explain  the  signal 
transformations  that  take  place  at  a  series  of  stages 
between  the  initial  registration  of  the  visual 


image  and  the  formulation  of  the  final  oculomo- 
tor command. 

To  analyze  development,  we  study  the  vision 
of  monkeys  reared  either  with  an  artificial  stra- 
bismus (deviation  of  one  eye)  or  anisometropia 
(difference  in  the  refractive  state  of  the  two 
eyes).  Both  of  these  manipulations  lead  to  con- 
ditions resembling  human  amblyopia,  a  com- 
mon visual  deficit  of  central  nervous  system  ori- 
gin. In  behavioral  experiments,  we  learn  how 
experimental  amblyopia  affects  perceptually 
defined  mechanisms  that  support  visual  sensi- 
tivity to  form,  contrast,  and  position.  Neuro- 
physiological studies  in  the  same  animals  then 
reveal  alterations  in  cortical  neuron  properties 
that  seem  to  be  related  to  the  psychophysically 
measured  visual  defects. 

Using  this  strategy,  we  seek  to  uncover  the  rela- 
tionship between  the  neural  changes  that  under- 
lie amblyopia  and  the  perceptual  consequences 
of  the  disorder.  We  are  currently  pursuing  the 
idea  that  the  relatively  mild  type  of  amblyopia 
typically  produced  by  anisometropia  (having  un- 
symmetric  parts)  involves  a  deficit  in  the  P  sys- 
tem, while  the  more  complex  syndrome  that  of- 
ten follows  strabismus  also  involves  important 
deficits  in  the  M  system. 
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T lymphocytes  protect  the  body  from  invasion 
by  foreign  organisms,  but  can  also  become 
targets  of  infection  by  viruses.  One  of  these  is  the 
human  immunodeficiency  virus  (HIV),  which 
causes  the  acquired  immune  deficiency  syn- 
drome (AIDS).  Under  normal  circumstances,  T 
cells  are  activated  in  response  to  infection  and 
begin  to  synthesize  a  set  of  proteins  that  step  up 
the  immunologic  defense  system.  In  T  cells  that 
contain  HIV,  cellular  activation  signals  the  virus 
to  augment  its  replication.  We  have  characterized 
regulatory  proteins  that  stimulate  gene  expres- 
sion in  T  cells  and  retroviruses.  These  cells  pro- 
vide a  model  for  the  study  of  coordinate  gene 
expression  during  development  and  viral 
infection. 

Through  the  use  of  this  model,  we  have  identi- 
fied proteins  that  bind  to  control  regions,  regu- 
lating the  expression  of  other  immune  system 
proteins  and  of  HIV.  We  have  also  begun  to  use 
our  knowledge  of  cellular  and  viral  transcription 
to  deliver  recombinant  genes  in  vivo.  The  T  cell 
model  has  taught  us  more  about  the  biology  of 
these  genes,  and  we  have  applied  this  knowledge 
to  endothelial  and  vascular  smooth  muscle  cells 
of  the  vessel  wall,  which  has  provided  new  oppor- 
tunities for  therapeutic  gene  transfer. 

Regulation  of  HTV  Gene  Expression 
in  T  Cells  and  Monocytes 

HIV  expression  can  be  triggered  in  T  cells  by 
means  of  phorbol  esters  or  other  immune  system 
activators.  We  have  shown  that  stimulation  of 
these  cells  increases  the  binding  activity  of  a  pro- 
tein called  NF-/cB  (nuclear  factor  that  recognizes 
a  sequence  in  the  k  immunoglobulin  light  chain 
of  B  cells) ,  which  binds  to  a  DNA  control  region. 
This  factor  stimulates  HIV  transcription  in  acti- 
vated T  cells.  The  DNA  sequence  that  NF-kB  rec- 
ognizes is  twice  repeated  in  the  HIV  control  re- 
gion, and  mutation  of  these  sites  abolishes  HIV 
inducibility.  This  transcription  factor  acts  in  syn- 
ergy with  HIV  products,  such  as  the  tat-l  gene, 
further  enhancing  HIV  gene  expression  in  an  in- 
fected cell. 

In  addition  to  HIV  type  1  (HIV-1),  AIDS  can  be 


induced  by  a  related  virus,  HIV  type  2  (HrV-2) .  A 
retrovirus,  HIV- 2  shares  nucleic  acid  and  protein 
similarity  with  HIV-1.  First  described  in  West 
Africa,  it  has  begun  to  appear  throughout  the 
world.  The  two  viruses  differ  in  the  length  of  the 
asymptomatic  period  following  infection.  Be- 
cause progression  of  HlV-related  disease  is  asso- 
ciated with  increased  viral  replication,  the  rate  of 
disease  progression  may  be  influenced  by  virus- 
activating  regulatory  proteins  synthesized  by  host 
cells. 

Such  proteins  could  themselves  be  regulated 
by  distinct  cofactors  that  selectively  stimulate 
cellular  activation  pathways.  These  T  cell  activa- 
tion pathways  regulate  specific  transcription  fac- 
tors, which  may  contribute  to  the  regulation  of 
the  latent  phase  of  HIV  infection. 

We  have  recently  defined  the  transcriptional 
regulation  and  induction  of  these  retroviruses 
and  have  found  that  the  regulation  of  HIV- 2 
differs  from  that  of  HIV- 1 .  A  distinct  T  cell  activa- 
tion pathway — triggering  of  the  CD  3  component 
of  the  T  cell  antigen  receptor  complex — stimu- 
lates HIV- 2  gene  expression  but  does  not  affect 
HIV-1.  The  response  to  T  cell  receptor  stimula- 
tion in  HIV- 2  is  mediated  by  an  upstream  reg- 
ulatory element,  CD3R,  which  is  recognized 
by  a  sequence-specific  DNA-binding  protein, 
NF-CD3R. 

Jurkat  T  leukemia  cell  lines  containing  HrV-2 
provirus  also  showed  increased  viral  replication 
following  stimulation  of  the  T  cell  receptor  com- 
plex, in  contrast  to  HIV- 1 .  These  findings  suggest 
that  transcriptional  regulation  and  induction  of 
HIV-2  differs  from  HIV-l.  The  studies  also  raise 
the  possibility  that  different  cofactors  contribute 
to  activation  of  disease  associated  with  the  two 
HIV  types. 

Alteration  of  gene  transcription  by  inhibition 
of  specific  transcriptional  regulatory  proteins 
would  help  to  define  how  these  factors  partici- 
pate in  cellular  differentiation  and  viral  infec- 
tion. Although  several  methods  are  known  by 
which  the  functions  of  these  proteins  can  be  an- 
tagonized, each  has  specific  limitations.  Recently 
we  have  developed  inhibitors  of  sequence- 
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specific  DNA-binding  proteins  with  double- 
stranded  phosphorothioate  oligonucleotides,  us- 
ing octamer  or  kB  consensus  sequences.  These 
oligonucleotides  bind  specifically  to  the  relevant 
transcription  factor.  Octamer-dependent  activa- 
tion of  a  reporter  plasmid,  or  NF-/cB-dependent 
activation  of  the  HIV  enhancer,  can  be  appropri- 
ately inhibited  when  the  relevant  phosphor- 
othioate oligonucleotide  is  added  to  a  transiently 
transfected  cell  line. 

Addition  of  octamer  phosphorothioate  oligonu- 
cleotides to  Jurkat  T  leukemia  cells  inhibited  se- 
cretion of  interleukin-2  (IL-2)  to  a  degree  similar 
to  that  observed  with  a  mutated  octamer  site  in 
the  IL-2  enhancer.  Double-stranded  phosphor- 
othioate oligonucleotides  compete  for  the  bind- 
ing of  specific  transcription  factors  and  may  there- 
fore provide  antiviral,  immunosuppressive,  or 
other  therapeutic  effects. 

Expression  of  Cellular  and  Retroviral- 
Vector  Genes  In  Vivo 

Transcription  factors  resembling  NF-/cB  also 
regulate  the  expression  of  normal  cellular  genes. 
Examination  of  a  growth  factor  receptor  for  an- 
other immunologic  protein,  interleukin-2  (IL-2), 
revealed  a  site  closely  related  to  the  regulatory  /cB 
sites  of  HIV.  We  have  shown  that  this  site  is  recog- 
nized not  only  by  NF-kB  but  by  other  proteins.  We 
have  cloned  two  genes  encoding  these  proteins 
and  have  evaluated  their  role  in  the  regulation  of 
different  cellular  genes.  We  are  also  studying  the 
activation  of  another  coordinately  expressed  T 
cell  gene,  IL-2,  by  characterizing  proteins  that 
bind  to  expression-regulating  control  regions.  In 
particular,  we  have  identified  binding  proteins 
that  recognize  conserved  regions  by  which  IL-2 
gene  expression  is  negatively  regulated. 


Despite  recent  advances  in  the  understanding 
of  eukaryotic  gene  regulation,  the  site-specific 
expression  of  genes  in  vivo  remains  a  major  ob- 
stacle to  the  therapeutic  management  of  human 
disease.  Using  our  knowledge  of  retroviral  gene 
expression,  we  have  developed  systems  utilizing 
viral  vectors  to  express  biologically  active  pro- 
teins in  cells  and  tissues. 

For  example,  we  have  recently  devised  a 
method  that  allows  a  recombinant  gene  to  be  ex- 
pressed efficiently  at  a  specific  site  in  vivo  by 
direct  introduction  of  genetic  material  at  the  time 
of  catheterization.  A  recombinant  /3-galactosidase 
gene  was  expressed  in  a  specific  arterial  segment 
by  direct  infection  with  a  retroviral  vector  or  by 
DNA  transfection  using  liposomes.  Several  cell 
types  in  the  vessel  wall  were  transduced  with  the 
recombinant  gene,  including  endothelial  and  vas- 
cular smooth  muscle  cells.  Following  retroviral 
infection,  a  recombinant  reporter  gene  was  ex- 
pressed for  at  least  five  months,  with  no  detect- 
able helper  virus  generated.  Recombinant  gene 
expression  was  limited  to  the  site  of  infection  and 
was  absent  from  liver,  lung,  kidney,  and  spleen. 

Thus  site-specific  gene  expression  can  be 
achieved  by  direct  gene  transfer  in  vivo,  and 
could  be  applied  to  the  treatment  of  such  human 
diseases  as  atherosclerosis,  cancer,  or  AIDS.  Bio- 
logically active  proteins  are  now  being  intro- 
duced into  cells,  including  growth  factors, 
growth  inhibitors,  or  immune  system  proteins. 
The  goal  of  this  research  is  not  only  to  understand 
basic  mechanisms  of  gene  regulation,  transcrip- 
tional activation,  and  viral  gene  expression,  but 
also  to  define  the  biological  significance  of  fac- 
tors that  regulate  gene  expression  in  complex 
organisms. 
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OUR  laboratory  is  interested  in  the  molecular 
mechanisms  that  regulate  the  production 
and  function  of  the  contractile  system  in  muscle 
cells.  This  apparatus  is  the  molecular  motor  for 
locomotion  and  for  the  heartbeat.  Moreover,  vari- 
ations of  this  contractile  system  are  involved  in 
maintaining  the  shape  and  function  of  nonmuscle 
cells. 

The  functional  unit  of  the  contractile  system  is 
the  sarcomere,  with  its  precisely  arranged  compo- 
nents. Each  of  these  sarcomeric  proteins  is  the 
product  of  a  small  multigene  family.  Combina- 
tions of  these  protein  isoforms  can  give  rise  to 
many  different  sarcomeres,  and  this  capacity  for 
generating  diversity  is  further  expanded  by  a  pro- 
cess of  alternative  splicing. 

Among  sarcomeres,  significant  functional  dif- 
ferences are  produced  through  two  main  mecha- 
nisms. One  changes  the  components  either  at  the 
transcriptional  level  or  by  alternative  splicing; 
the  other,  by  regulating  the  availability  of  ions  in 
the  muscle  cells.  These  two  aspects  of  contractil- 
ity continue  to  be  the  main  focus  of  our  research. 

Transcriptional  Regulation  of  Contractile 
Protein  Genes 

Which  sarcomeres  are  assembled  in  a  given  cell 
at  a  particular  time  depends  on  which  contractile 
protein  genes  are  expressed  in  the  cell.  To  ana- 
lyze the  mechanisms  involved  in  switching  from 
one  gene  to  another  in  the  same  gene  family,  we 
have  focused  on  the  genes  coding  for  the  myosin 
heavy  chain  (MHC) .  Functionally  this  is  the  most 
important  component  of  the  sarcomere  because 
it  contains  the  ATPase  activity  that  converts  the 
chemical  energy  of  ATP  into  mechanical  force. 
We  are  currently  exploring  two  main  questions  in 
the  regulation  of  the  MHC  genes:  What  deter- 
mines that  a  given  MHC  gene  is  expressed  in  a 
given  cell  type  at  a  particular  developmental  or 
physiological  stage?  What  determines  the  level  of 
expression?  For  these  analyses  we  have  concen- 
trated on  the  two  MHC  genes  expressed  in  the 
myocardium,  a  and  |8  MHC.  These  genes  are  par- 
ticularly advantageous  because  their  expression 


during  development  is  modulated  in  response  to 
different  hormones  and  physiological  conditions. 

We  have  identified  many  of  the  regulatory  DNA 
sequences  involved  in  the  tissue-specific  expres- 
sion of  these  two  genes.  We  have  shown  that  their 
expression  is  determined  in  large  part  by  negative 
regulatory  elements.  These  elements  interact 
with  nuclear  proteins  that  are  present  in  many 
different  cell  types  but  sparse  in  muscle  cells. 
The  transcriptional  factors  of  the  MyoD  gene  fam- 
ily do  not  seem  to  play  a  role  in  the  regulation  of 
these  two  MHC  genes. 

In  their  expression,  positive  as  well  as  negative 
factors  are  essential.  The  most  important  positive 
factor  for  a  MHC  is  a  thyroid-responsive  element 
(TRE) .  The  interaction  of  this  cis-acting  element 
with  different  forms  of  receptor  for  thyroid  hor- 
mone and  retinoic  acid  has  been  analyzed  in  de- 
tail. These  interactions  are  required  and  suffi- 
cient to  produce  most  of  the  gene's  known 
phenotypes.  Interestingly,  thyroid  hormone  re- 
presses the  jS  MHC  gene. 

The  main  positive  regulator  of  0  MHC  is  a  mus- 
cle-specific enhancer  of  complex  structure,  com- 
posed of  binding  sites  for  muscle-specific  and 
general  transcriptional  factors.  Expression  stud- 
ies of  this  binding  site,  together  with  mobility 
shift  assays,  demonstrate  that  the  protein  binding 
to  the  site  is  muscle-specific.  Using  this  sequence 
as  a  probe  to  screen  a  cardiac  expression  cDNA 
library,  we  have  cloned  a  novel  transcription  fac- 
tor that  belongs  to  the  family  of  the  homeobox 
genes.  Expression  of  this  factor  significantly  en- 
hances the  expression  of  constructs  containing 
the  |S  MHC  enhancer  sequence. 

The  finding  of  a  muscle-specific  transcription 
factor  that  does  not  belong  to  the  MyoD  family 
has  long  been  a  goal  of  workers  in  muscle  dy- 
namics. This  is  so  because  the  MyoD  gene  family 
is  not  expressed  in  the  heart.  However,  this  mus- 
cle expresses  many  of  the  skeletal  muscle  genes 
whose  induction  is  dependent  on  MyoD.  The 
search  for  the  cardiac  MyoD  equivalent  in  many 
laboratories,  including  ours,  has  so  far  been  un- 
successful. The  identification  and  cloning  of  a 
cardiac  transcription  factor  of  the  homeobox  fara- 
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ily  suggests  that  this  factor,  or  a  close  relative,  is  a 
cardiocyte  lineage-determining  gene.  Experi- 
ments are  in  progress  to  determine  its  physiologi- 
cal role,  its  mode  of  action,  and  its  developmen- 
tal and  physiological  regulation. 

Regulation  of  Protein  Diversity  by 
Alternative  Splicing 

To  explore  the  mechanisms  involved  in  the  pro- 
duction of  different  proteins  from  a  single  gene, 
we  have  continued  to  focus  on  the  a-tropomyosin 
(TM)  gene.  This  gene  generates  a  minimum  of  1 0 
different  isoforms  that  are  tissue-specific  and  de- 
velopmentally  regulated.  These  are  produced  by 
the  alternative  use  of  two  different  promoters  and 
poly  (A)  addition  sites,  together  with  three  pairs 
of  exons  that  are  used  in  a  mutually  exclusive 
fashion.  We  have  focused  on  one  of  these  exon 
pairs  (exons  2  and  3)  to  elucidate  the  elements 
involved  in  this  type  of  regulation. 

Using  a  combination  of  in  vivo  and  cell-free 
splicing  assays,  we  have  analyzed  the  regulatory 
features  of  this  system.  The  mutual  exclusive  be- 
havior is  due  to  a  competition  between  the  two 
exons  for  the  common  flanking  splicing  sites.  In- 
clusion of  exon  3  is  the  default  pattern  and  oc- 
curs in  all  cell  types,  except  smooth  muscle  cells. 
If  exon  3  is  deleted,  however,  inclusion  of  exon  2 
becomes  the  default  pattern  and  occurs  in  all  cell 
types. 

The  basis  for  this  behavior  is  the  polypyrimi- 
dine  tract  located  at  the  3'  end  of  the  intron,  be- 
tween the  branch  site  and  the  splice  site.  The  role 
of  the  tract  in  splice  site  selection  is  determined 
by  its  ability  to  bind  to  a  polypyrimidine  binding 
protein  (PBP) .  This  factor  has  proven  to  be  essen- 
tial for  splicing.  During  the  past  year  we  have 
biochemically  purified  and  characterized  it. 

In  addition,  we  have  purified  and  cloned  a  ribo- 
nucleoprotein  that  copurifies  with  PBP.  Experi- 
ments are  now  in  progress  to  determine  the  stage 
in  spliceosome  assembly  that  requires  PBP  as 
well  as  its  interaction  with  other  components  of 
the  splicing  complex. 

The  role  of  PBP  seems  to  explain  the  default 
splicing  pattern.  However,  since  PBP  is  a  ubiqui- 
tous splicing  factor  richly  present  in  smooth  mus- 
cle cells,  the  obvious  question  of  how  the  regu- 
lated pattern  is  established  remains  unanswered. 


In  vivo  and  in  vitro  experiments  have  clearly 
demonstrated  a  negative  regulatory  mechanism. 
Factor(s)  present  in  smooth  muscle  cells  block 
the  default  splicing  pattern,  whereupon  exon  2 
becomes  the  default  pathway  and  is  included  in 
the  mature  mRNA.  The  sequences  involved  in  this 
negative  regulation  have  been  mapped  to  three 
different  elements  in  and  around  exon  3.  Each  of 
these  elements  is  required,  but  neither  is  suffi- 
cient to  produce  this  form  of  regulation.  Experi- 
ments are  in  progress  to  isolate  and  characterize 
the  trans-acting  factors  that  interact  with  these 
elements,  using  a  combination  of  biochemical 
and  genetic  approaches. 

Maintenance  of  the  Terminally 
Differentiated  State 

One  of  the  more  intriguing  characteristics  of 
muscle  cells  is  their  terminally  differentiated  phe- 
notype.  In  the  process  of  differentiation,  these 
cells  withdraw  irreversibly  from  the  cell  cycle 
and  are  therefore  unable  to  regenerate.  Moreover, 
expression  of  muscle-specific  genes  is  dependent 
on  this  terminally  differentiated  state.  In  its  ab- 
sence, these  genes  are  only  transiently  expressed 
and  repressed  in  response  to  growth  factor  stimu- 
lation. In  an  attempt  to  understand  this  process, 
we  have  reversed  the  terminally  differentiated 
state  through  the  expression  of  several  DNA  tu- 
mor virus  oncogenes.  All  these  oncogenes  inter- 
act with  the  product  of  the  retinoblastoma  gene. 
Using  antibodies  against  the  transforming  pro- 
tein, Rb,  p53,  and  muscle-specific  transcription 
factors,  we  have  determined  the  involvement  of 
these  several  gene  products  in  the  production  of 
the  terminally  differentiated  state. 

Structure-Function  Relationships  in 
Potassium  Channels 

The  contractile  cycle  of  the  sarcomere  is  trig- 
gered by  the  action  potential.  Potassium  channels 
are  fundamental  to  the  repolarization  of  the  cell 
membrane.  For  this  reason,  we  recently  initiated 
an  in-depth  analysis  of  the  structure  of  a  mamma- 
lian potassium  channel  cloned  in  our  laboratory. 
This  channel  belongs  to  the  family  of  delayed  rec- 
tifiers and  has  very  low  inactivation  kinetics, 
making  it  a  valuable  model  to  study  the  voltage 
sensor  mechanism  in  this  molecule. 
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THE  growth  of  mammalian  cells  is  regulated 
by  extracellular  proteins  called  growth  fac- 
tors. How  these  proteins  induce  cell  growth  is  a 
key  question  relevant  not  only  to  the  regulation 
of  cell  proliferation  and  tumorigenesis  but  more 
generally  to  the  response  of  cells  to  a  variety  of 
extracellular  signaling  agents,  such  as  develop- 
mental factors,  classical  hormones,  cell  surface 
and  matrix  proteins,  and  neurotransmitters. 

The  first  step  in  the  stimulation  of  cell  growth 
by  a  growth  factor  is  the  interaction  of  the  factor 
with  a  specific  cell-surface  receptor.  This  inter- 
action rapidly  induces  a  cascade  of  biochemical 
reactions  in  the  cell,  leading  to  the  sequential 
activation  of  specific  cellular  genes  and  subse- 
quent DNA  replication  and  cell  division.  The  or- 
dered expression  of  cellular  genes  induced  by 
growth  factors  has  the  attributes  of  a  regulated 
genetic  program.  Research  in  my  laboratory  con- 
cerns the  analysis  of  this  program  and  its  role  in 
cell  proliferation. 

To  identify  some  of  the  genes  activated  by 
growth  factors  we  used  recombinant  DNA  meth- 
ods to  isolate  DNA  copies  (cDNAs)  of  gene  mes- 
sages present  in  cultured  fibroblastic  mouse  cells 
only  after  they  have  been  stimulated  by  a  growth 
factor  or  by  serum  that  is  rich  in  a  growth  factor 
derived  from  blood  platelets.  We  and  others  have 
identified  one  set  of  genes,  the  immediate-early 
genes,  which  are  activated  within  2  or  3  minutes 
after  addition  of  growth  factor,  coordinately  with 
activation  of  the  proto-oncogenes  fos  or  myc  de- 
tected previously.  Some  of  these  immediate-early 
genes  encode  proteins  related  to  known  tran- 
scription factors  (proteins  that  regulate  gene  ex- 
pression), others  encode  secreted  proteins  or 
membrane  proteins,  and  others  encode  proteins 
that  are  part  of  the  filamentous  structures  of  cells. 
We  have  concentrated  largely  on  genes  that  en- 
code probable  transcription  factors,  because 
these  are  likely  to  be  involved  in  regulating  the 
genetic  program  induced  by  growth  factors. 

We  previously  described  immediate-early  tran- 
scription factors  with  "zinc  finger"  motifs  and 
others  that  are  "leucine  zipper"  proteins  of  the 
Jun  family.  During  the  past  year  we  have  identi- 


fied three  additional  immediate-early  proteins 
whose  structures  are  related  to  those  of  known 
transcription  factors.  One  of  the  proteins  is 
Nup475,  a  nuclear,  zinc-binding  protein  that  has 
two  copies  of  a  sequence  that  suggests  it  can  form 
a  novel  type  of  zinc  finger.  Studies  are  under  way 
to  determine  if  Nup475  binds  to  a  specific  DNA 
sequence  and  regulates  transcription. 

The  second  protein,  AFosB,  is  a  naturally  occur- 
ring shortened  form  of  FosB,  a  previously  de- 
scribed member  of  the  Fos  proto-oncogene  fam- 
ily of  transcription  factors.  Expression  of  AFosB 
was  found  not  only  in  growth  factor-stimulated 
cultured  cells  but  also  in  specific  parts  of  the 
brain  after  electrical  stimulation  and  in  regener- 
ating liver.  Like  other  members  of  the  Fos  family, 
AFosB  can  form  dimers  with  members  of  the  Jun 
family,  and  the  dimers  can  bind  to  signals  in  DNA. 
However,  unlike  other  Fos-Jun  dimers,  AFosB-Jun 
dimers  are  not  transcriptionally  active.  Instead, 
AFosB  competitively  inhibits  the  transcriptional 
activities  of  Jun  and  Fos;  it  may  therefore  serve  as 
a  negative  regulator  of  Jun  and  Fos  during  the 
growth  response. 

The  third  protein  is  HLH462,  which  is  related 
to  the  helix-loop-helix  class  of  transcription  regu- 
lators. Proteins  of  this  class  form  dimers  through 
their  helix-loop-helix  structural  domains.  Some 
dimers  bind  to  signals  in  DNA,  whereas  others 
cannot  bind  to  DNA  because  they  lack  a  DNA- 
binding  region.  Helix-loop-helix  proteins  of  the 
latter  type  have  been  shown  to  inhibit  the  tran- 
scriptional activity  of  DNA-binding  helix-loop- 
helix  proteins.  HLH462  has  the  structural  and 
functional  properties  of  an  inhibitory  helix-loop- 
helix  protein.  It  is  expressed  in  many  mouse  tis- 
sues and  in  early  mouse  embryos.  What  role  it 
plays  in  the  growth  factor-induced  program  is 
not  yet  clear.  One  possibility  is  that  it  inhibits  a 
negative  growth  regulator  present  in  nongrowing 
cells. 

So  far,  approximately  15  known  or  probable 
transcription  factors  have  been  identified  among 
the  immediate-early  proteins  induced  in  fibro- 
blasts by  serum  growth  factors.  Later  in  the  cellu- 
lar response  other  genes  are  activated  whose 
encoded  proteins  are  thought  to  mediate  progres- 
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sion  toward  the  DNA  synthesis  phase  of  cell  scription  factors  probably  reflects  a  complex 
growth.  The  diversity  of  immediate-early  tran-    mechanism  for  activating  these  later  genes. 
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VISUAL  pigments  are  the  light-absorbing  pro- 
teins that  initiate  phototransduction.  Each 
consists  of  a  chromophore,  1 1-cis  retinal,  joined 
to  an  integral  membrane  protein.  The  visual  pig- 
ments constitute  one  branch  of  a  large  family  of 
cell  surface  receptors  that  transduce  external 
stimuli  by  activating  G  proteins.  In  the  visual  sys- 
tem, the  activated  G  protein  stimulates  a  cGMP 
phosphodiesterase,  and  the  resulting  transient 
decline  in  cGMP  closes  plasma  membrane  cation 
channels. 

Photon  absorption  by  11  -cis  retinal  causes  it  to 
isomerize  from  1 1-cis  to  all-trans.  It  then  under- 
goes a  series  of  conformational  changes,  leading 
ultimately  to  a  form  that  interacts  with  the  G  pro- 
tein. The  changes  underlying  visual  pigment  acti- 
vation are  likely  to  resemble  those  that  accom- 
pany hormone-receptor  binding  among  the  other 
members  of  this  receptor  family. 

Our  laboratory  is  taking  three  general  ap- 
proaches related  to  the  visual  pigments:  investi- 
gations of  their  structure  and  function,  the  con- 
trol of  their  expression,  and  their  variation  within 
the  human  population. 

Structure/Function  Studies 

Several  years  ago  we  succeeded  in  producing 
large  quantities  of  bovine  rhodopsin  by  expres- 
sion of  cloned  cDNA  in  tissue  culture  cells.  We 
are  using  this  system  in  conjunction  with  site- 
directed  mutagenesis  to  define  the  chromophore 
binding  pocket,  the  residues  involved  in  protein 
conformational  changes,  and  the  surface  and 
transmembrane  topography  of  the  protein. 

In  one  experiment,  each  of  the  22  negatively 
charged  amino  acids  (e.g.,  aspartate  or  gluta- 
mate)  was  changed  to  a  neutral  residue  of  identi- 
cal size  (asparagine  or  glutamine) .  We  observed 
that  only  one  of  the  mutant  proteins  differs  dra- 
matically from  the  wild  type  in  its  behavior. 
When  glutamate*''  is  mutated  to  glutamine,  the 
Schiffs  base  linking  1 1  -cis  retinal  to  the  protein 
loses  its  proton.  The  base  is  re-protonated  upon 
addition  of  small  anions  (e.g.,  chloride)  to  the 
sample. 


Our  interpretation  of  this  experiment  is  that 
glutamate'*^  is  normally  the  counterion  that  sta- 
bilizes the  protonated  Schiff's  base,  but  in  its  ab- 
sence a  small  anion  can  serve  as  a  surrogate.  This 
observation  is  likely  to  be  relevant  to  the  mecha- 
nism of  protein  activation,  because  SchifFs  base 
deprotonation  is  required  for  the  protein  to  as- 
sume its  active  conformation. 

In  a  second  experiment,  Charles  Weitz,  a  post- 
doctoral fellow,  has  examined  a  number  of  rho- 
dopsin mutants  for  their  ability  to  assume  the  ac- 
tive conformation.  Thus  far,  one  mutant  binds  the 
chromophore  and  absorbs  light  normally  but  ap- 
pears to  be  locked  in  the  inactive  conformation. 

In  a  third  set  of  experiments,  Jimo  Borjigin,  a 
graduate  student,  is  using  insertional  mutagene- 
sis to  tag  rhodopsin  at  predetermined  sites.  The 
tags  consist  of  stretches  of  foreign  amino  acids 
that  can  bind  to  other  proteins  (e.g.,  antibodies). 
These  modified  rhodopsins  should  be  useful  for  a 
variety  of  structural  studies. 

The  expression  system  is  also  being  used  by 
Shannath  Merbs,  a  graduate  student,  to  produce 
the  human  cone  pigments — a  related  group  of 
light  receptors  that  mediate  color  vision. 

Control  of  Visual  Pigment  Gene  Expression 

As  an  entree  into  the  general  question  of  retinal 
development,  we  are  examining  the  control  of 
the  genes  for  visual  pigment.  Donald  Zack,  a 
postdoctoral  fellow,  in  collaboration  with  Jean 
Bennett  and  John  Gearhart  at  Johns  Hopkins,  has 
constructed  a  set  of  transgenic  mice  that  carry 
sequences  upstream  of  the  bovine  rhodopsin 
gene  joined  to  a  gene  encoding  (8-galactosidase,  a 
convenient  histochemical  marker.  Rhodopsin 
gene  fragments  as  small  as  230  base  pairs  direct 
expression  of  the  reporter  gene  to  the  photore- 
ceptor cells. 

Curiously,  a  considerably  larger  fragment 
(2,100  bp)  directs  expression  in  a  spatially  non- 
uniform manner:  a  gradient  of  expression  forms 
across  the  retina.  This  DNA  fragment  appears  to 
be  responding  to  a  preexisting  spatial  gradient. 
This  gradient  may  be  involved  in  determining  the 
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map  coordinates  for  the  wiring  of  the  visual 
system. 

Inherited  Variation 

The  human  visual  system  has  several  experi- 
mental attributes  that  facilitate  a  genetic  analysis: 
first,  humans  can  accurately  report  their  visual 
experience  in  sensitive,  noninvasive  behavioral 
tests;  second,  inherited  alterations  in  visual  func- 
tion rarely  affect  longevity  or  fecundity — and 
thus  usually  persist  in  the  gene  pool;  and  third, 
persons  with  significant  visual  disorders  usually 
present  themselves  to  the  medical  community. 
We  have  taken  advantage  of  these  attributes  to 
identify  persons  with  mutations  in  each  of  the 
four  genes  for  visual  pigments. 

Several  years  ago  we  showed  that  the  common 
forms  of  red-green  colorblindness  are  due  to  mu- 
tations in  the  red  and  green  cone  pigments.  More 
recently  Charles  Weitz  has  identified  amino  acid 
substitutions  in  the  blue  pigment  gene  in  patients 
with  defects  in  blue  sensitivity.  A  rare  form  of 
severe  color  vision  deficiency,  blue  cone  mono- 
chromacy,  occurs  when  both  red  and  green  cone 
systems  are  inactive.  Our  analysis  of  30  blue  cone 
monochromat  families  shows  that  deletion  of  a 
small  DNA  sequence  adjacent  to  the  red  and 


green  pigment  genes  suffices  to  cause  the  defect. 
Yanshu  Wang,  a  graduate  student,  has  recently 
shown  that  a  large  DNA  fragment  containing  this 
region  directs  expression  of  a  reporter  gene  to 
cone  photoreceptor  cells  in  transgenic  mice. 

To  identify  rhodopsin  gene  mutations,  we  ex- 
amined patients  with  defects  in  night  vision  and 
peripheral  vision.  (Rhodopsin  is  the  visual  pig- 
ment in  rods — the  photoreceptors,  enriched  in 
the  peripheral  retina,  that  subserve  dim-light  vi- 
sion.) Night  blindness  and  loss  of  peripheral  vi- 
sion are  the  hallmarks  of  retinitis  pigmentosa,  a 
family  of  genetic  disorders  that  affect  1  person  in 
4,000.  We  recruited  patients  with  retinitis  pig- 
mentosa, and  Ching-Hwa  Sung,  a  postdoctoral 
fellow,  screened  their  rhodopsin  genes,  using  de- 
naturing gradient  gel  electrophoresis,  a  new  and 
rapid  method  for  detecting  point  mutations. 

Thus  far  we  have  discovered  1 3  different  point 
mutations  in  the  rhodopsin  gene,  together  ac- 
counting for  24  percent  of  patients  with  autoso- 
mal dominant  retinitis  pigmentosa.  All  of  the  mu- 
tant proteins  have  been  produced  in  the  tissue 
culture  expression  system.  The  single  most  com- 
mon mutation,  a  proline-to-histidine  substitution 
at  amino  acid  23,  results  in  a  protein  that  be- 
comes stuck  in  the  endoplasmic  reticulum. 
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Sections  of  a  transgenic  mouse  retina  expressing  Pgalactosidase  under  control  of  a  bovine  rho- 
dopsin  promoter.  A:  The  jSgalactosidase  substrate  X-gal  stains  only  the  photoreceptors.  B:  Hema- 
toxylin and  eosin  staining  shows  the  cell  layers. 

From  Zack,  D.f.,  Bennett,  f.,  Wang,  Y.,  Davenport,  C,  Klaunberg,  B.,  Gearhart,].,  and  Nathans, 
J.  1991.  Neuron  6:187-199. 
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THE  regulation  of  gene  expression  is  central  to 
the  complex  cellular  changes  that  take  place 
during  such  events  as  embryogenesis  and  onco- 
genesis. An  elucidation  of  the  molecular  mecha- 
nisms of  gene  control  pathways  is  the  focus  of  our 
laboratory,  including  the  modification  and  regu- 
lation of  factors  that  mediate  the  control  of  gene 
expression. 

Mechanisms  of  Viral-mediated 
Trans-activation  of  Transcription 

The  primary  event  in  the  expression  of  a  gene  is 
the  initiation  of  transcription  leading  to  the  for- 
mation of  functional  mRNA.  The  control  of  this 
event  is  a  key  step  in  the  determination  of  cellular 
phenotype.  Various  studies  have  shown  that  tran- 
scription initiation  is  complex,  resulting  from 
the  interaction  of  a  large  number  of  proteins  with 
regulatory  DNA  sequences  of  the  gene. 

The  study  of  complex  cellular  events  is  often 
facilitated  by  the  use  of  simple  viral  model  sys- 
tems. An  example  is  the  trans-activation  of  tran- 
scription by  viral  regulatory  proteins  such  as  the 
adenovirus  El  A  gene  product.  The  ElA  protein 
mediates  the  activation  of  transcription  of  a  set  of 
viral  genes  and  several  cellular  chromosomal 
genes.  Since  the  activation  process  involves  the 
use  of  cellular  transcription  factors,  this  system 
has  provided  a  means  for  the  study  of  cellular 
transcription  control. 

Work  in  our  laboratory  has  led  to  the  identifi- 
cation of  several  cellular  transcription  factors 
that  the  viral  regulatory  system  utilizes  and  that 
contribute  to  the  trans-activation  event.  In  partic- 
ular, the  activity  of  these  factors  is  modified  by 
the  viral  infection,  which  increases  the  levels  and 
changes  the  nature  of  the  DNA-binding  activity  of 
the  proteins.  The  DNA-binding  activity  of  several 
factors  depends  on  the  phosphorylation  state  of 
the  protein,  and  phosphorylation  appears  to  play 
a  key  role  in  the  activation  process. 

Recent  studies  have  shown  that  the  E2F  tran- 
scription factor,  as  isolated  from  adenovirus- in- 
fected cells,  binds  to  the  E2  promoter  with  a  high 
degree  of  cooperativity,  resulting  in  the  forma- 
tion of  a  very  stable  DNA-protein  complex.  This 


binding  is  important  for  full  transcription  activa- 
tion, requires  the  precise  dyad  arrangement  of 
the  E2F  sites  as  found  in  the  E2  promoter,  and 
requires  the  interaction  of  a  19-kDa  product  of 
the  early  viral  E4  gene  with  the  E2F  factor.  The 
induction  of  cooperative  E2F  binding  is  striking 
in  view  of  the  specificity.  Although  E2F-binding 
sites  have  been  identified  in  a  variety  of  cellular 
promoters,  no  other  instances  have  been  found  of 
this  precise  arrangement  in  the  E2  promoter.  It 
thus  appears  that  the  cellular  E2F  factor  is  con- 
verted into  an  E2  promoter-specific  factor 
through  the  action  of  the  E4  protein. 

The  E2F  factor  has  now  been  found  to  be  com- 
plexed  to  cellular  factors  in  extracts  of  a  variety 
of  cell  lines.  These  complexes  are  significant 
with  respect  to  a  viral  infection,  since  the  E4  pro- 
tein cannot  interact  with  E2F  that  is  already  com- 
plexed  to  a  cellular  factor,  thus  preventing  the 
formation  of  the  stable  interaction  on  the  E2  pro- 
moter. Strikingly,  however,  we  have  found  that 
ElA  proteins  can  dissociate  these  E2F-containing 
complexes,  releasing  free  E2E  that  can  associate 
with  the  E4  protein.  Thus  it  would  appear  that  a 
two-step  mechanism  has  evolved  to  utilize  the 
cellular  E2F  factor  efficiently  and  redirect  it  for 
viral-specific  purposes. 

Through  the  analysis  of  a  large  series  of  ElA 
mutants  it  has  become  clear  that  the  E2F-disso- 
ciating  activity  is  dependent  on  sequences  within 
ElA  that  are  important  for  transforming  activity. 
Moreover,  trans-activation  assays  have  demon- 
strated that  mutations  that  disrupt  E2F  dissocia- 
tion also  disrupt  cell-specific  trans-activation 
function.  These  are  also  the  amino  acid  se- 
quences that  are  shared  with  two  other  viral  on- 
cogene products — SV40  T  antigen  and  human 
papillomavirus  E7.  Previous  experiments  have 
shown  that  both  possess  trans-activation  function 
dependent  on  the  E2F  factor.  Recent  experiments 
have  shown  that  the  E7  protein  is  capable  of  al- 
tering the  E2F  complexes  in  a  manner  similar  to 
ElA.  It  therefore  appears  that  these  otherwise 
unrelated  viruses  have  acquired  a  common  regu- 
latory function  through  evolution,  directed  at  a 
specific  cellular  transcription  factor,  that  may  be 
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a  critical  part  of  the  ability  of  these  viral  proteins 
to  induce  oncogenic  transformation. 

Gene  Regulation  by  Alternative 
RNA  Processing 

Very  often  a  transcription  unit  encodes  not  one 
mRNA  but  several  (and  thus  several  proteins) .  By 
alternative  processing  of  the  primary  transcript, 
one  particular  mRNA  is  selected  and  produced.  In 
several  instances,  this  is  a  regulated  event,  sub- 
ject to  change  depending  on  the  circumstances  of 
the  cell.  Alternative  processing  can  involve  both 
the  selective  splicing  of  exons  and  the  selection 
of  one  of  several  poly  (A)  addition  sites,  thus  gen- 
erating a  unique  mRNA  3'  terminus. 

Possibly  the  best  example  of  regulation  of 
poly(A)  site  utilization  is  within  the  differentiat- 
ing B  lymphocyte,  where  there  is  a  dramatic 
change  in  the  nature  of  the  immunoglobulin  (Ig) 
heavy-chain  polypeptide  that  is  synthesized.  Ini- 
tially, a  membrane-bound  form  of  the  protein  is 
produced,  but  this  then  switches  to  a  secreted 
form  when  the  cells  differentiate  to  plasma  cells. 
The  RNAs  that  direct  the  synthesis  of  these  two 
proteins  are  encoded  in  the  same  transcription 
unit  and  are  produced  by  differential  cleavage  of 
the  primary  transcript  at  two  different  poly(A) 
sites. 


A  definition  of  the  events  and  factors  involved 
in  poly  (A)  site  formation,  again  using  simple 
viral  systems  but  also  the  developmentally  regu- 
lated Ig  heavy-chain  gene  system,  has  been  a  ma- 
jor goal  of  our  laboratory.  Experiments  have 
identified  sequences  within  the  two  Ig  ^  heavy- 
chain  poly(A)  sites  that  are  essential  for  regulated 
selection  during  B  cell  differentiation.  In  addi- 
tion, we  have  utilized  cell-free  systems  capable  of 
accurate  poly(A)  site  processing  in  vitro  to  iden- 
tify and  isolate  factors  mediating  the  processing 
reaction. 

Purification  of  factors  from  these  cell-free  ex- 
tracts has  now  yielded  information  concerning 
the  multiplicity  and  nature  of  factors  involved  in 
this  processing  event.  At  least  four  distinct  activi- 
ties are  required  for  efficient  processing  at  the 
poly  (A)  site.  Two  of  these  factors  form  com- 
plexes with  the  pre-mRNA,  and  the  combined  in- 
teraction results  in  the  generation  of  a  stable, 
committed  complex  that  allows  processing  to 
take  place.  Furthermore,  the  stability  of  these 
complexes  can  vary,  depending  on  sequences  in 
the  RNA  that  are  known  to  be  important  for  effi- 
cient processing,  indicating  that  the  formation  of 
these  protein-RNA  interactions  plays  a  significant 
role  in  determining  the  efficiency  with  which  a 
particular  poly(A)  site  is  utilized. 
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THE  research  in  my  laboratory  is  directed  to- 
ward elucidating  the  molecular  bases  for  a 
number  of  human  genetic  diseases.  Each  disease 
under  investigation  is  known  to  be  caused  by  a 
gene  on  the  X  chromosome,  but  the  molecular 
mechanism,  the  gene  involved,  and  the  nature  of 
the  underlying  mutations  have  been  generally  un- 
known. Recombinant  DNA  techniques  are  being 
employed  to  isolate  the  responsible  genes,  with 
the  aim  of  furthering  our  understanding  of  the 
normal  processes  that  result  in  each  of  these  dis- 
eases when  disrupted. 

Fragile  X  Syndrome 

One  of  the  most  common  genetic  forms  of 
mental  retardation,  the  fragile  X  syndrome,  af- 
fects nearly  1  in  every  2,000  males  worldwide. 
The  chromosomes  of  affected  males  are  normal  in 
appearance  on  routine  examination.  If,  however, 
supplies  of  the  molecular  building  blocks  for 
DNA  synthesis  are  stringently  limited  during  cell 
division  (m  vitro),  an  abnormal  gap  or  fragile 
site  is  induced  to  appear  at  the  tip  of  the  human  X 
chromosome. 

The  disease  is  generally  inherited  in  an  X- 
linked  manner,  but  there  are  important  excep- 
tions. These  are  found  in  "transmitting  males," 
the  approximately  20  percent  of  male  carriers 
who  are  not  retarded  and  do  not  show  the  chro- 
mosomal fragile  site  under  the  usual  induction 
conditions.  The  grandsons  of  transmitting  males, 
however,  may  inherit  the  gene  through  daughters 
and  manifest  the  disease. 

Understanding  the  fragile  X  syndrome  will  ulti- 
mately require  isolation  of  DNA  sequences  from 
the  region  involved  so  that  the  molecular  basis  of 
both  the  chromosomal  abnormality  (the  fragile 
site)  and  the  clinical  disease  (fragile  X  mental 
retardation)  can  be  elucidated.  My  colleagues 
and  I  are  creating  yeast  artificial  chromosomes 
(YACs)  containing  DNA  from  the  region  of  the 


fragile  site  for  study.  This  involves  attaching 
functional  parts  of  yeast  chromosomes  to  large 
fragments  of  DNA  from  a  patient  and  reintroduc- 
ing these  fragments  into  yeast.  The  large  frag- 
ments are  then  propagated  in  the  yeast,  just  like  a 
yeast  chromosome,  and  can  be  grown  in  large 
amounts  for  study.  Nearly  600,000  bp  of  contigu- 
ous DNA  from  the  region  around  the  fragile  site 
has  been  isolated  to  date.  Our  aim  is  to  under- 
stand how  the  chromosome  abnormality  comes 
about  and  what  gene  or  genes  are  responsible  for 
the  retardation. 

Choroideremia 

Choroideremia  is  a  rare  X-linked  disease  of  the 
retina  that  produces  blindness  in  affected  males. 
The  gene  responsible  and  the  mechanism  of  reti- 
nal damage  have  until  recently  been  unknown. 
Our  laboratory  is  using  information  about  where 
the  choroideremia  gene  is  to  identify  it  and  ex- 
plain why  mutations  in  this  gene  cause  the 
disease. 

We  have  been  studying  a  female  patient  in 
whom  choroideremia  has  occurred  because  of  a 
disruption  of  the  choroideremia  gene  caused  by  a 
chromosome  break  in  the  X  chromosome  in  this 
region.  A  transcribed  gene  that  is  disrupted  by 
this  chromosome  translocation  has  been  identi- 
fied and  found  to  be  very  similar,  although  not 
identical,  to  one  isolated  in  the  laboratory  of 
Frans  Cremers  by  his  study  of  males  with  choroi- 
deremia and  submicroscopic  deletions.  The  gene 
identified  in  both  laboratories  bears  no  sequence 
homology  to  any  previously  identified  gene,  and 
there  is  no  information  at  present  as  to  what  the 
protein  encoded  by  this  gene  does  and  why  muta- 
tions in  the  gene  cause  choroideremia. 

The  goal  is  to  learn  more  about  the  function  of 
the  normal  retina  as  well  as  to  find  new  methods 
of  diagnosis  and  treatment  for  this  and  other  re- 
lated retinal  disorders. 
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Lowe's  Syndrome 

Lowe's  syndrome  is  an  uncommon  X-linked 
disease  that  causes  mental  retardation,  cataracts, 
and  kidney  dysfunction.  The  cause  is  unknown. 
As  with  choroideremia,  our  strategy  is  to  identify 
the  Lowe's  syndrome  gene  through  information 
about  its  location. 

Linkage  analysis  with  restriction  fragment 
length  polymorphisms  localized  the  gene  to  the 
Xq25  region  of  the  X  chromosome.  Collaborators 
in  the  United  Kingdom  have  identified  a  Lowe's 
syndrome  patient  in  whom  the  disease  is  caused 
by  a  break  in  Xq25  that  has  disrupted  the  gene, 
splitting  it  into  two  pieces  separated  on  different 


chromosomes.  In  collaboration  with  David  Nel- 
son (Baylor  College  of  Medicine) ,  we  have  iden- 
tified a  segment  of  DNA  in  a  YAC  that  spans  the 
region  of  the  break  in  this  patient.  Segments  of 
DNA  from  within  this  YAC  have  been  shown  to 
contain  gene  sequences  that  are  candidates  for 
containing  the  Lowe's  syndrome  gene. 

Our  goal  is  to  understand  the  biochemical  pro- 
cesses that,  when  defective,  lead  to  brain,  lens, 
and  kidney  dysfunction  and  damage.  Insights  into 
normal  lens  formation  and  normal  brain  and  kid- 
ney function  could  result,  and  methods  for  im- 
proved diagnosis  and  therapy  for  the  disease  may 
be  found. 
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THE  major  goal  of  our  work  is  to  elucidate  how 
intercellular  signals  control  the  proper  ar- 
rangement of  cells  and  tissues  during  early  em- 
hryogenesis. These  signals  can  be  in  the  form  of 
polypeptides  that  are  secreted  from  one  cell  and 
received  by  others  through  binding  to  specific 
receptors.  A  systematic  approach  to  the  identifi- 
cation of  communication  molecules  involved  in 
embryogenesis  is  only  possible  in  organisms 
where  saturation  mutagenesis  can  be  applied  to 
screen  for  genes  with  a  relevant  phenotype.  Both 
in  Drosophila  and  in  Caenorhabditis  elegans, 
such  approaches  have  led  to  the  discovery  of  a 
series  of  developmental  genes,  whose  functions 
can  be  analyzed  by  molecular  cloning.  In  combi- 
nation with  cell  transplantation  and  cell  ablation 
experiments,  this  method  is  extremely  powerful 
and  has  led  to  illuminating  insights. 

From  another  line  of  research,  it  has  become 
clear  that  some  proteins  originally  identified  as 
growth  factors  in  adult  organisms  also  act  as  im- 
portant regulators  early  in  embryogenesis.  And 
other  molecules  implicated  in  embryonic  signal- 
ing have  turned  out  to  be  the  products  of  proto- 
oncogenes:  genes  that  normally  have  essential 
functions  in  the  regulation  of  cell  proliferation, 
but  whose  altered  expression  can  lead  to  cancer. 

The  Wnt/wingless  gene  family  is  one  of  the 
best  examples  of  the  link  between  cancerous 
growth  and  the  control  of  normal  development. 
The  prototypic  member  of  this  group  is  Wnt- 1 ,  an 
oncogene  frequently  activated  in  mouse  mam- 
mary cancer.  The  Wnt-l  gene  is  normally  not  ex- 
pressed in  mammary  gland  or  in  most  other  adult 
tissues,  but  its  transcription  can  be  induced  by 
nearby  insertion  of  proviral  DNA  of  a  retrovirus, 
the  mouse  mammary  tumor  virus.  Wnt-l  encodes 
a  secreted  protein  rich  in  cysteine  residues,  and 
the  gene  is  normally  expressed  only  during  early 
mouse  embryogenesis,  in  particular  in  the  clos- 
ing neural  tube  and  the  developing  brain.  Proof 
that  Wnt- 1  is  an  oncogene  came  from  transfection 
experiments  and  from  the  finding  that,  placed  as 
a  transgene  in  the  germline  of  mice,  it  can  lead  to 
tumor  induction. 

Recently  we  and  others  have  found  that  Wnt- 1 


is  part  of  a  gene  family  comprising,  in  the  mouse, 
at  least  nine  members.  Most  of  these  Wnt-l -re- 
lated  genes  have  a  very  restricted  pattern  of  ex- 
pression during  early  developmental  stages,  and 
at  least  several  members  of  this  group  can  behave 
as  oncogenes  when  activated  by  insertion  of  pro- 
viral  DNA  in  mouse  mammary  tumors.  The  Wnt-5 
gene,  for  example,  is  more  than  50  percent  iden- 
tical to  Wnt- 1  and  is  activated  in  a  low  percentage 
of  mammary  tumors. 

We  have  performed  a  detailed  in  situ  RNA  hy- 
bridization analysis  of  the  expression  of  Wnt-5 
and  a  highly  related  gene,  Wnt-^k.  Both  genes  are 
expressed  in  the  developing  neural  tube,  in  some 
areas  that  overlap  the  expression  domain  of 
Wnt-l  but  also  in  unique  domains.  In  particular, 
the  anterior  boundary  of  expression  of  Wnt-i  and 
Wnt-5^^  is  interesting;  the  genes  are  expressed  in 
the  diencephalon  and  in  the  cerebral  hemi- 
spheres, suggesting  that  they  play  important  roles 
in  establishing  these  compartments  in  the  devel- 
oping brain.  This  could  now  be  tested  by  generat- 
ing mice  with  mutations  in  these  genes,  using  ho- 
mologous recombination  in  embryonic  stem 
cells. 

Our  present  aim  is  to  understand  the  mecha- 
nism of  action  of  the  Wnt- 1  gene  family  during 
embryogenesis  and  to  extrapolate  these  findings 
to  cancerous  growth.  We  wish,  for  example,  to 
identify  the  receptors  for  these  secreted  mole- 
cules. One  of  our  model  systems  is  the  fruit  fly 
Drosophila.  Some  years  ago  we  made  the  surpris- 
ing observation  that  the  homologue  of  the  Dro- 
sophila Wnt- 1  gene  was  identical  to  the  segment 
polarity  gene  wingless.  Because  of  the  extensive 
genetic  analysis  of  Drosophila  embryogenesis,  in 
particular  the  mechanism  of  segmentation,  this 
observation  has  allowed  us  to  study  the  interac- 
tions of  Wnt-l /ivingless  with  other  genes. 

The  basic  body  plan  of  the  fruit  fly  is  set  up  by 
several  classes  of  genes  that  progressively  divide 
the  embryo  into  smaller  compartments:  the  gap 
genes,  the  pair-rule  genes,  and  the  segment  polar- 
ity genes.  The  gap  genes  and  the  pair-rule  genes 
encode  nuclear  proteins  and  are  active  before  the 
Drosophila  embryo  becomes  cellularized,  and 
the  segment  polarity  genes  are  the  first  ones  that 


327 


Function  of  Proto-oncogenes  in  Early  Embryogenesis 


control  cell-cell  interactions.  The  wingless  gene 
is  a  good  example,  encoding  a  secreted  factor, 
but  other  segment  polarity  genes  are  thought  to 
interact  with  wingless.  To  study  the  properties  of 
the  wingless  protein,  we  have  made  antibodies 
that  recognize  the  protein  in  whole-mount  em- 
bryos and  in  individual  cells. 

The  protein  is  seen  on  the  surface  of  cells  and 
in  intracellular  structures  that  constitute  uptake 
vesicles.  Such  structures  are  also  seen  in  cells  ad- 
jacent to  those  that  make  the  wingless  protein, 
suggesting  a  paracrine  mechanism  of  action  of 
the  gene.  When  we  now  look  at  the  distribution 
of  the  wingless  protein  in  embryos  that  are  mu- 
tant for  some  other  segment  polarity  genes,  we 
can  observe  differences  suggesting  that  wingless 
interacts  directly  with  the  products  of  these 
genes. 

In  other  experiments,  we  have  overexpressed 
the  wingless  gene  from  a  heat-shock  promoter 
and  transfected  it  into  established  Drosophila 


cell  lines.  By  deliberately  altering  the  expression 
of  the  gene  in  whole  embryos  or  in  cultured  cells, 
we  are  identifying  additional  genes  that  are  in- 
volved in  the  wingless  signal  transduction  path- 
way. For  example,  our  finding  that  misexpression 
of  wingless  in  an  embryo  results  in  large  areas  of 
pattern  abnormalities  can  be  taken  as  evidence 
that  many,  if  not  all,  cells  can  receive  the  wing- 
less signal  and  must  therefore  have  receptors. 

We  have  also  found  that  wingless  in  Drosoph- 
ila is  also  part  of  a  gene  family,  with  at  least  two 
additional  members.  These  genes,  called  \)Wnt-2 
and  DWnt-5  for  the  time  being,  are  also  ex- 
pressed during  early  embryogenesis,  but  in  char- 
acteristic patterns  that  differ  from  wingless. 

In  the  analysis  of  the  mechanism  of  action  of 
the  Wnt  genes  in  mouse  development,  we  hope 
to  take  advantage  of  the  results  of  the  Drosophila 
work,  conceivably  by  isolating  the  mouse  homo- 
logues  of  those  fly  genes  that  have  been  shown  to 
interact  with  wingless. 
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THIS  laboratory  is  developing  ways  of  under- 
standing the  cells  of  the  immune  system  at 
the  molecular  level,  using  the  tools  of  molecular 
biology  and  genetics. 

The  immune  system  is  responsible  for  protect- 
ing vertebrates  from  both  invasion  by  infectious 
organisms  and  deregulated  growth  of  endoge- 
nous malignancies.  In  order  to  accomplish  this 
task,  the  system  must  be  able  to  distinguish  self 
from  non-self.  Evolution  has  solved  this  problem 
in  higher  vertebrates  by  providing  a  network  of 
cell  types  and  humoral  agents.  It  is  the  lympho- 
cytes— T  cells  and  B  cells — that  direct  the  speci- 
ficity of  immune  responses. 

Although  the  mechanism  of  antigenic  recogni- 
tion differs  for  these  two  cell  types,  the  genera- 
tion of  diversity  in  their  surface  receptors  is 
achieved  in  a  similar  fashion.  In  both  cases  the 
business  end  of  the  receptor  is  created  in  individ- 
ual somatic  cells  by  a  series  of  genetic  recombina- 
tions at  a  minimum  of  two  loci.  For  example,  the 
heavy-  and  light-chain  immunoglobulin  proteins 
that  serve  as  the  B  cell  receptor  derive  from  two 
sets  of  rearranging  genes.  The  same  genes  also 
direct  the  production  of  secreted  antibodies  that 
are  an  important  component  of  the  humoral  im- 
mune system.  Thus  the  regulation  of  T  cell  and  B 
cell  receptor  rearrangements  is  a  central  feature 
of  the  generation  of  immune  responses. 

The  joining  events  that  bring  together  the  im- 
munoglobulin segments  occur  in  an  ordered  and 
regulated  fashion.  In  B  lymphocytes,  rearrange- 
ments begin  at  the  heavy-chain  locus  with  the  re- 
combination of  D  and  J  segments.  This  is  followed 
by  the  joining  of  DJ  with  one  of  100-1,000  vari- 
able-region segments.  After  a  functional  immuno- 
globulin heavy-chain  transcription  unit  is  cre- 
ated, the  light-chain  genes  undergo  a  similar  set 
of  rearrangements. 

One  poorly  understood  aspect  of  these  events 
is  the  ability  of  lymphocytes  to  limit  themselves 
to  the  production  of  a  single  receptor.  Since  pro- 
ductive rearrangements  could  occur  in  two 
heavy-chain  and  four  light-chain  alleles,  a  single 
B  lymphocyte  could  potentially  make  several 
types  of  receptors  including  hybrid  molecules. 


The  mechanism  that  ensures  that  only  one  recep- 
tor is  produced  is  referred  to  as  allelic  exclusion. 
It  is  an  important  safeguard  for  the  immune  sys- 
tem, since  production  of  multiple  receptors  by  a 
lymphocyte  would  dilute  the  specificity  of  any 
given  immune  response. 

Much  of  the  early  work  in  the  area  of  allelic 
exclusion  was  based  on  examining  the  status  of 
immunoglobulin  genes  in  transformed  B  cells. 
The  transformed  cells  are  frozen  in  one  stage  of 
lymphocyte  development  and  for  this  reason 
offer  only  a  static  picture  of  important  regulatory 
events.  Unfortunately  there  is  no  in  vitro  system 
that  faithfully  reproduces  regulated  immunoglob- 
ulin gene  rearrangements.  In  order  to  study  how 
immunoglobulin  genes  can  regulate  allelic  ex- 
clusion, we  turned  to  transgenic  mice. 

Our  approach  has  been  to  introduce  into  the 
germline  of  mice  human  immunoglobulin  genes 
that  have  been  modified  to  direct  the  synthesis  of 
either  membrane-associated  or  secreted  immuno- 
globulin heavy  chains.  We  found  that  the  expres- 
sion of  human  membrane-bound  immunoglobu- 
lin M  results  in  the  exclusion  of  most  endogenous 
mouse  immunoglobulins.  The  secreted  version  of 
the  same  transgene  had  little  effect. 

In  order  to  examine  the  molecular  basis  for  ex- 
clusion by  membrane-associated  IgM,  we  have  de- 
veloped a  sensitive  PGR  assay  for  heavy-chain 
gene  recombination.  This  assay  allows  us  for  the 
first  time  to  examine  specific  heavy-chain  recom- 
bination events  in  pools  of  primary  lymphoid 
cells.  We  are  able  to  assess  D  to  J  segment  recom- 
bination as  well  as  V  to  DJ  joining  events  for  spe- 
cific families  of  variable  regions. 

We  find  that  the  rearrangement  of  the  endoge- 
nous heavy-chain  locus  is  inhibited  in  transgenic 
mice  that  carry  human  membrane  IgM.  Gene  rear- 
rangements are  not  affected  in  the  control  trans- 
genic mice  that  carry  the  secreted  human  IgM 
gene.  Furthermore,  not  all  gene  segments  are 
equally  affected  by  the  membrane  transgene.  The 
large  families  of  variable-region  genes  that  are  in 
the  distal  part  of  the  immunoglobulin  locus  are 
almost  completely  inhibited  from  undergoing  re- 
arrangements, whereas  the  small  families  of  more 
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proximal  variable-region  segments,  as  well  as  the 
D  and  the  J  segments,  are  not  affected.  We  are 
currently  investigating  the  possibility  that  this 
positional  effect  is  related  to  transcription  and 
proximity  to  the  immunoglobulin  heavy-chain 
enhancer. 

,_In  another  area  of  investigation  suggested  by 
our  initial  experiments,  we  are  trying  to  under- 
stand how  a  membrane-anchored  immunoglobu- 
lin produces  a  signal.  Although  the  structure  of 


this  receptor  has  been  known  for  several  years, 
the  mechanism  of  signal  transduction  has  not 
been  elucidated.  Recent  work  has  implicated  ty- 
rosine phosphorylation  as  well  as  GTP  hydrolysis 
in  the  signaling  event,  and  a  series  of  immuno- 
globulin-associated  proteins  have  been  identi- 
fied. We  have  started  to  pursue  this  problem  with 
the  goal  of  developing  a  cell-free  system  that  will 
allow  us  to  isolate  and  reconstitute  the  functional 
components. 


330 


Mechanism  of  DMA  Replication 


Michael  E.  O'Donnell,  Ph.D. — Assistant  Investigator 

Dr.  O'Donnell  is  also  Assistant  Professor  of  Microbiology  at  Cornell  University  Medical  College,  New  York 
City.  He  received  his  Ph.D.  degree  with  Charles  H.  Williams,  Jr.,  from  the  Department  of  Biological  Chem- 
istry at  the  University  of  Michigan,  Ann  Arbor,  on  electron  transfer  in  the  flavoprotein  thioredoxin  re- 
ductase. He  performed  postdoctoral  work  on  Escherichia  coii  replication  with  Arthur  Romberg  and  then 
on  herpes  simplex  virus  replication  with  Bob  Lehman,  both  in  the  Biochemistry  Department  at  Stanford 
University. 


MY  laboratory  is  studying  the  duplication  of 
genetic  information.  By  understanding  the 
fundamental  mechanisms  of  cell  growth,  or  the 
replication  of  DNA,  we  may  obtain  insights  into 
the  development  of  abnormal  cells,  including  tu- 
mor cells. 

The  genetic  material,  the  chromosomes,  is  a 
library  with  all  the  information  needed  for  the 
multitude  of  duties  required  to  maintain  the 
cell's  life.  Included  in  these  duties  is  the  buildup 
of  complete  new  cellular  machinery  for  the  syn- 
thesis of  another  cell  (reproduction) .  The  chro- 
mosome library  is  made  of  two  long  interwound 
helical  fibers  of  DNA  (deoxyribonucleic  acid 
polymers).  Before  a  cell  can  divide  to  form  two 
new  cells,  it  must  duplicate  the  genetic  library  so 
that  each  cell  has  a  complete  copy  of  instructions 
on  how  to  live. 

The  process  of  duplicating  DNA  is  intricate, 
and  the  cell  has  evolved  a  precision  machine  to 
carry  out  this  important  task.  Its  several  protein 
parts  are  like  gears  of  a  machine,  which  coordi- 
nate their  actions  to  unzip  and  unwind  the  dou- 
ble-helical strands  of  DNA.  The  machinery  then 
uses  the  separated  single  strands  as  templates  to 
synthesize  two  double-helical  daughter  chromo- 
somes. Subsequently  these  will  segregate  in  two 
newly  formed  cells. 

The  aim  of  our  research  is  to  understand,  at  a 
molecular  level,  the  workings  of  proteins  in  the 
mechanics  of  DNA  duplication.  The  system  we 
are  studying  is  the  bacterium  Escherichia  coli,  a 
relatively  simple  organism.  The  E.  coli  chromo- 
some is  replicated  by  over  a  dozen  proteins.  Ten 
of  these  are  tightly  bound  into  a  complex  that 
contains  the  DNA  polymerase  activity.  Our  pres- 
ent research  is  focused  on  this  complex,  called 
DNA  polymerase  III  holoenzyme,  which  will  be 
referred  to  below  as  "the  holoenzyme." 

Only  two  proteins  of  the  holoenzyme  have 
well-defined  functions:  a,  the  DNA  polymerase 
protein,  and  e,  an  exonuclease  that  proofreads  the 
product  of  the  polymerase  protein.  Our  aim  is  to 
determine  the  individual  functions  of  the  other 
eight  "accessory  proteins"  of  the  holoenzyme. 
We  hope  that  analysis  of  the  E.  coli  holoenzyme 


will  extend  and  generalize  the  understanding  of 
the  replication  process  in  all  organisms. 

We  have  recently  developed  methods  to  obtain 
nearly  pure  preparations  of  each  protein,  or  sub- 
unit,  of  the  E.  coli  holoenzyme,  and  from  these 
the  whole  complex  can  be  reassembled.  We  have 
studied  the  individual  subunits  for  biochemical 
activities  and  for  their  physical  interactions.  Two 
subunits,  7  and  5,  bind  to  each  other  to  form  a 
complex  that,  upon  binding  to  primed  DNA,  hy- 
drolyses  ATP.  In  the  presence  of  the  ^-subunit, 
the  yb  heterodimer  couples  the  hydrolysis  of  ATP 
to  clamp  a  dimer  of  the  (S-subunit  onto  primed 
DNA.  One  molecule  of  the  7^  heterodimer  can 
clamp  many  jS  dimers  onto  primed  DNAs. 

The  /?  clamp  on  DNA  binds  the  polymerase  sub- 
unit,  tethering  it  to  the  DNA  template.  Whereas 
the  polymerase  alone  is  slow  (20  nucleotides/ 
second),  it  is  greatly  accelerated  upon  binding 
the  |S  clamp  (700  nucleotides/second)  and  repli- 
cates an  entire  8-kb  single-strand  circular  DNA 
without  coming  off  (processive) .  Hence  three  ac- 
cessory proteins  of  the  holoenzyme  (7,  b,  j3)  are 
needed  to  confer  rapid  and  processive  synthesis 
onto  the  polymerase  subunit.  This  fits  nicely  with 
the  fact  that  the  E.  coli  cell  duplicates  its  4  mil- 
lion-base chromosome  within  30  minutes. 

After  the  holoenzyme  has  replicated  the  DNA, 
it  remains  bound  to  it.  However,  upon  addition  of 
primed  DNA  containing  a  |8  clamp,  the  new  clamp 
specifically  seeks  out  polymerase  molecules 
bound  to  completed  DNA  templates,  and  the  /? 
clamp  on  the  fresh  primed  DNA  "steals"  the  poly- 
merase away  from  the  clamp  on  the  completed 
template.  Hence  the  13  protein  appears  to  acceler- 
ate the  polymerase  as  well  as  mediate  its  rapid 
cycling  from  a  completed  DNA  to  a  new  one.  The 
rapid  cycling  is  important  because  one  strand  of 
the  DNA  duplex  (lagging  strand),  as  a  result  of 
the  geometry  of  the  DNA  helix,  must  be  repli- 
cated in  fragments.  Synthesis  of  these  fragments 
requires  that  the  polymerase  be  used  over  and 
over  (cycling)  every  1-2  seconds. 

Two  T-subunits  bind  tightly  to  each  other  (di- 
meric),  and  each  binds  a  polymerase  molecule. 
Hence  the  r-subunit  dimer  serves  as  a  scaffold  to 
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form  a  twin  polymerase.  Since  the  chromosome 
has  two  strands  of  DNA,  both  of  which  must  be 
repHcated,  the  twin  polymerase  likely  serves  the 
function  of  coordinately  replicating  both  DNA 
strands  at  the  same  time. 

These  studies  on  four  (7,  b,  jS,  r)  of  the  eight 
accessory  proteins  of  the  holoenzyme  have  been 
greatly  aided  by  having  the  genes  (the  informa- 
tional area  in  the  DNA)  for  three  of  them  (7,  ^,  r) . 
This  has  provided  large  quantities  of  proteins  for 
studies  via  molecular  cloning  and  overproduc- 
tion techniques. 

Our  future  plans  include  assignment  of  func- 


tion to  the  other  four  accessory  proteins,  and 
identification  of  the  genes  for  the  remaining  five 
accessory  proteins  of  the  holoenzyme  in  order  to 
obtain  them  in  large  quantity.  Possession  of  the 
proteins  in  quantity  would  allow  us  to  start  inves- 
tigations on  the  structure  of  the  holoenzyme. 
Identification  of  their  genes  would  also  allow 
construction  of  E.  colt  genetic  mutants,  which 
would  provide  further  clues  to  the  exact  function 
of  each  subunit  in  replication.  Clearly,  many 
more  studies  by  our  laboratory  and  others  are 
needed  to  gain  a  full  understanding  of  the  mecha- 
nism by  which  duplex  DNA  is  replicated. 
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MOST  human  cells  contain  6  billion  base 
pairs  (bp)  of  DNA.  Embedded  therein  are 
an  unknown  number  of  genes,  perhaps  100,000, 
that  direct  the  biochemical  events  in  the  cells.  At 
present,  well  over  95  percent  of  this  DNA  re- 
mains unexplored. 

Geneticists  have  developed  powerful  methods 
with  which  to  study  DNA  in  small  packets.  Gene- 
splicing  techniques,  DNA  sequencing,  and  meth- 
ods of  reintroducing  altered  DNA  molecules  into 
cells  allow  the  detailed  structural  and  functional 
analysis  of  DNA  molecules  containing  up  to  tens 
of  thousands  of  base  pairs.  Our  laboratory  seeks 
to  extend  these  approaches  to  encompass  mole- 
cules ranging  up  to  millions  of  base  pairs  in  size. 

In  the  short  run,  this  research  should  allow  the 
analysis  of  larger  functional  units  of  DNA — for 
example,  large  human  genes,  clusters  of  coregu- 
lated  genes,  and  such  structures  as  centromeres 
and  telomeres,  which  govern  the  behavior  of  hu- 
man chromosomes  during  the  cell  division  cycle. 
In  the  long  run  these  methods  should  allow  the 
systematic  analysis  of  the  whole  human  genome 
— the  entire  complement  of  DNA  sequences — 
thereby  creating  tools,  such  as  detailed  maps,  that 
would  be  of  permanent  value  in  biology. 

The  heart  of  our  approach  is  a  new  method  of 
cloning  large  DNA  molecules  from  any  organism. 
DNA  cloning,  which  was  the  root  technology  of 
the  recombinant-DNA  revolution  of  the  1970s, 
depends  on  splicing  t-ecfor  sequences  onto  other 
DNA  molecules  to  create  a  new  replicon — that  is, 
a  DNA  molecule  that  will  replicate  inside  a  host 
cell,  allowing  large  numbers  of  identical  copies 
to  be  produced  in  cell  culture.  The  replicons  in 
the  new  cloning  system  are  yeast  artificial  chro- 
mosomes (YACs).  DNA  sequences  in  the  YAC 
vectors  impart  the  properties  of  a  true  yeast  chro- 
mosome to  the  foreign  DNAs  to  which  they  are 
spliced.  Once  introduced  into  host  yeast  cells, 
the  YACs  replicate  at  each  cell  cycle  during  the 
growth  of  the  host  and  segregate  faithfully  into 
the  two  progeny  cells. 

One  advantage  of  YACs  over  previous  cloning 
systems  is  that  there  is  no  absolute  upper  size 
limit.  At  present,  we  can  prepare  large  collec- 


tions of  YAC  clones,  each  containing  a  different 
segment  of  human  DNA,  averaging  hundreds  of 
thousands  of  base  pairs  in  size,  a  1 0-fold  improve- 
ment over  the  capacities  of  previous  cloning  sys- 
tems. Another  advantage  is  that  the  methods  of 
packaging,  maintaining,  and  replicating  DNA  are 
more  similar  in  yeast  than  in  bacteria  to  the  analo- 
gous methods  in  cells  of  higher  organisms.  Con- 
sequently, it  will  likely  be  possible  to  propagate 
more  segments  of  human  DNA  in  yeast  cells  than 
in  bacterial  hosts. 

The  first  phase  of  this  project  has  involved  es- 
tablishing basic  "library"  technology  for  the  YAC 
system.  We  have  prepared  more  than  80,000  YAC 
clones  containing  segments  of  human  DNA  that 
average  250,000  bp  in  size.  A  typical  segment  of 
human  DNA  is  represented  six  to  seven  times  in 
this  library  on  YACs  of  independent  origin.  We 
have  also  developed  efficient  screening  methods, 
based  on  the  polymerase  chain  reaction  (PCR), 
which  allow  YACs  containing  panicular  seg- 
ments of  the  human  genome  to  be  identified  in 
the  library.  Screening  can  be  carried  out  even  if 
one's  only  prior  knowledge  of  the  segment  is  the 
sequence  of  a  mere  100-200  bp  of  the  DNA. 

The  YAC  library  has  yielded  clones  containing 
more  than  200  different  human  genes.  These 
clones  are  being  employed  in  many  collaborating 
laboratories  for  such  purposes  as  aiding  in  the 
search  for  genes  involved  in  specific  human  dis- 
eases. Examples  include  the  successful  search 
during  the  past  year  for  the  gene  that  is  mutated 
in  neurofibromatosis,  carried  out  in  part  in  the 
laboratory  of  Francis  Collins  (HHMI,  University 
of  Michigan).  Andrew  Feinberg  (HHMI,  Univer- 
sity of  Michigan)  has  also  used  YACs  to  character- 
ize a  large  region  of  chromosome  1 1  that  is  impli- 
cated in  the  etiology  of  a  childhood  cancer, 
Wilms'  tumor,  as  well  as  several  developmental 
defects.  YACs  are  also  playing  a  key  role  in  the 
worldwide  effort  to  identify  the  gene  that  is  mu- 
tated in  Huntington's  chorea,  a  fatal,  adult-onset 
neurological  disease. 

Current  research  is  focused  on  improving  the 
power  of  the  YAC  system  for  the  functional  and 
structural  analysis  of  human  DNA.  Functional 
studies  depend  on  the  transfer  of  YAC  clones 
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from  yeast  into  human  cells.  We  have  concen- 
trated on  YACs  containing  two  human  genes, 
HPRT  and  GART.  Both  code  for  enzymes  required 
for  the  synthesis  of  nucleic  acid  precursors  in  hu- 
man cells.  Rodent  cell  lines  are  available  that  do 
not  produce  these  enzymes.  We  have  succeeded 
in  transferring  YACs  containing  the  human  HPRT 
and  GART  genes  into  these  rodent  cells  and  have 
shown  that  the  genes  are  expressed.  A  variety  of 
gene  transfer  methods  have  been  used,  including 
microinjection  and  direct  fusion  of  yeast  sphero- 
plasts  (cells  from  which  the  walls  have  been  re- 
moved) with  rodent  cells. 

Physical  analysis  of  DNA  cloned  into  YACs  also 
remains  a  major  research  interest.  YACs  have 
greatly  improved  our  ability  to  recover  DNA  from 
higher  organisms  in  megabase-pair  blocks.  How- 
ever, much  of  the  biological  interest  in  these 


blocks  occurs  at  the  level  of  the  base  pair,  where 
a  single  change  can  have  profound  conse- 
quences. The  problem  of  analyzing  megabase- 
pair  blocks  of  DNA  at  base-pair  resolution  re- 
mains formidable,  particularly  since  there  is  a 
need  for  generic  solutions  that  can  be  applied  to 
any  large  segment  of  DNA  from  any  organism.  Fur- 
thermore, given  the  amount  of  DNA  present  in 
the  genome  of  higher  organisms  and  the  central 
role  that  its  analysis  plays  in  biomedical  research, 
it  is  essential  that  these  techniques  rely  primarily 
on  advanced  instrumentation  rather  than  skilled 
laboratory  personnel.  The  challenges  and  oppor- 
tunities in  this  area  are  reminiscent  of  those  faced 
by  digital  computing  in  the  1950s:  promising 
technology  was  already  in  hand,  but  vastly  im- 
proved efficiency  was  required  before  its  prom- 
ise could  be  broadly  realized. 
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ALL  blood  cells  derive  from  pluripotent  stem 
cells  in  the  bone  marrow.  The  decision  of 
stem  cells  to  differentiate  leads  to  the  production 
of  a  heterogeneous  population  of  cells  with  vary- 
ing developmental  potentials  and  commitment  to 
expression  of  lineage-specific  protein  products. 
A  goal  of  this  laboratory  is  an  improved  under- 
standing of  hematopoietic  development  and  the 
expression  and  function  of  specific  genes  that  re- 
late to  the  normal  biology  of  hematopoietic  cells. 

Our  efforts  are  concentrated  on  the  analysis  of 
both  red  and  white  blood  cells.  These  cell  types 
are  important  in  severe,  clinically  significant  hu- 
man genetic  disorders  in  which  the  capacity  to 
produce  specific  proteins  is  impaired  by  muta- 
tion. We  seek  to  understand  the  molecular  basis 
of  these  inherited  disorders,  delineate  the  normal 
regulation  of  affected  genes,  and  utilize  what  is 
learned  to  formulate  novel  treatments  based  on 
molecular  biologic  considerations. 

One  of  the  major  disorders  of  red  blood  cells  is 
/3-thalassemia  (Cooley's  anemia),  in  which  the 
synthesis  of  hemoglobin  is  defective.  Through 
molecular  cloning  and  gene  expression  the  mo- 
lecular basis  of  the  disease  was  determined  in  this 
laboratory  several  years  ago.  The  major,  unsolved 
problems  now  relate  to  how  globin  genes  are  nor- 
mally regulated  in  the  developing  erythroid  pre- 
cursor cells.  Specifically,  how  are  globin  genes 
activated  only  in  red  cells  but  not  in  other  tissues? 
How  are  different  globin  genes  regulated  in  devel- 
opment? To  examine  these  general  issues  we  have 
concentrated  on  identifying  and  characterizing 
unique  DNA-binding  proteins  that  appear  to  be 
major  transcriptional  regulators  in  erythroid 
cells. 

A  prominent,  apparently  erythroid-specific 
DNA-binding  protein  was  discovered  that  recog- 
nizes a  small  DNA  motif  found  in  the  promoters 
or  enhancers  of  virtually  all  erythroid-expressed 
genes,  and  the  human,  mouse,  and  frog  homo- 
logues  were  cloned.  The  protein  is  modular,  con- 
sisting of  a  novel  two-finger  structure  required 
for  DNA  binding  and  other  domains  that  serve  as 


potent  activators  of  gene  transcription.  The  ex- 
pression of  this  protein  in  two  other  hematopoi- 
etic cell  types,  megakaryocytes  and  mast  cells, 
suggests  that  it  is  first  expressed  in  a  multipoten- 
tial  progenitor  cell  and  may  regulate  genes  in 
those  cell  types  as  well. 

An  understanding  of  how  this  transcription  fac- 
tor is  itself  regulated  in  erythroid  cells  may  pro- 
vide important  insights  into  the  initial  events  in 
erythroid  decision  making  and  maturation.  Re- 
cent findings  indicate  that  the  gene  is  activated 
early  in  hematopoietic  development  and  is  later 
subject  to  positive  autoregulation  of  its  pro- 
moter. Site-specific  gene  disruption  in  mouse 
embryo  stem  cells  and  generation  of  chimeric  an- 
imals has  also  revealed  that  the  protein  is  essen- 
tial for  normal  erythroid  differentiation  and  that 
other  proteins  binding  to  the  GATA  motif  cannot 
compensate  for  its  absence.  The  focus  of  our  stud- 
ies is  to  understand  in  detail  how  this  transcrip- 
tion factor  functions  in  normal  erythroid  develop- 
ment and  how  its  expression  is  first  turned  on  in 
early  progenitor  differentiation.  Ultimately  these 
studies  may  provide  new  clues  to  differential  reg- 
ulation of  globin  genes  and  the  prospects  for  di- 
rected manipulation  of  their  expression  for  the 
treatment  of  hemoglobinopathies. 

In  a  separate  but  conceptually  related  group  of 
studies  a  gene  that  encodes  an  essential  compo- 
nent of  the  white  blood  cell  (phagocytic)  system 
responsible  for  killing  ingested  microorganisms 
is  being  examined.  We  wish  to  understand  how 
this  clinically  important  host  defense  system  is 
regulated  and,  more  generally,  how  cell-specific 
gene  expression  is  achieved  in  this  lineage,  also 
descendent  from  the  pluripotent  stem  cell.  The 
gene  under  study  encodes  a  subunit  of  a  unique 
cytochrome  that  is  defective  in  chronic  granulo- 
matous disease,  an  X-linked  condition.  By  reverse 
genetics  (positional  cloning)  we  previously  iso- 
lated the  relevant  gene,  determined  its  structure, 
and  demonstrated  the  presence  of  protein  prod- 
uct in  the  cytochrome  complex  of  phagocytic 
cells.  In  addition,  because  interferon-7  stimu- 
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lates  phagocytic  cell  function  generally  as  well  as 
expression  of  the  cytochrome,  it  was  also  possi- 
ble to  show  that  this  lymphoid  is  clinically  effec- 
tive in  chronic  granulomatous  disease.  Studies 
have  identified  several  point  mutations  in  the  cy- 
tochrome that  interfere  with  protein  function  in 
vivo.  We  are  now  working  to  define  the  elements 
of  the  gene  responsible  for  cell-restricted  (i.e., 
phagocytic  cell)  gene  expression.  Through  the 
use  of  transgenic  mice,  we  identified  a  DNA  frag- 
ment suflBcient  for  targeting  reporter  or  onco- 
genes in  phagocytic  cells.  Coupled  to  an  onco- 


gene, this  fragment  leads  to  the  development  of 
an  inherited  malignancy  of  phagocytes  in  mice. 

Efforts  now  are  focused  on  the  characterization 
of  the  cellular  components  that  account  for  lin- 
eage-specific regulation.  In  the  end,  through  an 
understanding  of  normal  regulation  in  white 
blood  cells,  we  envision  improved  capabilities  to 
modulate  gene  expression  and  differentiation  in 
both  health  and  disease.  This  will  facilitate  on- 
going strategies  to  develop  new  approaches  to 
the  treatment  of  genetic  disease  by  gene  transfer 
into  marrow  stem  cells  (somatic  gene  therapy) . 
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TRANSGENIC  mice  are  used  in  my  laboratory 
as  a  model  system  to  study  mammalian  gene 
regulation  and  embryonic  development.  Such 
mice  are  generated  by  microinjection  of  purified 
fragments  of  DNA  into  mouse  embryos  at  the  one- 
cell  stage.  The  embryos  are  then  transferred  into 
the  reproductive  tracts  of  suitably  prepared  fe- 
male mice  and  allowed  to  develop  to  term.  In  a 
typical  experiment,  20-30  percent  of  the  new- 
born mice  are  found  to  have  integrated  the  in- 
jected DNA  stably  into  their  genome.  The  new 
DNA  is  termed  transgenic  DNA. 

Through  recombinant  DNA  technology,  the 
DNA  for  microinjection  can  be  specifically  de- 
signed and  assembled  from  previously  character- 
ized fragments  of  DNA.  The  microinjected  DNA, 
once  it  has  integrated,  is  duplicated  along  with 
the  rest  of  the  genome  at  each  cell  division  and  is 
therefore  present  in  all  cells  of  the  transgenic 
mice.  The  mice  are  analyzed  in  detail  to  deter- 
mine whether  the  new  DNA  causes  changes  in  the 
development  or  behavior  of  the  mice. 

The  new  DNA  can  work  through  at  least  two 
different  mechanisms.  If  the  new  DNA  is  ex- 
pressed— i.e.,  if  it  encodes  a  protein — the  char- 
acteristics of  the  mice  can  be  altered  in  a  "domi- 
nant" fashion.  Alternatively,  the  transgenic  DNA 
may  cause  a  "recessive"  mutation  as  a  result  of  its 
novel  position  in  the  genome.  When  the  new 
DNA  integrates,  the  normal  organization  of  the 
adjacent  endogenous  genes  may  be  disrupted,  re- 
sulting in  recessive  inactivation  of  those  genes. 
Transgenic  mice  are  typically  analyzed  for  both 
dominant  and  recessive  changes  in  their  pattern 
of  development. 

Much  of  our  research  is  directed  at  studies  of 
eye  development.  The  lens  of  the  eye  is  known  to 
express  a  set  of  proteins  called  crystallins.  The 
crystallins  are  essential  for  normal  transparency 
of  the  lens,  and  certain  crystallins  are  expressed 
exclusively  in  the  lens,  implying  that  lens-spe- 
cific regulatory  sequences  must  be  associated 
with  those  genes.  In  collaboration  with  Joram 
Piatigorsky  and  Heiner  Westphal  at  the  National 
Institutes  of  Health,  experiments  were  done  to 
identify  lens-specific  regulatory  sequences.  Re- 


combinant DNA  techniques  were  used  to  link  a 
putative  crystallin  regulatory  sequence  to  coding 
sequences  from  a  bacterial  gene.  The  recombi- 
nant DNA  construct  was  used  to  generate  trans- 
genic mice.  When  the  mice  were  assayed  for  ex- 
pression of  the  bacterial  gene,  activity  was 
detected  only  in  the  lens.  This  result  demon- 
strated that  a  small  stretch  of  DNA  located  near 
the  transcription  initiation  site  of  a  crystallin 
gene  was  sufficient  to  provide  lens-specific  regu- 
lation of  gene  expression. 

A  subsequent  experiment  was  done  to  generate 
transgenic  mice  that  would  express  an  oncogene 
in  their  lenses.  An  oncogene  encodes  a  protein 
that  is  thought  to  be  capable  of  predisposing  cells 
to  cancer  formation.  A  construct  was  made  that 
contained  the  crystallin  regulatory  sequences 
linked  to  an  oncogene  from  a  tumor  virus,  and 
transgenic  mice  were  generated.  The  mice  dis- 
played dramatic  bilateral  cataracts,  and  their 
lenses  were  found  to  contain  rapidly  growing  tu- 
mor cells.  When  the  embryonic  development  of 
the  lenses  was  studied,  it  became  apparent  that 
tumor  cell  proliferation  began  shortly  after  the 
onset  of  expression  of  the  oncogene.  These  ex- 
periments showed  that  expression  of  the  onco- 
gene was  sufficient  to  induce  tumor  formation. 
These  transgenic  mice  provide  a  model  system  in 
which  to  study  the  mechanisms  by  which  an  on- 
cogene can  alter  cellular  differentiation  and 
proliferation. 

Over  the  past  two  years,  regulatory  sequences 
that  give  cell-specific  gene  expression  in  other 
regions  of  the  eye  have  been  characterized.  In 
collaboration  with  Gregory  Liou,  a  promoter  that 
is  active  in  the  photoreceptor  cells  of  the  retina 
has  been  identified.  The  promoter  comes  from  a 
gene  that  encodes  a  retinoid-binding  protein. 
This  protein  plays  a  role  in  the  recycling  of  reti- 
nal, a  vitamin  A  derivative  that  is  essential  for  nor- 
mal vision.  The  identification  of  a  photoreceptor- 
specific  promoter  will  allow  future  studies  of 
retinal  function  and  development. 

Studies  have  also  been  done  to  identify  a  pro- 
moter that  is  active  in  the  pigmented  regions  of 
the  eye.  In  both  mice  and  humans,  loss  of  pig- 
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ment  synthesis  results  in  albinism.  Albinism  is  of- 
ten accompanied  by  a  loss  of  activity  of  the  en- 
2yme  tyrosinase,  suggesting  that  albinism  might 
be  due  to  a  mutation  in  the  tyrosinase  gene.  To 
test  this  hypothesis,  we  constructed  a  nonmutant 
tyrosinase  minigene  and  introduced  it  into  the 
genome  of  albino  mice.  The  resultant  transgenic 
mice  were  pigmented  and  could  be  identified  at 
birth  by  simple  visual  inspection.  The  tyrosinase 
minigene  was  expressed  properly  in  the  different 
pigmented  cell  types  within  the  eye.  These  re- 
sults confirm  the  hypothesis  that  albinism  is  due 
to  loss  of  endogenous  tyrosinase  activity. 
Through  genomic  sequencing  studies,  the  albino 
mutation  in  laboratory  mice  was  subsequently 
identified  as  a  single-base  pair  change  that 
causes  a  conversion  from  cysteine  to  serine  at 
amino  acid  103  of  the  tyrosinase  protein.  The  ty- 
rosinase minigene  is  a  useful  new  research  tool, 
since  it  allows  transgenic  mice  to  be  recognized 
by  simple  visual  inspection. 

The  other  major  focus  of  our  research  is  the 
characterization  of  recessive  mutations  that  are 
caused  by  integration  of  transgenic  DNA.  Such 
mutations  are  valuable,  because  the  transgenic 
DNA  can  be  used  to  isolate  the  inactivated  genes. 
Once  the  transgenic  DNA  has  integrated  into  the 
genome,  it  becomes  linked  to  adjacent  genomic 


sequences.  Using  recombinant  DNA  technology, 
we  can  fragment  the  genome  into  a  large  number 
of  segments,  and  the  segments  can  be  individu- 
ally cloned  to  give  a  genomic  "library."  The  li- 
brary can  be  screened  by  hybridization  tech- 
niques to  identify  the  clones  that  contain  the 
transgenic  DNA  along  with  the  adjacent  genomic 
sequences.  In  this  fashion,  the  transgenic  DNA 
provides  a  molecular  tag  that  can  be  used  to  iso- 
late genes  that  play  essential  roles  in  normal 
mammalian  development. 

In  one  of  our  mouse  families,  the  transgenic 
insert  has  inactivated  the  downless  gene,  a  gene 
essential  for  normal  induction  of  hair  follicle  for- 
mation. In  another  transgenic  family,  there  is  a 
mutation  that  blocks  sperm  development.  For 
both  of  these  mutants,  the  genomic  sequences 
that  flank  the  transgenic  DNA  have  been  isolated. 
Putative  coding  regions  have  been  identified  and 
sequenced,  and  efforts  are  in  progress  to  confirm 
that  the  proteins  encoded  by  these  regions  play 
essential  roles  in  specific  stages  of  development. 

Such  studies  can  provide  important  insights 
into  the  mechanisms  that  specify  differentiation 
and  morphogenesis  during  mammalian  embry- 
onic development,  and  may  provide  clues  for 
strategies  to  help  prevent  birth  defects  in 
humans. 


338 


Structural  Studies  of  DNA-binding  Proteins 


Carl  O.  Pabo,  Ph.D. — Associate  Investigator 

Dr.  Pabo  is  also  Professor  in  the  Departments  of  Molecular  Biology  and  Genetics  and  of  Biophysics  at  the 
Johns  Hopkins  University  School  of  Medicine.  He  received  his  undergraduate  degree  from  Yale  University, 
where  he  majored  in  molecular  biophysics  and  biochemistry.  He  did  his  graduate  work  in  Mark  Ptashne's 
laboratory  at  Harvard  University,  where  he  continued  his  research  as  a  Jane  Coffin  Childs  fellow  in  the 
laboratories  of  Stephen  Harrison  and  Don  Wiley.  In  July  1991,  Dr.  Pabo  will  move  to  the  Massachusetts 
Institute  of  Technology  as  Professor  in  the  Department  of  Biology. 


WE  are  interested  in  understanding  how  pro- 
teins recognize  specific  sites  on  double- 
stranded  DNA  and  how  the  bound  proteins  regu- 
late gene  expression.  We  would  like  to  know 
what  structural  motifs  are  used  by  DNA-binding 
proteins,  what  side  chains  make  sequence- 
specific  contacts,  and  whether  there  are  any  re- 
curring patterns  or  rules  for  recognition  of  sites 
on  double-stranded  DNA.  Much  of  our  current  re- 
search has  focused  on  characterizing  the  major 
structural  motifs  found  in  DNA-binding  proteins. 
We  hope  to  use  this  information  to  design  novel 
DNA-binding  proteins  for  research,  diagnosis, 
and  therapy. 

Prokaryotic  repressors  provide  useful  model 
systems  for  the  study  of  protein-DNA  interactions, 
and  we  are  continuing  to  study  several  bacterial 
repressors.  The  repressor  from  the  bacteriophage 
X  uses  a  helix-turn-helix  motif  and  an  extended 
amino-terminal  arm  to  contact  sites  in  the  major 
groove .  The  arc  repressor  from  Salmonella  bacte- 
riophage P22  uses  a  (8-sheet  for  site-specific  rec- 
ognition. The  major  developments  in  our  labora- 
tory during  the  past  year,  however,  have  involved 
studies  of  two  of  the  key  motifs — the  homeodo- 
main  and  the  zinc  finger — that  are  used  by  eukar- 
yotic  regulatory  proteins. 

Crystal  Structures  of  Homeodomain-DNA 
Complexes 

The  homeodomain  is  a  conserved  structural 
motif  found  in  many  eukaryotic  proteins  that  reg- 
ulate development  and  cell  fate.  To  understand 
how  this  motif  recognizes  DNA  and  how  this  is 
related  to  the  helix-turn-helix  motif  seen  in  pro- 
karyotic repressors,  we  have  determined  the 
crystal  structures  of  two  homeodomain-DNA 
complexes. 

We  began  by  studying  the  homeodomain  from 
the  engrailed  protein,  which  plays  a  key  role  in 
Drosophila  development.  (This  project  is  a  col- 
laboration with  Thomas  Kornberg  at  the  Univer- 
sity of  California,  San  Francisco.)  We  were  able  to 
grow  good  cocrystals  of  the  homeodomain-DNA 
complex,  and  Chuck  Kissinger  solved  the  struc- 
ture of  this  complex.  The  homeodomain  makes 
contacts  in  both  the  major  and  minor  grooves. 


The  helix-turn-helix  unit  makes  critical  contacts 
in  the  major  groove,  but  the  orientation  of  this 
helix-turn-helix  unit  with  respect  to  the  DNA  is 
different  than  the  arrangements  observed  with 
the  prokaryotic  repressors.  Residues  near  the 
amino-terminal  end  of  the  homeodomain  form  an 
extended  "arm"  that  fits  into  the  minor  groove 
and  makes  additional  site-specific  contacts. 

We  also  have  been  studying  a  complex  contain- 
ing the  homeodomain  from  the  a 2  protein, 
which  helps  to  regulate  mating  type  in  yeast. 
(This  project  is  a  collaboration  with  Alexander 
Johnson  at  the  University  of  California,  San  Fran- 
cisco.) Cynthia  Wolberger  has  recently  solved 
this  structure.  The  overall  arrangement  of  the 
helix-turn-helix  unit  and  the  amino-terminal  arm 
are  similar  to  the  arrangement  seen  in  the  en- 
grailed complex,  but  there  are  a  number  of  dif- 
ferences in  the  critical  side  chains  used  for  recog- 
nition. There  also  is  a  rich  background  of 
biochemical  and  genetic  data  about  a  2  that 
should  help  us  understand  the  precise  role  of  this 
homeodomain  in  recognition  and  regulation. 

Structure  of  a  Zinc  Finger-DNA  Complex 

The  zinc  finger  domain,  which  contains  about 
30  amino  acids,  is  another  key  DNA-binding  motif 
that  is  found  in  a  large  family  of  eukaryotic  regula- 
tory proteins.  Studies  from  other  groups  have 
shown  that  each  finger  contains  an  antiparallel 
(8-sheet  and  an  a-helix,  but  little  has  been  known 
about  how  these  fingers  recognize  DNA.  Nikola 
Pavletich  recently  solved  the  structure  of  a  com- 
plex containing  three  zinc  fingers  from  a  murine 
transcription  factor.  Starting  with  cDNA  for  the 
zif 268  gene  (provided  by  Daniel  Nathans,  HHMI, 
at  the  Johns  Hopkins  University) ,  he  cloned  and 
expressed  a  three-finger  peptide  and  crystallized 
the  peptide-DNA  complex.  The  zinc  fingers  rec- 
ognize B-DNA  and  fit  into  the  major  groove.  Each 
finger  makes  its  primary  contacts  with  a  3-base 
pair  "subsite,"  and  side  chains  near  the  amino- 
terminal  end  of  the  a-helix  make  the  critical  con- 
tacts with  the  bases.  Since  the  fingers  are  used  in  a 
modular  fashion,  they  may  be  the  ideal  motif  to 
use  as  we  try  to  design  novel  DNA-binding 
proteins. 
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Tools  for  Analyzing  Protein-DNA 
Interactions 

We  are  working  on  several  other  projects  that 
should  help  to  develop  the  background  for  a  sys- 
tematic analysis  of  protein-DNA  interactions. 

Because  structural  analysis  often  is  limited  by 
the  ability  to  obtain  suitable  crystals,  w^e  are  try- 
ing to  improve  methods  for  the  cocrystallization 
of  protein-DNA  complexes.  Our  initial  approach 
involved  systematic  changes  in  the  length  of  the 
DNA  site  and  required  that  the  entire  site  be  re- 
synthesized  for  each  experiment.  We  now  have 
encouraging  preliminary  results  with  a  linker  co- 
crystallization  scheme  that  combines  the  protein, 
the  binding  site,  and  a  library  of  DNA  linkers  that 
can  be  used  with  any  complex.  This  strategy  may 


allow  a  dramatic  increase  in  the  number  of  co- 
crystallization  conditions  that  can  be  tested. 

We  have  also  been  developing  testing  strate- 
gies for  computer-aided  protein  design.  Our  pro- 
grams can  systematically  consider  a  large  number 
of  sequences  and  conformations,  and  we  are  us- 
ing these  programs  as  we  attempt  to  design  zinc 
finger  proteins  that  will  recognize  novel  binding 
sites.  We  also  are  developing  genetic  strategies 
for  selecting  zinc  finger  proteins  that  recognize 
desired  target  sequences.  We  hope  that  this  com- 
bination of  structural  analysis,  computer-aided 
protein  design,  and  genetic  selection  will  give  us 
a  better  understanding  of  protein-DNA  recogni- 
tion and  allow  us  to  design  zinc  finger  proteins 
that  recognize  novel  target  sites. 
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Crystal  Structure 
of  Homeodomaln-DNA  Complex 


A  representation  of  the  structure  of  the  engrailed  homeodomain-DNA  complex 
as  determined  by  x-ray  crystallography.  The  protein  ( shown  in  red  and  yellow ) 
binds  to  the  DNA  (blue)  and  makes  numerous  contacts  with  a  4-base  pair 
region  (purple)  of  the  double-helical  DNA.  One  a- helix  from  the  homeodomain 
(which  looks  like  a  red  circle  in  projection)  fits  into  the  major  groove  of  the 
DNA,  and  an  extended  region  of  the  protein  backbone  ( which  appears  as  a  red 
ribbon  in  the  figure)  fits  into  the  minor  groove  of  the  DNA. 

This  structure  was  determined  by  Charles  Kissinger  and  Carl  Pabo.  Illustra- 
tion originally  printed  as  cover  o/Cell  63(3),  1990.  See  also  Kissinger,  C.R., 
Liu,  B.,  Martin- Blanco,  E.,  Kornberg,  T.B.,  and  Pabo,  CO.  1990.  Cell 
63:579-590. 


Y  chromosome 


XX  male 
XY  female 


The  tiny  region  of  the  human  Y  chromosome  implicated  in  both  sex  determination  and  Turner 
syndrome  is  shown  in  expanded  view  below  the  schematic  diagram  of  the  entire  Y  chromosome 
( the  black  oval  represents  the  centromere  of  the  Y  chromosome).  The  region  contains  the  genes 
SRY,  RPS4Y,  and  ZFY.  The  black  lines  at  the  bottom  of  the  diagram  depict  the  portion  of  the  Y 
present  in  two  pivotal  patients,  one  an  XX  male  and  the  other  an  XY  female  (lacking  features  of 
Turner  syndrome). 
Research  of  David  C.  Page. 
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TO  a  large  degree,  human  individuals  vary  in 
physical  characteristics  because  of  the  im- 
pact of  genetic  variation  on  the  course  of  embry- 
onic development.  The  human  genome  is  orga- 
nized into  23  pairs  of  chromosomes,  each 
believed  to  carry,  on  average,  about  5,000  genes. 
When  considering  genetic  variability,  it  is  impor- 
tant to  distinguish  between  variation  in  a  single 
gene,  called  Mendelian,  and  massive  multigene 
variation,  called  chromosomal. 

As  discussed  elsewhere  in  this  volume,  single- 
gene  defects  are  responsible  for  certain  condi- 
tions, such  as  color  blindness,  cystic  fibrosis,  and 
muscular  dystrophy.  Other  conditions,  such  as 
Down  syndrome,  appear  to  be  the  result  of 
"wholesale"  abnormalities  affecting  an  entire 
chromosome.  As  reductionists  and  molecular  bi- 
ologists, we  proceed  on  the  assumption  that  the 
developmental  consequences  of  chromosomal  ge- 
netic variability  will  ultimately  be  understood  in 
terms  of  individual  genes  and  their  particular 
functions. 

My  colleagues  and  I  are  seeking  to  understand 
how  massive  variability  in  one  chromosome  pair 
— the  sex  chromosomes — dramatically  affects 
the  course  of  development.  Embryos  normally  in- 
herit one  sex  chromosome  from  each  parent.  The 
mother  contributes  an  X  chromosome,  and  the 
father  contributes  either  an  X  or  a  Y.  Thus  normal 
embryos  have  one  of  two  sex  chromosome  consti- 
tutions, XX  or  XY. 

Sex  Determination 

In  both  humans  and  mice  the  presence  or  ab- 
sence of  the  Y  chromosome  determines  whether 
an  embryo  develops  as  a  male  or  a  female.  XX 
embryos  become  females,  XY  embryos  males.  For 
years  scientists  wondered  whether  the  Y  chromo- 
some carried  few  or  many  sex-determining  genes 
and  how  those  were  distributed  along  the 
chromosome. 

We  have  found  that  the  entire  sex-determining 
function  can  be  traced  to  one  tiny  portion  of  the 
human  Y  chromosome.  This  sex-determining  re- 
gion was  identified  by  studying  DNA  from  "XX 
males"  and  "XY  females."  XX  males  have  small 


testes  and  are  sterile.  XY  females  are  also  sterile 
and  do  not  develop  secondary  sexual  characteris- 
tics. We  found  that  almost  all  XX  males  had  in- 
herited a  small  bit  of  the  Y  chromosome  attached 
to  one  of  their  X  chromosomes.  Conversely,  some 
XY  females  lacked  the  same  segment  of  the  Y  that 
was  present  in  XX  males.  On  the  basis  of  patterns 
of  chromosomal  deletions  found  in  such  patients, 
we  constructed  a  map  of  the  Y  chromosome.  It 
was  then  we  came  to  recognize  that  the  presence 
or  absence  of  one  small  region,  about  0.4  percent 
of  the  chromosome,  correlated  well  with  gender. 

Detailed  analysis  of  XX  males  suggests  that  one 
or  more  genes  within  this  relatively  small  seg- 
ment of  the  Y  chromosome  determine  the 
outcome  of  sexual  development.  Laboratories 
around  the  world  have  scoured  this  small  region 
searching  for  such  sex-determining  genes.  We  are 
now  conducting  a  variety  of  experiments  to  char- 
acterize the  functions  of  two  genes  in  the  region: 
ZFY,  which  we  identified  a  few  years  ago,  and 
SRY,  a  gene  recently  described  by  British 
scientists. 

Both  ZFFand  5i?Fappear  to  encode  DNA-bind- 
ing  proteins  that  are  likely  to  regulate  the  tran- 
scriptional activity  of  particular  but  unknown 
target  genes.  The  evidence  is  strong  that  5.^Fis  a 
sex-determining  gene.  Much  less  clear  is  the  role, 
if  any,  that  ZFF plays  in  the  process.  We  hope  to 
learn  more  about  the  function  of  the  ZFF gene  by 
simultaneously  analyzing  a  closely  related  gene, 
ZFX,  that  we  identified  on  the  X  chromosome. 

Turner  Syndrome 

As  mentioned  earlier,  embryos  normally  have 
two  sex  chromosomes.  However,  about  1-2  per- 
cent of  all  human  embryos  have  only  one.  The 
vast  majority  of  such  XO  embryos  are  lost  to  spon- 
taneous miscarriage,  but  a  few  survive.  The  sur- 
viving XO  embryos  develop  as  females  with  a  par- 
ticular set  of  physical  features  known  as  Turner 
syndrome,  which  includes  short  stature,  webbing 
of  the  neck,  puffiness  of  the  hands  and  feet,  and 
failure  of  secondary  sexual  development.  It  had 
been  postulated  that  Turner  syndrome  might  be 
the  result  of  having  a  single  copy  of  one  or  more 
genes  common  to  the  X  and  Y  chromosomes. 
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Nothing  was  known  as  to  the  number  or  nature  of 
these  hypothetical  Turner  genes. 

We  began  to  focus  our  attention  on  this  dis- 
order when  it  was  noticed  that  certain  XY  females 
exhibit  the  same  anatomic  abnormalities  as  XO 
females.  A  pivotal  finding  was  that  all  such  XY 
Turner  females  lacked  a  portion  of  the  Y  chromo- 
some. We  postulated  that  the  Y  chromosomal  de- 
letions in  these  individuals  might  encompass  not 
only  a  sex-determining  gene  or  genes,  but  also  a 
nearby  Turner  gene  or  genes. 

Pursuing  this  hunch,  we  discovered  two  candi- 
date Turner  genes,  one  on  the  Y  chromosome  and 
one  on  the  X.  These  genes,  named  RPS4Y  and 
RPS4X,  appear  to  encode  slightly  different  forms 
of  a  protein  constituent  of  the  ribosome,  a  struc- 
ture required  for  protein  synthesis  and  vital  to  all 
cells.  In  embryos  lacking  a  second  RPS4  gene 
(i.e.,  having  a  single  RPS4),  the  rate  at  which 
ribosomes  are  constructed  may  be  slowed,  in  turn 
reducing  the  embryo's  capacity  to  synthesize 


other  proteins.  We  are  currently  testing  the 
highly  speculative  hypothesis  that  such  a  reduc- 
tion in  protein  synthetic  capacity  is  the  cause  of 
at  least  some  of  the  physical  features  of  Turner 
syndrome. 

An  interesting  analogy  can  be  found  in  the  fruit 
fly  Drosophila  melanogaster.  There,  deficien- 
cies in  ribosomal  protein  genes  are  associated 
with  a  particular  "syndrome"  called  the  Minute 
(pronounced  mi-NUTE)  phenotype,  which 
includes  reduced  body  size,  diminished  via- 
bility and  fertility,  and  specific  anatomic 
abnormalities. 

The  very  existence  of  related  but  nonidentical 
ribosomal  protein  genes  on  the  X  and  Y  chromo- 
somes suggests  that  the  ribosomes  of  human 
males  may  differ  slightly  from  those  of  females.  It 
will  be  a  surprise  to  many  if  the  differences  be- 
tween the  sexes  extend  all  the  way  down  to  the 
most  fundamental  and  vital  of  intracellular 
machines! 
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ABOUT  1 0  years  ago  we  began  a  fruitful  col- 
laboration with  Ralph  Brinster's  laboratory 
at  the  University  of  Pennsylvania.  Together  we 
helped  develop  methods  for  introducing  func- 
tional genes  into  all  cells  of  the  mouse.  The  genes 
under  study  are  manipulated  in  bacterial  plas- 
mids,  using  standard  recombinant  DNA  tech- 
niques. Then  the  regions  of  interest  are  excised 
from  the  plasmid,  and  a  few  hundred  copies  are 
injected  into  the  pronucleus  of  a  fertilized  mouse 
egg  (or  that  of  any  other  mammal) . 

Remarkably,  the  DNA  integrates  about  30  per- 
cent of  the  time  into  one  of  the  chromosomes 
prior  to  replication,  and  the  genes  are  inherited 
by  all  daughter  cells,  as  any  other  gene  would  be. 
Furthermore,  many  of  the  genes  are  functional, 
imparting  new  genetic  characteristics  to  the  ani- 
mal. Mice  and  other  animals  carrying  foreign  DNA 
are  referred  to  as  transgenic.  Because  the  new 
genes  are  also  in  the  germ  cells,  they  are  usually 
transmitted  to  subsequent  generations. 

One  of  our  goals  has  been  to  discover  what 
parts  of  a  gene  determine  when,  where,  and  how 
efficiently  it  will  be  utilized.  We  often  start  by 
testing  a  large  piece  of  DNA  that  includes  the 
gene  of  interest.  In  transgenic  animals,  the  gene 
will  usually  be  expressed  at  the  appropriate  time 
and  place,  even  though  it  has  integrated  at  an  ab- 
normal chromosomal  location  and  may  be  de- 
rived from  a  different  mammalian  species.  Then 
we  delete  various  regions  of  the  genes  and,  with 
each  variant,  make  transgenic  mice  to  determine 
what  regions  are  essential  for  appropriate 
expression. 

For  example,  we  have  delineated  a  small  re- 
gion (125  base  pairs)  of  the  rat  elastase  I  gene 
that  is  essential  for  the  gene's  expression  in  the 
acinar  cells  of  the  pancreas.  Furthermore,  this  se- 
quence (often  called  an  enhancer)  can  be  used  to 
direct  the  expression  of  another  gene  (e.g.,  the 
growth  hormone  gene)  to  the  acinar  cells,  and 
the  sequence  will  function  when  positioned  al- 
most anywhere  in  the  vicinity  of  the  growth  hor- 
mone gene. 

In  similar  experiments,  we  have  been  identify- 
ing sequences  responsible  for  directing  appro- 


priate expression  of  globin  genes  in  red  blood 
cells,  albumin  to  hepatocytes,  and  protamine  I  to 
male  germ  cells.  More  recently,  we  have  begun  to 
locate  the  elements  involved  in  directing  the  ex- 
pression of  genes  in  the  catecholamine  biosynthe- 
sis pathway  to  specific  neurons. 

Because  the  regulatory  elements  from  one  gene 
can  often  be  used  to  control  another,  the  expres- 
sion of  many  interesting  genes  can  be  directed  to 
a  particular  cell  type  and  the  consequences  on 
cellular  development  and  function  can  be  as- 
sessed. For  example,  using  the  elastase  enhancer 
element,  we  have  been  able  to  make  strains  of 
mice  that  reproducibly  develop  pancreatic 
cancer  as  a  consequence  of  expressing  the  trans- 
forming gene  from  SV40  virus,  the  mouse  myc 
gene,  or  the  human  H-ras  oncogene.  Similarly, 
we  have  developed  models  of  liver  cancer  by  di- 
recting the  expression  of  these  genes  to  hepato- 
cytes with  the  albumin  enhancer. 

Significantly,  each  of  these  genes  results  in  a 
characteristic  morphological  transformation  of 
the  organ,  which  probably  reflects  the  particular 
cellular  events  that  the  genes  mediate.  By  means 
of  simple  genetic  crosses,  mice  carrying  any  pair 
of  these  transforming  genes  can  be  created.  They 
develop  tumors  that  appear  more  rapidly  and  are 
more  aggressive  than  those  in  mice  carrying  a 
single  gene,  suggesting  that  these  genes  act 
cooperatively. 

We  have  recently  shown  that  the  expression  of 
genes  not  generally  considered  oncogenes  may 
also  predispose  cells  to  malignant  transformation 
and  cancerous  growth.  In  one  example,  we  ex- 
pressed the  surface  antigens  of  hepatitis  B  virus 
(HBV)  in  the  liver,  using  the  albumin  enhancer. 
HBV  infects  millions  of  people  worldwide,  and 
the  incidence  of  liver  cancer  among  them  is  high. 
In  transgenic  mice,  expression  of  this  gene  re- 
sulted in  synthesis  of  the  viral  surface  antigen  and 
envelope  protein,  which  aggregated  within  the 
secretory  apparatus  of  the  liver  cells,  causing  cel- 
lular injury  and  death.  When  the  mice  were  more 
than  a  year  old,  they  developed  liver  cancer.  In  a 
similar  case,  expression  in  liver  of  plasminogen 
activator,  a  protease,  results  in  selection  of  cells 
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that  inactivate  the  protease-encoding  transgene. 
These  cells  grow  and  repopulate  the  liver,  and 
the  mice  seem  normal.  However,  they  also  suc- 
cumb to  liver  cancer  when  they  are  one  to  two 
years  old.  In  both  cases,  we  postulate  that  liver 
regeneration  in  a  toxic  environment  of  liver  in- 
jury results  in  genetic  damage  that  predisposes 
the  cells  to  malignant  transformation. 

It  is  also  possible  to  develop  transgenic  mice 
that  mimic  some  human  genetic  diseases.  For  ex- 
ample, we  have  recently  made  a  model  of  human 
sickle  cell  disease.  By  introducing  into  mice  both 
human  a-  and  /3-globin  genes  under  control  of  the 
locus  control  region  (a  newly  discovered  genetic 
element  essential  for  high-level  expression  of  the 
j8-globin  gene) ,  we  have  generated  mice  that  pro- 
duce as  much  human  hemoglobin  as  mouse  he- 
moglobin. When  the  mutant  |S-globin  gene  from 
people  with  sickle  cell  disease  is  substituted  for 
the  normal  gene  in  these  experiments,  the  red 


blood  cells  of  the  mice  sickle  under  appropriate 
conditions.  These  mice  may  be  a  valuable  re- 
source for  testing  experimental  therapies. 

A  long-range  goal  is  to  use  transgenic  mice  to 
study  aspects  of  neural  development.  We  have 
started  by  cloning  the  genes  involved  in  the  syn- 
thesis of  catecholamine  neurotransmitters:  dopa- 
mine, norepinephrine,  and  epinephrine.  These 
genes  are  expressed  in  certain  neurons  of  the 
central  nervous  system,  in  the  peripheral  nervous 
system,  and  in  the  adrenal  medulla.  The  control 
elements  from  these  genes  are  being  tested  in 
conjunction  with  reporter  genes  whose  products 
can  be  easily  visualized  to  assess  when  and  where 
they  are  expressed  during  development.  Subse- 
quently, we  intend  to  use  the  control  elements  to 
direct  the  expression  of  other  genes  to  these  neu- 
rons with  the  aim  of  affecting  the  decisions  they 
make  during  the  process  of  forming  functional 
connections  with  target  cells. 
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THE  adrenal  gland  plays  essential  roles  in  the 
body's  ability  to  respond  to  stress.  Two  dif- 
ferent parts  of  the  gland,  an  inner  medulla  and  an 
outer  cortex,  produce  discrete  agents  of  this  re- 
sponse. The  medulla  produces  epinephrine  and 
norepinephrine,  which  are  released  very  rapidly, 
preparing  the  organism  for  immediate  physical 
activity.  In  contrast,  the  cortex  produces  steroid 
hormones,  which  are  released  more  slowly  and 
exert  prolonged  effects. 

These  adrenal  steroids  constitute  two  major 
classes:  glucocorticoids,  which  are  synthesized 
by  the  inner  zone  of  the  cortex  and  control  carbo- 
hydrate metabolism,  and  mineralocorticoids, 
which  are  made  in  the  outer  zone  and  regulate 
salt  and  water  balance.  Both  classes  of  steroid 
hormones  are  formed  from  cholesterol  by  the  se- 
quential action  of  a  related  group  of  steroido- 
genic enzymes.  Certain  of  these  enzymes,  such  as 
the  cholesterol  side-chain  cleavage  enzyme 
(SCC),  are  expressed  in  all  steroidogenic  tissues. 
In  contrast,  steroid  1 1  jS-hydroxylase  (11/3- 
OHase)  and  2 1 -hydroxylase  (21-OHase)  are  only 
expressed  in  the  adrenal  cortex  and  largely  deter- 
mine the  unique  ability  of  this  tissue  to  make  glu- 
cocorticoids and  mineralocorticoids.  The  physio- 
logical regulators  of  these  two  classes  of  adrenal 
steroids  differ  markedly  despite  the  shared  role  of 
the  enzymes  in  their  biosynthesis. 

We  are  interested  in  defining  the  events  that 
control  the  expression  of  the  adrenal  steroido- 
genic enzymes.  These  studies  have  addressed  two 
major  questions.  First,  what  mechanisms  direct 
the  expression  of  these  related  genes  within  adre- 
nocortical cells?  Second,  what  determines  the 
functional  differentiation  of  the  adrenal  cortex 
into  mineralocorticoid-  and  glucocorticoid- 
producing  zones? 

Our  studies  of  gene  regulation  have  focused  on 
the  5'-flanking  regions  of  these  genes.  This  part, 
termed  the  promoter  region,  contains  most  se- 
quences important  in  transcriptional  regulation 
of  other  genes.  We  first  showed  that  the  5'-flank- 
ing  regions  of  the  steroidogenic  enzymes  retained 
all  information  required  for  adrenal-selective  and 
hormonally  inducible  gene  expression. 


Next,  in  a  combination  of  structural  and  func- 
tional studies,  we  identified  a  protein,  termed 
steroidogenic  regulatory  protein,  that  played  a 
major  role  in  regulating  the  expression  of  the 
steroidogenic  enzymes.  This  protein  was  only 
present  in  nuclear  extracts  from  steroidogenic 
cell  lines,  suggesting  that  it  contributed  to  the 
cell-selective  expression  of  these  genes.  More- 
over, steroidogenic  regulatory  protein  appeared 
to  interact  with  the  promoter  regions  of  all  three 
steroidogenic  enzymes,  suggesting  that  it  coordi- 
nates the  expression  of  this  entire  network  of  en- 
zymes in  adrenal  cells. 

Now  that  this  major  regulatory  protein  has 
been  identified,  our  next  goal  is  to  understand 
how  it  affects  the  expression  of  21-OHase,  1 1(8- 
OHase,  and  SCC.  These  studies  require  sufficient 
amounts  of  protein  for  biochemical  analysis,  and 
we  have  therefore  initiated  studies  using  cow  ad- 
renal glands  as  a  source  of  protein.  These  experi- 
ments demonstrated  that  bovine  protein  behaved 
very  similarly  to  steroidogenic  regulatory  protein 
isolated  from  mouse  adrenocortical  tumor  cells. 
By  sequential  purification  steps,  we  have  mark- 
edly enriched  the  bovine  protein  to  a  degree  that 
should  permit  determination  of  its  amino  acid 
sequence. 

Attempts  are  also  under  way  to  raise  antibodies 
against  the  partially  purified  bovine  protein.  The 
combination  of  specific  antibodies  and  amino 
acid  sequence  data  should  allow  us  to  clone  the 
gene  encoding  this  key  regulator.  By  comparing 
the  primary  structure  of  steroidogenic  regulatory 
protein  with  that  of  previously  described  tran- 
scriptional regulatory  proteins,  such  as  those  of 
the  steroid  hormone  receptor,  we  may  gain  new 
insights  into  the  mechanisms  that  regulate  the  ad- 
renal steroidogenic  enzymes.  The  availability  of 
specific  probes  and  antibodies  to  steroidogenic 
regulatory  protein  will  further  permit  us  to  study 
the  mechanisms  that  regulate  its  expression. 
These  studies  will  provide  new  insights  into  the 
basis  for  tissue-specific  differences  in  the  produc- 
tion of  steroid  hormones.  They  may  also  increase 
our  understanding  of  the  general  mechanisms  of 
tissue-specific  gene  expression. 
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In  a  related  effort,  we  are  trying  to  define  the 
potential  role  of  the  steroidogenic  enzymes  in  hy- 
pertension. Extremely  prevalent,  hypertension 
aff^ects  approximately  20  percent  of  the  adult 
population.  As  such,  it  is  a  major  risk  factor  for 
heart  attacks  and  strokes,  the  leading  causes  of 
-death  in  developed  nations.  Although  the  under- 
lying defect  is  unknown  in  most  cases,  family 
studies  indicate  a  significant  genetic  component. 
Furthermore,  in  certain  animal  models  with  a 
more  carefully  defined  genetic  basis,  the  inheri- 
tance of  hypertension  has  been  clearly  linked  to 
differences  in  activity  of  11/3-OHase,  one  of  the 
steroidogenic  enzymes  described  above.  We  are 
investigating  the  function  of  this  protein  in  more 
detail. 

Initial  studies  implicated  a  single  lljS-OHase 
protein  in  the  biosynthesis  of  both  mineralocorti- 
coids  and  glucocorticoids.  We  now  have  defini- 
tive evidence  that  the  mouse  has  two  1 1  i8-OHase 
genes,  one  of  which  is  expressed  at  significantly 
higher  levels  than  the  other.  By  analogy  to  the  rat, 
for  which  biochemical  characterization  has  been 
more  extensive,  it  is  likely  that  the  gene  ex- 
pressed at  higher  levels  participates  in  the  forma- 
tion of  glucocorticoids  in  the  inner  zone, 
whereas  the  other  gene,  encoding  aldosterone 
synthetase,  produces  mineralocorticoids  in  the 
outer  zone. 

Although  the  two  genes  are  very  similar  in  their 
coding  regions,  which  specify  the  proteins'  com- 
position, they  differ  markedly  in  their  5'-flanking 
regions,  which  regulate  the  proteins'  production. 


We  are  now  seeking  to  define  the  precise  role  of 
these  two  genes  in  the  production  of  steroid  hor- 
mones. In  particular,  we  will  compare  and  con- 
trast the  promoter  regions  of  the  two  genes  to 
identify  both  shared  elements  and  unique  ele- 
ments responsible  for  zone-specific  expression. 

Based  on  our  preliminary  results,  we  plan  to 
use  these  promoter  regions  in  transgenic  mice  to 
target  gene  expression  specifically  to  either  the 
outer  or  inner  cortical  zones.  Initially  we  are  us- 
ing the  two  promoters  to  direct  the  expression  of 
renin,  a  gene  previously  shown  to  cause  genetic 
hypertension  when  expressed  in  multiple  tissues 
of  transgenic  animals.  If  successful,  these  experi- 
ments will  validate  the  zone-specific  expression 
of  the  two  promoter  regions  and  will  establish 
that  adrenal  expression  of  renin  is  relevant  to  the 
hypertensive  state. 

Next,  we  want  to  express  the  aldosterone  syn- 
thetase gene  in  the  inner,  glucocorticoid-produc- 
ing  zone.  As  discussed  above,  this  gene  normally 
produces  mineralocorticoids  in  the  outer  zone. 
We  have  prepared  a  hybrid  gene  with  the  inner- 
zone-specific  promoter  driving  expression  of  the 
aldosterone  synthetase  gene.  We  anticipate  that 
this  hybrid  gene  will  synthesize  large  amounts  of 
mineralocorticoids,  thus  creating  a  genetic  form 
of  hypertension.  Moreover,  treatment  with  glu- 
cocorticoids, which  will  suppress  expression  of 
the  hybrid  gene  in  the  inner  zone,  should  allevi- 
ate the  hypertension.  This  model  may  therefore 
mimic  a  subset  of  human  patients  with  glucocor- 
ticoid-responsive  hypertension. 
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MY  laboratory  is  currently  studying  the  inter- 
actions at  the  molecular  level  between  the 
nervous  and  immune  systems.  These  analyses  are 
focused  on  the  roles  of  certain  receptors  in  cellu- 
lar differentiation  and  signal  transduction  and, 
more  recently,  on  the  organization  of  other 
unique  cell  surface  components.  Projects  that  are 
under  wacy  involve  the  expression  of  tachykinin 
receptors  in  cloned  lymphocyte,  epithelial,  and 
neuronal  cell  lines  and  an  analysis  of  their  signal- 
ing pathways.  In  addition,  we  are  studying  the 
molecular  biology  of  synaptogenesis,  in  particu- 
lar the  function  of  the  molecule  agrin. 

With  Richard  Scheller  (HHMI)  and  Fabio  Rupp 
at  Stanford  University,  I  have  worked  on  the  clon- 
ing and  structural  analysis  of  the  agrin  molecule 
from  the  rat.  Agrin  belongs  to  a  family  of  extracel- 
lular synaptic  organizing  molecules  that  have 
been  shown  to  cause  the  aggregation  of  acetyl- 
choline receptors  and  acetylcholinesterase  on 
the  surface  of  regenerating  and  developing  mus- 
cle fibers.  A  cDNA  library  constructed  from 
poly  (A)  ^  RNA  of  embryonic  rat  spinal  cord  was 
screened,  and  a  number  of  inserts  were  isolated 
that  predicted  amino  acid  sequences  similar  to 
those  of  agrin  identified  in  the  Torpedo  electric 
organ. 

A  number  of  these  clones  have  now  been  se- 
quenced. We  have  constructed  fusion  proteins  in 
order  to  raise  specific  antibodies,  and  we  are 
studying  the  expression  of  the  message  for  agrin 
in  various  tissues  during  the  development  of  the 
rat.  The  deduced  sequence  contains  several  epi- 
dermal growth  factor  (EGF)-like  domains  and  re- 
gions similar  to  the  trypsin  inhibitor  enzyme. 
Moreover,  Northern  blot  analysis  reveals  a  mes- 
sage of  about  8.2  kb  in  embryonic  brain  and  spi- 
nal cord,  which  in  the  adult  decreases  about  10- 
fold  and  is  only  detectable  in  the  cord.  Current 
efforts  are  focused  on  expressing  agrin  in  a  num- 
ber of  cell  lines  and  studying  its  receptor-cluster- 


ing activity.  In  the  future  we  will  analyze  its  func- 
tionally active  domains,  possible  binding  sites, 
and  cellular  distribution  in  embryonic  and  adult 
rat  tissues. 

With  information  derived  from  the  cloning  of 
the  rat  substance  P  (SP)  receptor  (SPR) ,  my  labo- 
ratory is  studying  the  expression  and  the  signal- 
ing properties  of  the  SPR  in  a  number  of  cell 
lines.  We  have  demonstrated  that  the  SPR  signals 
simultaneously  by  both  the  cAMP  and  IP3  (inosi- 
tol 1 ,4,5,-trisphosphate)  pathways. 

With  J.  Sudduth-Klinger  and  C.  Christian,  I 
have  transfected  the  SPR  cDNA  into  Jurkat  lym- 
phocytes and  PCI 2  cells  and  established  stable 
clones  that  express  functional  SPR.  The  initial 
characterization  of  the  lines  has  been  carried  out. 
Jurkat-SPR  cells  will  be  studied  to  see  if,  when 
stimulated  by  SP,  they  alter  the  expression/pro- 
duction  of  specific  cytokines  or  their  receptors. 
Moreover,  the  coordinate  regulation  of  CDjo  (the 
cell  surface  neutral  endopeptidase  that  deacti- 
vates SP)  in  Jurkat-SPR  will  be  examined.  We  will 
study  whether  CDjo  activation  or  inactivation  by 
thiorphan  (a  neutral  endopeptidase  inhibitor) 
changes  SP  responses.  In  addition,  we  will  also 
study  whether  CD  10  coprecipitates  with  the  SPR. 
Preliminary  immunohistochemical  studies  from 
rat  lung  and  gut  suggest  that  the  SPR  and  CD  10  are 
colocalized.  These  experiments  will  for  the  first 
time  allow  us  to  elucidate  the  effects  of  SP  on 
immune  responses  at  a  unicellular  level  in  a  well- 
defined  system. 

I  will  study  with  Mark  Gilben  how  the  SPR  in- 
fluences PC  1 2  cells  before  and  after  differentia- 
tion with  nerve  growth  factor  (NGF) .  Functional 
studies  are  under  way  to  examine  whether  SPR 
expression  alters  growth  characteristics  and  mor- 
phology of  the  cells.  Experiments  from  other  lab- 
oratories done  with  primary  cultures  have  shown 
that  '^'l-SP  binding  is  located  in  the  growth 
cones,  suggesting  the  receptor  may  play  a  role  in 
certain  aspects  of  neuronal  development. 
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IMMUNE  recognition  of  potentially  injurious 
foreign  macromolecules  requires  the  elabora- 
tion of  an  enormous  repertoire  of  clonally  re- 
stricted receptors  (antigen  receptors)  on  the  sur- 
faces of  lymphoid  cells.  These  receptors  are 
sufficiently  heterogeneous  to  permit  recognition 
of  virtually  the  entire  universe  of  infectious  or- 
ganisms. Interaction  of  these  receptors  with  cog- 
nate antigen  provokes  a  stereotyped  response 
leading  to  cell  proliferation  and  the  production 
of  soluble  mediators  of  inflammation.  During  the 
past  several  years,  analysis  of  the  mechanisms  re- 
sponsible for  antigen  receptor  diversification  has 
stimulated  interest  in  a  related  question:  How  is 
the  signal  from  a  lymphocyte  antigen  receptor 
transmitted  to  the  cell  interior?  Our  laboratory 
has  adopted  a  molecular  genetic  approach  to  the 
dissection  of  signaling  pathways  in  immune  cells. 

Initially  we  identified  a  lymphocyte-specific 
en2yme  that  is  similar  in  structure  to  proteins 
known  to  transmit  growth-promoting  signals  in 
nonlymphoid  cells.  This  protein  kinase  modifies 
the  behavior  of  target  proteins  by  catalyzing  the 
addition  of  phosphate  groups  onto  specific  tyro- 
sine amino  acids  in  substrate  proteins.  The  gene, 
Ick,  that  encodes  this  lymphocyte-specific  kinase 
was  identified  by  virtue  of  its  overexpression  in  a 
murine  lymphoid  malignancy.  Moreover,  we 
were  able  to  demonstrate  that  a  single  point  mu- 
tation in  the  Ick  gene  was  sufficient  to  unmask  its 
ability  to  deregulate  the  growth  of  cells  grown  in 
tissue  culture.  Thus  the  Ick  gene  encodes  a  pro- 
tein that  is  capable  of  altering  the  growth  proper- 
ties of  at  least  some  cell  types.  Since  Ick  is  nor- 
mally expressed  only  in  lymphocytes,  there  is 
reason  to  believe  that  its  product  assists  in  regu- 
lating lymphocyte  proliferation. 

Biochemical  studies  support  this  view.  In  par- 
ticular, we  and  others  have  recently  demon- 


strated that  the  /c>fe-encoded  kinase  is  physically 
associated  with  proteins  that  form  part  of  the  an- 
tigen receptor  on  T  lymphocytes.  Our  studies 
also  enabled  us  to  identify  two  additional  protein 
tyrosine  kinases  that  are  specifically  expressed  in 
immune  cells.  In  each  case  there  is  reason  to  be- 
lieve that  the  kinase  is  physically  coupled  to  a 
cell  surface  receptor  involved  in  immune 
recognition. 

To  investigate  the  functional  importance  of 
these  protein  tyrosine  kinase  signaling  elements, 
we  have  developed  methods  for  manipulating  the 
expression  of  each  gene  in  its  appropriate  cellu- 
lar context.  Using  this  approach,  we  have  begun 
to  dissect  the  hierarchy  of  signal  transduction 
events  precipitated  by  normal  immune  recogni- 
tion. For  example,  we  found  that  overexpression 
of  an  activated  Ick  gene  leads  to  extraordinarily 
rapid  development  of  thymic  tumors  in  mice. 
Hence  altered  expression  of  the  Ick  gene  can  di- 
rectly affect  lymphocyte  proliferation.  In  a  re- 
lated series  of  experiments  we  learned  that  aug- 
mented expression  of  a  protein  kinase  encoded 
by  the  fyn  gene  produced  lymphocytes  that  man- 
ifest 10-fold  more  vigorous  responses  to  antigen 
stimulation  than  do  normal  cells.  Interference 
with  this ^n-controlled  pathway  yields  cells  that 
are  refractory  to  any  antigenic  challenge. 

Using  these  and  other  experimental  strategies, 
we  hope  to  deduce  eventually  a  "wiring  dia- 
gram" that  will  adequately  describe  signal  trans- 
duction mechanisms  employed  by  lymphocyte 
antigen  receptors.  Disturbances  in  lymphocyte 
signaling  almost  certainly  contribute  to  the  patho- 
genesis of  lymphoproliferative  and  immunodefi- 
ciency diseases  in  humans.  A  detailed  under- 
standing of  immune  cell  signaling  mechanisms 
should  permit  the  design  of  more  effective  thera- 
peutic strategies  for  the  treatment  of  immune  sys- 
tem dysfunction. 
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CONSIDERING  the  number  of  cells  in  an  or- 
ganism and  the  seemingly  infinite  possibili- 
ties for  developmental  errors,  it  is  amazing  that 
individuals  retain  their  characteristic  shapes. 
Cells  of  specific  tissues  interact  at  multiple  stages 
of  development,  and  their  communication  is  vital 
to  the  establishment  of  patterns.  Unraveling  the 
mysteries  of  the  intercellular  communication 
processes  is  the  interest  of  our  laboratory. 

We  are  studying  these  processes  in  Drosoph- 
ila, since  mutations  are  easy  to  generate,  known 
mutant  phenotypes  are  plentiful,  and  techniques 
for  cloning  and  transferring  genes  are  well  estab- 
lished. Our  goal  is  to  identify  and  characterize, 
using  genetic  and  molecular  techniques,  the  com- 
ponents of  specific  signal  transduction  pathways 
implicated  in  the  control  of  pattern  formation. 

Cell  fate  along  the  anterior-posterior  egg  axis  is 
determined  by  three  groups  of  maternally  ex- 
pressed genes:  those  of  the  anterior,  posterior, 
and  terminal  class  systems.  We  are  investigating 
the  terminal  class  system,  which  controls  cell  fate 
at  both  ends  of  the  Drosophila  embryo.  In  em- 
bryos lacking  maternal  terminal  class  functions, 
both  head  and  tail  structures  are  deleted. 

The  activities  generated  by  the  terminal  class 
system  are  believed  to  be  propagated  through  a 
signal  transduction  mechanism  that  involves  a 
phosphorylation  cascade.  The  current  model  is 
that  the  putative  transmembrane  tyrosine  kinase 
receptor,  encoded  by  the  gene  torso,  is  activated 
only  at  the  egg  termini.  This  localized  activation 
of  torso  then  triggers  a  phosphorylation  cascade 
that  culminates  in  transcription  of  the  zygotic  ter- 
minal gap  genes  tailless  and  huckebein,  which 
are  known  to  encode  transcription  factors. 

Our  molecular  and  genetic  analysis  of  the  ter- 
minal class  system  focuses  on  the  proteins  in- 
volved in  transducing  the  signal  from  torso's 
membrane-bound  kinase  to  the  nucleus,  and  thus 
far  we  have  characterized  two  genes,  l(l)pole 
hole  and  l(  1 )  corkscrew.  We  have  shown  by  ge- 
netic epistasis  experiments  that  both  act  down- 
stream of  the  torso  protein  activity.  The  molec- 
ular characterization  of  l(l)pole  hole  has 
strengthened  the  current  model  for  transfer  of  the 


torso  signal,  since  the  1(1  )pole  hole  gene  prod- 
uct is  the  homologue  of  the  mammalian  Raf-1 
proto-oncogene,  which  encodes  a  serine/thre- 
onine  kinase.  Molecular  characterization  of 
l(  1  Jcorkscrew,  currently  in  progress,  will  pro- 
vide more  insight  into  this  signal  transduction 
pathway. 

To  identify  other  molecules  involved  in  the 
1(1  )pole  hole  signal  pathway,  we  have  under- 
taken two  genetic  approaches  to  screen  for  sec- 
ond-site suppressors  and  enhancers.  In  the  first 
type  of  screen,  we  have  used  a  mutation  in  the 
1(1  )pole  hole  gene  that  reduces  its  level  of  activ- 
ity. Progeny  with  this  gene  are  not  viable.  They 
can  live,  however,  in  the  presence  of  a  second- 
site  suppressor.  We  have  conducted  extensive  ge- 
netic screens  to  search  for  these  suppressors  and 
have  recovered  a  number  of  them.  Presumably, 
these  suppressors  identify  proteins  involved  in 
the  1(1  )pole  hole  signal  transduction  pathway 
that  are  able  to  increase  or  bypass  the  1(1  )pole 
hole  activity.  Future  work  will  encompass  a  de- 
tailed characterization  of  these  suppressors. 

In  the  second  type  of  screen,  we  will  search  for 
second-site  modifiers  that  affect  the  activity  level 
of  an  l(l)pole  hole  gain-of-function  mutation. 
Since  such  hyperactive  gain-of-function  l(  1  )pole 
hole  mutations  have  not  been  recovered  by  con- 
ventional genetic  techniques  (presumably  they 
are  lethal  to  the  fly),  we  decided  to  "switch  on" 
an  activated  form  of  l(  1  )pole  hole  in  nonvital  or- 
gans of  the  adult  fly  to  generate  a  visible  domi- 
nant phenotype.  Using  the  activation  properties 
of  the  yeast  GAL4  protein,  which  activates  only 
those  genes  bearing  a  GAL4-binding  site  within 
their  promoters,  we  were  able  to  express  a  hyper- 
active 1(1  )pole  hole-modified  protein  in  the 
fly's  eye  imaginal  disc.  We  are  using  the  resultant 
dominant  eye  phenotype  to  screen  for  mutations 
that  suppress  the  phenotype.  Presumably  the 
suppressors  will  identify  proteins  that  can  reduce 
the  level  of  l(  1  )pole  hole  activity. 

We  believe  that  such  experiments  will  ulti- 
mately permit  genetic  isolation  of  more  compo- 
nents required  in  the  l(l)pole  hole  signal  trans- 
duction pathway.  Although  our  primary  objective 
is  to  understand  the  role  these  molecules  play  in 
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Drosophila,  our  work  may  be  directly  relevant  to 
an  understanding  of  development  in  other  organ- 
isms as  well.  The  high  levels  of  homology  be- 
tween torso  and  the  gene  for  platelet-derived 


growth  factor  (PDGF)  and  between  l(  1  )pole 
hole  and  Raf- 1  suggest  that  molecules  interacting 
with  l(  1  )pole  hole  might  be  conserved  in  the 
Raf-1  pathway  of  higher  vertebrates. 
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SOME  years  ago  we  described  a  variant  of  the 
genetic  disorder  called  the  bare  lymphocyte 
syndrome  (BLS),  in  which  the  patient's  lympho- 
cytes fail  to  express  either  class  I,  class  II,  or  both 
major  histocompatibility  (MHC)  determinants  on 
their  cell  surfaces.  These  transplantation  antigens 
are  essential  for  the  development  of  the  immune 
system,  for  tumor  surveillance,  for  eradication  of 
viral  infections,  and  for  normal  immune  re- 
sponses. Thus  it  is  not  surprising  that  BLS  patients 
are  severely  immunocompromised  or  fail  to  make 
antibodies  or  have  autoimmune  diseases.  In  addi- 
tion, this  autosomal  recessive  syndrome  is  one  of 
the  two  known  inherited  deficiencies  of  a  regula- 
tory gene  in  humans. 

By  fusing  in  tissue  culture  defective  cells  from 
different  patients  and  those  obtained  by  muta- 
genesis, four  genetic  complementation  groups  of 
BLS  were  found.  The  isolation  of  their  defective 
genes  should  make  possible  prenatal  diagnoses 
through  use  of  specific  genetic  probes  and  might 
lead  to  the  cure  of  BLS  by  the  targeting  of  normal 
genes  into  the  bone  marrow  of  affected  patients. 

To  study  the  defective  gene  in  BLS,  we  first  ex- 
amined regions  that  regulate  B  cell-specific  and 
interferon-7  (IFN-7) -inducible  expression  of 
class  II  genes.  Next,  we  looked  at  proteins  that 
bind  to  these  DNA  sequences  and  compared  class 
Il-specific  factors  in  various  cell  types.  Distinct 
patterns  of  DNA-binding  proteins  were  found  in  B 
cells,  IFN-7-inducible  cells,  and  T  cells. 

We  cloned  several  cDNAs  encoding  proteins 
that  bind  to  B  cell-specific  and  IFN-7-inducible 
sequences  in  class  II  promoters.  One  cDNA  codes 
for  JUN,  which  forms  active  JUN/FOS  hetero- 
dimers  in  cells  that  do  not  express  class  II  deter- 
minants. The  remaining  two  cDNAs  encode  a  B 
cell-specific  helix-loop-helix  protein,  NF-IL6, 
and  an  ETS-like  protein.  By  expressing  one  of 
these  full-length  cDNAs  in  human  cells,  we  hope 
to  rescue  class  II  gene  expression  in  one  type  of 
BLS.  Besides  direct  biochemical  studies,  we  are 
also  using  genetic  approaches  to  rescue  regula- 
tory defects  in  this  disease. 

In  setting  up  these  genetic  approaches,  we  first 


tested  a  well-known  viral  trans-regulatory  system 
— namely,  trans-activation  of  the  human  immuno- 
deficiency virus  (HFV)  by  the  virally  encoded 
TAT  protein.  The  precise  mechanism  of  TAT  ac- 
tion had  not  been  defined.  We  discovered  that 
TAT  acts  slightly  downstream  from  the  promoter 
to  modify  HIV  transcription.  Factors  assembled 
near  the  site  of  initiation  of  HFV  transcription 
bring  the  transcription  complex  to  the  promoter. 
The  addition  of  TAT,  which  binds  to  an  RNA  stem- 
loop  in  the  process  of  nascent  transcription,  re- 
leases this  transcription  complex,  and  efficient 
elongation  of  transcription  and  clearance  of  the 
promoter  follow.  New  transcription  complexes 
can  then  assemble,  interact  with  TAT,  and  move 
quickly  through  the  viral  genome.  Interactions 
between  TAT,  the  RNA  stem-loop,  and  cellular 
proteins  have  been  defined.  We  hope  that  inter- 
fering with  trans-activation  by  TAT  will  lead 
to  new  therapies  for  AIDS  (acquired  im- 
mune deficiency  syndrome)  and  AIDS-related 
disorders. 

Since  upstream  promoter  sequences  are  also 
essential  for  HFV  replication,  we  clarified  inter- 
actions between  host  cell  factors  and  viral  se- 
quences (long  terminal  repeat,  LTR).  Increased 
rates  of  initiation  of  HFV  transcription  were  ob- 
served in  activated  T  cells  and  macrophages. 
These  result  from  actions  of  nuclear  proteins  that 
are  also  required  for  T  cell  and  macrophage  ef- 
fector functions  and  for  T  cell  proliferation. 

Differences  between  LTRs  of  HFV  types  1  and  2 
were  observed  that  might  explain  the  longer  la- 
tency and  attenuated  clinical  course  of  HIV-2  in- 
fection. Furthermore,  effects  of  trans-activators 
encoded  by  several  DNA  viruses  on  HIV  transcrip- 
tion were  examined.  Since  effects  of  these  ago- 
nists and  TAT  were  synergistic,  infection  by  DNA 
viruses  might  be  an  important  cofactor  in  progres- 
sion from  latent  disease  to  clinical  AIDS. 

Thus  the  quest  to  rescue  the  expression  of  class 
II  MHC  genes  in  a  rare  human  congenital  disease 
has  led  to  genetic  approaches  to  the  study  of  tran- 
scription by  RNA  polymerase  II  and  to  the  eluci- 
dation of  a  potent  new  transcriptional  regulatory 
mechanism. 
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A number  of  low-molecular-weight  polypep- 
tides have  been  shown  to  regulate  cell 
growth.  These  growth  factors  bind  to  specific  re- 
ceptors on  the  surface  of  cells.  Through  a  com- 
plex series  of  reactions,  the  binding  of  the  growth 
factor  to  its  receptor  stimulates  the  cell  to  grow 
and  divide.  Until  recently,  little  was  known  of  the 
mechanism  by  which  growth  factors  induce  cell 
division.  It  is  now  recognized,  however,  that  the 
receptors  not  only  bind  the  appropriate  growth 
factor  but  also  have  an  enzymatic  activity. 

The  receptors  catalyze  the  transfer  of  a  phos- 
phate group  from  adenosine  triphosphate  to  tyro- 
sine residues  in  selected  protein  substrates.  The 
enzyme  effecting  this  activity  is  called  a  tyrosine 
protein  kinase.  Typically,  the  phosphorylation  of 
a  protein  by  a  kinase  leads  to  changes  in  the  activ- 
ity of  the  protein.  Although  much  is  known  about 
the  groMTih  factor  receptor  kinases,  the  substrates 
for  these  enzymes  have  not  been  identified. 

My  laboratory  is  involved  in  studies  of  the 
mechanism  by  which  the  binding  of  epidermal 
grov^Tih  factor  (EGF)  to  the  outside  of  the  cell 
elicits  a  biological  response  inside.  This  is  re- 
ferred to  as  signal  transduction.  The  model  sys- 
tem used  in  our  studies  employs  the  A43 1  line  of 
human  epidermal  carcinoma  cells,  which  are 
highly  responsive  to  EGF. 

Phosphatidylinositol  Metabolism 

One  of  the  earliest  responses  of  A431  cells  to 
EGF  is  an  increase  in  the  metabolism  of  a  particu- 
lar phospholipid,  phosphatidylinositol.  This  im- 
portant lipid  serves  as  a  precursor  for  the  genera- 
tion of  two  intracellular  compounds  that  activate 
various  enzymes  and  thereby  mediate  the  effects 
of  EGF  within  the  cell.  One  of  the  enzymes  in- 
volved in  phosphatidylinositol  metabolism  is 
phosphatidylinositol  kinase.  Since  this  enzyme  is 
stimulated  by  EGF,  it  represents  a  potential  sub- 
strate for  phosphorylation  by  the  EGF  receptor 
tyrosine  protein  kinase. 

We  have  purified  the  phosphatidylinositol  ki- 
nase from  both  A431  cells  and  human  placenta. 
This  55-kDa  enzyme  is  active  as  a  monomer — 
that  is,  as  a  single  polypeptide  chain.  It  phos- 


phorylates  phosphatidylinositol  on  the  4  position 
of  the  inositol  ring  and  hence  is  distinct  from  an- 
other phosphatidylinositol  kinase  that  phosphor- 
ylates  the  ring  on  the  3  position.  Preliminary  data 
suggest  that  in  membranes  and  whole  cells  this 
activity  is  regulated  via  phosphorylation,  and  a 
protein  kinase  has  been  identified  that  phosphor- 
ylates  and  activates  the  phosphatidylinositol  ki- 
nase in  vitro.  Work  is  under  way  to  characterize 
this  event. 

Another  enzyme  involved  in  phosphatidylino- 
sitol metabolism  is  a  phosphatidylinositol  mono- 
phosphate phosphatase.  This  enzyme  catalyzes 
the  reverse  of  the  reaction  catalyzed  by  the  phos- 
phatidylinositol kinase — that  is,  it  removes  the 
phosphate  from  the  4  position  of  the  inositol  ring 
on  phosphatidylinositol  monophosphate.  Al- 
though this  enzyme  has  not  been  studied  previ- 
ously, its  position  in  the  metabolic  pathway  of 
phosphatidylinositol  suggests  it  may  be  impor- 
tant in  the  overall  regulation  of  the  pathway.  We 
have  identified  its  activity  in  A43 1  cells  and  have 
characterized  it  with  respect  to  its  kinetic  proper- 
ties, substrate  specificity,  and  response  to  various 
inhibitors.  We  have  purified  the  enzyme  to  a  high 
degree  and  have  shown  that  it  is  a  140,000-Da 
glycoprotein.  Unlike  other  enzymes  in  this  path- 
way, this  enzyme  appears  to  have  an  extracellular 
domain.  The  possibility  that  the  activity  of  this 
enzyme  is  regulated  through  the  binding  of  an 
extracellular  mediator  is  being  investigated. 

It  has  recently  been  postulated  that  the  inositol 
phospholipids  may  function  in  the  regulation  of 
cell  shape  via  interactions  with  proteins  that  con- 
trol the  polymerization  of  actin.  Using  EGF  and 
A43 1  cells,  we  have  shown  that  there  is  no  corre- 
lation between  EGF-induced  changes  in  cell 
shape  and  changes  in  the  levels  of  the  classical 
phosphoinositides,  phosphatidylinositol  4-phos- 
phate  and  phosphatidylinositol  4,5-bisphos- 
phate.  Further  experiments  are  in  progress  to  de- 
termine whether  EGF-stimulated  changes  in  cell 
shape  can  be  attributed  to  the  generation  of  novel 
inositol  phospholipids  or  is  due  to  the  ability  of 
the  EGF  receptor  to  catalyze  the  phosphorylation 
of  cytoskeletal  proteins. 


357 


Mechanism  of  Action  of  Polypeptide  Growth  Factors 


Desensitization  of  the  EGF  Receptor 

When  A43 1  cells  are  treated  with  large  doses  of 
EGF,  washed,  and  subsequently  rechallenged 
with  EGF,  they  fail  to  respond  to  the  growth  fac- 
tor. This  phenomenon  is  known  as  desensitiza- 
tion. Our  studies  have  shown  that  when  the  EGF 
jxceptor  becomes  desensitized,  it  is  no  longer  in- 
ternalized into  the  cells,  and  EGF  no  longer  stimu- 
lates phosphatidylinositol  metabolism.  This  EGF- 
induced  desensitization  is  specific  for  the  EGF 
receptor,  as  the  responsiveness  of  other  receptors 
is  not  decreased  after  EGF  treatment. 

The  EGF  receptor  itself  is  a  monomeric  pro- 
tein, a  single  chain.  Upon  binding  of  EGF  to  its 
receptor,  two  of  the  receptor  monomers  come 
together  to  form  an  EGF  receptor  dimer.  This 
dimer  is  the  form  that  is  active  in  signal  transduc- 
tion. Desensitized  EGF  receptors  cannot  undergo 
this  EGF-induced  dimerization.  As  a  result,  they 


cannot  transduce  the  hormonal  signal  to  the  inte- 
rior of  the  cell. 

Evidence  suggests  that  the  desensitization  of 
the  EGF  receptor  results  from  its  phosphorylation 
by  a  protein  kinase.  We  have  identified  a  protein 
kinase  in  A431  cell  cytosol  that  is  activated  by 
EGF  and  appears  to  be  involved  in  receptor  de- 
sensitization. The  kinase  catalyzes  the  phosphor- 
ylation of  the  EGF  receptor  in  vitro.  Consistent 
with  what  has  been  observed  in  whole  cells, 
phosphorylation  of  EGF  receptor  monomers  by 
this  kinase  leads  in  vitro  to  an  inhibition  of  the 
ability  of  the  phosphorylated  monomers  to  di- 
merize.  The  kinase  phosphorylates  the  EGF  re- 
ceptor on  a  serine  residue  in  the  second  half  of 
the  receptor  molecule.  Using  techniques  of  mo- 
lecular biology,  we  plan  to  alter  this  site  and 
study  the  effect  of  this  mutation  on  the  function 
of  the  EGF  receptor. 
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The  structure  of  adenosine  deaminase  complexed  with  6-hydroxyl- 1, 6-dihydropurine  ribonucleo- 
side,  a  nearly  ideal  analogue  of  the  transition  state  of  catalysis,  and  the  zinc  cofactor,  as  deter- 
mined by  crystallographic  technique.  Adenosine  deaminase  plays  a  central  role  in  maintaining 
immune  competence;  impairment  of  its  catalytic  activity  is  associated  with  severe  combined 
immunodeficiency  (SCID),  affecting  both  T  and  B  lymphocytes. 
Research  of  David  K.  Wilson  and  Florante  A.  Quiocho. 
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FORMATION  of  complexes  between  proteins 
and  their  ligands  is  the  basis  of  biological 
specificity  and  activity.  Our  long-term  goal  is  to 
elucidate  the  atomic  interactions  between  pro- 
teins and  ligands  in  a  variety  of  biological  sys- 
tems. While  x-ray  crystallography  is  our  primary 
approach,  we  also  do  correlative  studies  em- 
ploying biochemical  and  recombinant  DNA 
techniques. 

Adenosine  Deaminase 

Adenosine  deaminase  (ADA),  present  in  vir- 
tually all  mammalian  cells,  catalyzes  the  irrevers- 
ible deamination  of  adenosine  to  inosine.  The 
enzyme  has  a  central  role  in  the  normal  develop- 
ment of  the  immune  response,  and  its  genetic  ab- 
sence leads  to  severe  combined  immunodefi- 
ciency disease  (SCID).  It  is  also  specifically 
involved,  or  its  levels  are  changed,  in  a  variety  of 
diseases,  including  AIDS  (acquired  immune  defi- 
ciency syndrome) ,  anemia,  and  various  lympho- 
mas and  leukemias.  It  also  appears  to  modulate 
synaptic  transmission. 

We  have  previously  obtained  crystals  of  ADA  in 
the  presence  of  purine  ribonucleoside,  a  sub- 
strate analogue.  Employing  x-ray  crystallographic 
techniques,  we  have  recently  determined  and  re- 
fined the  three-dimensional  structure  of  ADA  at 
2.4  A  resolution.  The  deaminase  has  a  parallel 
(ai8)8  barrel  structure.  In  the  course  of  the  struc- 
ture refinement,  we  discovered  that  ADA  contains 
a  zinc  cofactor  and  that  the  bound  ligand  is  6-hy- 
droxyl-l,6-dihydropurine  ribonucleoside  (HDPR), 
a  nearly  ideal  transition-state  analogue,  instead  of 
the  substrate  analogue.  The  zinc  and  HDPR  are 
bound  and  sequestered  in  a  deep  pocket  located 
at  the  carboxyl-terminal  end  of  the  |8  barrel. 

The  potent  transition-state  analogue  (inhibi- 
tion constant  estimated  at  10"'^  M)  is  held  tightly 
in  place  by  the  coordination  of  the  6-hydroxyl  to 
the  zinc  ion  and  by  the  formation  of  nine  hydro- 
gen bonds  with  seven  residues.  The  enzyme  pref- 
erentially recognizes  the  6R  distereomer  of  the 
ribonucleoside. 

The  discovery  of  the  bound  zinc  and  transition- 


state  analogue  has  enabled  us  to  understand  in 
great  detail  the  mode  of  action  of  ADA.  We  have 
also  been  able  to  provide  a  molecular  explana- 
tion of  the  mutations  of  the  enzyme  that  result  in 
SCID.  Inherited  deficiency  of  ADA  accounts  for 
about  one-third  of  SCID  cases.  The  sequences  of 
eight  point  mutations  of  ADA  from  five  cell  lines 
derived  from  affected  patients  have  been  re- 
ported. All  but  two  of  these  mutations  occur 
close  to  an  active-site  pocket  or  to  peptide  seg- 
ments that  line  it.  Consequently,  the  deleterious 
effects  of  the  six  mutations  can  be  ascribed  to 
structural  changes  that  affect  the  active-site 
geometry. 

Knowledge  of  the  ADA  structure  paves  the  way 
for  determining  the  structures  of  mutants,  some 
mimicking  those  related  to  deaminase  defi- 
ciency, and  other  complexes  with  various  ana- 
logues and  inhibitors.  We  have  also  obtained 
crystals  of  the  deaminase  in  the  presence  of  two 
other  analogues  (one  currently  being  used  for 
chemotherapy  in  certain  leukemias)  and  seven 
compounds  with  various  therapeutic  uses, 
such  as  sedatives,  anxiolytics,  analgesics,  and 
relaxants. 

The  High  Specificity  of  a  Phosphate 
Transport  System 

Phosphorus  in  the  form  of  orthophosphate  is 
probably  one  of  the  most  essential  nutrients  for 
cell  life.  The  transport  of  phosphate  into  cells 
and  organelles  (mitochondria,  chloroplasts,  etc.) 
is  highly  specific.  Obtaining  the  extremely  well- 
refined  1.7-A  structure  of  the  liganded  form  of 
phosphate-binding  protein  has  revealed  for  the 
first  time  the  atomic  features  responsible  for  the 
exquisite  high  selectivity  for  phosphate  and  the 
exclusion  of  sulfate.  The  phosphate,  which  is  to- 
tally dehydrated  and  buried  in  the  cleft  between 
the  two  domains  of  the  protein,  is  held  in  place 
by  a  total  of  1 2  hydrogen  bonds  formed  with  pro- 
tein groups,  one  of  which  is  the  carboxylate 
group  of  an  Asp  residue.  This  residue  is  responsi- 
ble for  the  recognition  of  a  proton  on  the 
phosphate. 
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Structure  and  Recognition  of  Epitopes 

Knowledge  of  the  active  conformation  of  a 
dominant  neutralizing  epitope  (antigenic  deter- 
minant group)  is  essential  to  understanding  the 
molecular  basis  of  the  immunogenicity  of  the 
segments  and  to  obtaining  possible  leads  for  vac- 
cine design.  Recombinant  technology  is  being 
used  to  insert,  without  loss  of  immunogenicity, 
foreign  epitopes,  especially  of  proteins  with  un- 
known structures,  onto  the  surface  of  the  malto- 
dextrin-binding  protein  of  Escherichia  coli,  in 
sites  known  to  be  permissive  to  changes.  The 
structure  of  the  protein  containing  such  an  epi- 
tope provides  information  about  the  epitope's 
configuration  and  how  its  immunogenicity  is  pre- 
served. Currently  under  investigation  are  epi- 
topes of  type  1  poliovirus  and  HIV  (human  immu- 
nodeficiency virus)  gpl20,  the  coat  protein  of 
the  AIDS  virus. 

Antibody- Antigen  Interaction 

We  have  previously  reported  the  crystalliza- 
tion of  the  Fab  fragment  of  a  monoclonal  anti- 
body raised  against  a  dominant  neutralizing  deter- 
minant of  gpl20.  Diffraction  data  from  these 
crystals,  out  to  2.8  A  resolution,  have  been  col- 
lected by  means  of  a  synchrotron  facility.  At- 
tempts to  determine  the  structure,  using  molecu- 
lar replacement  techniques,  are  under  way.  X-ray 
structure  determination  should  reveal  details  of 
molecular  recognition  of  an  antigen  and  impor- 
tant facets  of  gpl20  that  are  amenable  to  anti- 
body intervention. 

We  have  determined  the  structure  of  the  Fab 
fragment  of  an  antibody  against  cell  surface  poly- 


saccharide antigen  of  Shigella  flexneri  in  com- 
plex with  tri-  and  pentasaccharide-containing  de- 
terminants of  the  O-antigen  serotype  of  the 
bacteria.  Determination  of  the  structures  of  these 
crystals  is  in  progress. 

Molecular  Recognition  of  Carbohydrates, 
Charged  Substrates,  and  Calcium 

The  family  of  binding  proteins  that  serve  as  ini- 
tial receptors  for  bacterial  active  transport  and 
chemotaxis  is  an  excellent  system  in  which  to 
study  the  molecular  recognition  of  a  variety  of 
ligands  and  structure-function  relationships.  To 
date,  we  have  determined  and  refined  at  better 
than  1 .9  A  resolution  the  structures  of  seven  dif- 
ferent periplasmic  receptor  proteins:  those 
binding  L-arabinose,  D-galactose/o-glucose,  mal- 
todextrin,  leucine/isoleucine/valine,  leucine 
alone,  sulfate,  and  phosphate.  Using  site-directed 
mutagenesis,  coupled  with  analysis  of  substrate 
binding  to  mutant  proteins  by  crystallographic 
and  other  techniques,  we  are  beginning  to  dissect 
various  aspects  of  the  molecular  recognition  of 
carbohydrates,  charged  ligands  such  as  sulfate 
and  phosphate,  and  calcium  ion. 

A  method  different  from  site-directed  muta- 
genesis for  probing  the  contribution  of  specific 
hydrogen  bonds  in  protein-carbohydrate  interac- 
tions involves  specific  modifications  of  the  li- 
gand.  The  binding  of  deoxy  and  fluorodeoxy  su- 
gar analogues  to  the  arabinose-binding  protein 
has  been  analyzed  by  equilibrium  and  very  high 
resolution  crystallographic  techniques.  The  com- 
plexes have  structures  virtually  identical  to  the 
wild  type  and  reflect  more  realistically  the  hydro- 
gen bond  interaction  being  probed. 
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OLFACTION  is  among  the  oldest  of  the  sen- 
sory systems.  All  multicellular  and  many 
unicellular  organisms  have  evolved  sensitive 
chemosensory  systems  able  to  detect  and  identify 
natural  chemical  substances.  The  olfactory  sys- 
tem of  vertebrates  and  analogous  systems  for  the 
other  senses — vision,  hearing,  taste,  and  touch 
— allows  the  conversion  of  external  stimuli  into 
nerve  impulses.  In  mammals  the  olfactory  system 
is  exquisitely  sensitive,  capable  of  detecting 
some  odorants  present  at  a  concentration  of  only 
a  few  parts  per  trillion. 

The  ability  of  the  olfactory  system  to  discrimi- 
nate thousands  of  different  odorants  suggests  a 
complex  coding  mechanism.  However,  the  bio- 
logical basis  for  this  coding  remains  a  mystery. 
Unlike  the  visual  and  auditory  systems,  which 
need  only  encode  information  on  frequency  (or 
wavelength)  and  intensity,  the  olfactory  sense  re- 
quires multidimensional  information.  These  con- 
siderations suggest  a  complex  signal  transductory 
process. 

The  signal  transductory  pathway  for  olfaction 
can  be  divided  temporally  and  spatially  into  sev- 
eral distinct  steps.  The  first  consists  of  the  solubi- 
lization and  concentration  of  airborne  odorants. 
Considerable  experimental  data  suggest  that  com- 
ponents of  the  mucus  are  able  to  concentrate 
odorants  several  thousandfold.  We  previously 
identified  cDNA  clones  encoding  proteins  that 
are  present  at  high  concentration  in  the  mucus 
and  that  appear  to  bind  odorants.  These  proteins, 
from  rat  and  frog,  are  members  of  the  retinol- 
binding  protein  family,  many  of  which  have  been 
shown  to  solubilize  hydrophobic  ligands  in 
serum.  They  are  likely  to  play  a  similar  role  in  the 
olfactory  system. 

The  recognition  of  the  chemical  structure  of  an 
odorant  and  transduction  of  that  information 
across  the  plasma  membrane  is  a  poorly  under- 
stood process.  Some  investigators  hypothesize 
that  odorants  interact  directly  with  the  lipid 
membrane,  but  it  is  difficult  to  see  how  the  abil- 
ity to  discriminate  stereoisomeric  compounds 
could  be  accommodated  by  such  a  system.  Sev- 
eral years  ago,  we  began  to  test  an  alternative  hy- 


pothesis: that  binding  of  odorants  to  specific 
membrane-associated  proteins  leads  to  intracel- 
lular changes  in  the  primary  sensory  neuron. 

The  detection  of  odorant-stimulated  activation 
of  adenylyl  cyclase  in  olfactory  neurons  sug- 
gested an  analogy  to  visual  signal  transduction. 
Moreover,  activation  of  adenylyl  cyclase  in  prepa- 
rations of  rat  olfactory  cilia  depends  on  the  pres- 
ence of  guanine  nucleotides.  The  observation  of 
odorant-stimulated  GTP-dependent  adenylate  cy- 
clase activity  argues  strongly  for  a  GTP-binding 
protein-coupled  signal  transductory  pathway. 
We  have  identified  a  GTP-binding  protein  ex- 
pressed exclusively  in  olfactory  sensory  neurons. 
Moreover,  this  G  protein,  Goif,  is  localized  to  the 
olfactory  cilia,  where  the  initial  events  in  olfac- 
tory signal  transduction  are  thought  to  occur.  The 
Golf  protein,  which  is  highly  homologous  to  a 
GTP-binding  protein  that  stimulates  adenylyl  cy- 
clase in  other  systems,  is  expected  to  interact  di- 
rectly with  olfactory  receptors.  We  have  demon- 
strated that  Golf  can  couple  receptor  activation  to 
increases  in  intracellular  cAMP  in  cell  lines  defi- 
cient for  Gsj. 

Does  the  identification  of  an  olfactory-specific 
G  protein  provide  any  insight  into  the  nature  of 
the  olfactory  receptor?  A  number  of  G  protein- 
coupled  receptors  have  been  cloned  and  charac- 
terized; each  consists  of  a  single  glycosylated 
polypeptide  chain  containing  seven  membrane 
crossing  regions.  There  is  limited  but  significant 
homology  among  members  of  this  family,  and  we 
are  presently  attempting  the  direct  cloning  of  the 
olfactory  receptors  based  on  these  similarities. 
We  have  recently  identified  several  novel  G  pro- 
tein-coupled receptors  expressed  in  olfactory  ep- 
ithelium. Their  role  in  olfaction  remains  unclear. 
Identification  of  genes  encoding  olfactory  recep- 
tors may  reveal  how  receptors  are  able  to  detect 
thousands  of  different  odors.  Specifically,  does 
each  of  the  several  million  olfactory  sensory  cells 
encode  a  distinct  receptor  protein? 

The  final  step  in  the  transduction  of  odorant 
stimuli  is  the  generation  of  the  intracellular  sig- 
nal and  the  firing  of  an  action  potential.  Special- 
ized forms  of  second  messenger-generating  en- 
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zymes  and  novel  ion  channels  play  important 
roles  in  this  process.  We  have  identified  cDNA 
clones  encoding  three  distinct  forms  of  adenylyl 
cyclase  and  are  investigating  the  regulation  of 
this  important  enzyme  in  olfactory  tissue.  It  ap- 
pears that  one  form  of  this  enzyme  is  expressed 
by  olfactory  sensory  neurons.  This  olfactory  neu- 
ronal adenylyl  cyclase,  type  III,  has  biochemical 
properties  that  w^ould  be  advantageous  for  an  en- 
zyme involved  in  sensory  transduction.  Electro- 
physiologic experiments  have  identified  cyclic 
nucleotide-gated  ion  channels  in  olfactory  neu- 
rons, and  we  have  recently  isolated  and  charac- 
terized cDNA  clones  from  olfactory  tissue  that  en- 
code ion  channels  with  properties  similar  to 
those  found  in  the  visual  system. 

The  olfactory  system  is  also  interesting  as  a 
model  for  neuron  differentiation  and  develop- 
ment. The  olfactory  neuroepithelium  is  the  only 
neuronal  tissue  in  adult  mammals  that  undergoes 
continual  regeneration.  The  lifetime  of  sensory 
neurons  is  approximately  40  days,  after  which 
they  are  shed  from  the  epithelium  and  replaced 
from  a  population  of  neuroblast-like  precursor 
cells.  Moreover,  if  the  nerve  leading  from  the  sen- 
sory neurons  to  the  olfactory  bulb  is  severed,  all 
1 0  million  receptor  cells  are  rapidly  lost  and  sub- 
sequently replaced  in  a  relatively  synchronous 
fashion. 

We  have  recently  identified  a  number  of  cDNA 
clones  that  encode  proteins  expressed  only  in  the 
mature  sensory  neurons  and  are  presently  at- 


tempting to  elucidate  their  role  in  the  cell.  A 
novel  group  of  proteins,  those  expressed  tran- 
siently during  neuron  maturation,  may  include 
receptors  for  neurogenic  as  well  as  neurotrophic 
factors.  Several  of  the  genes  we  have  identified 
appear  to  encode  membrane-bound  or  cell  sur- 
face proteins. 

We  have  also  initiated  a  project  to  generate  cell 
lines  derived  from  cells  of  olfactory  neuronal  lin- 
eage. Transgenic  mice  have  been  identified  that 
carry  the  SV40  T  antigen  oncogene  linked  to  a 
transcriptional  promoter  for  an  abundant  olfac- 
tory neuron-specific  gene  product.  These  animals 
develop  tumors  originating  in  the  olfactory  neu- 
roepithelium. From  these  tumors,  we  have  estab- 
lished cell  lines  in  culture  that  express  a  number 
of  characteristic  neuronal  markers.  These  cell 
lines  express  several ;  of  the  neuron-specific 
markers  we  previously  identified.  This  approach 
may  be  generally  applicable  to  the  isolation  of 
defined  cell  lines  in  many  other  systems. 

We  are  continuing  to  use  several  techniques  to 
elucidate  the  mechanism  of  signal  transduction. 
Likewise,  the  identification  of  proteins  asso- 
ciated with  the  replacement  of  olfactory  neurons 
provides  the  tools  to  study  neural  development, 
not  just  in  the  olfactory  system  but  also  in  other 
areas  of  the  brain.  In  the  future,  we  will  focus  on 
the  molecular  components  that  underlie  the 
complex  mechanisms  of  signal  transduction,  sig- 
nal processing,  and  the  formation  of  neural 
connections. 
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CRITICAL  for  normal  functioning  of  the  kid- 
ney is  the  integrity  of  the  glomerular  base- 
ment membrane  (GBM),  a  complex  extracellular 
structure  that  forms  one  of  the  main  barriers  be- 
tween the  blood  and  the  urine.  The  GBM  is  com- 
posed of  several  proteins,  including  five  related 
but  subtly  different  collagens  that  interact  to 
form  a  chicken-wire  mesh  holding  the  membrane 
together.  One  of  the  interests  of  our  laboratory  is 
Goodpasture  syndrome,  an  autoimmune  disorder 
in  which,  for  unknown  reasons,  autoantibodies 
are  suddenly  targeted  at  the  collagen  components 
of  basement  membrane  in  the  lungs  and  kidneys. 
In  the  kidney,  these  antibodies  produce  a  sudden 
and  devastating  inflammation,  which  frequently 
leads  to  acute  renal  failure,  irreversible  unless 
treated.  The  nephritis  is  often  accompanied  by 
autoimmune  lung  damage,  manifested  by  bleed- 
ing into  the  alveoli. 

Previous  studies  have  shown  that  the  probable 
target  of  Goodpasture  autoantibodies  is  the  a  3 
chain  of  basement  membrane  collagen.  To  under- 
stand the  pathogenesis  more  clearly,  we  under- 
took to  isolate  and  purify  the  collagen  chain  so  as 
to  study  its  structure.  Because  the  collagen  is  pres- 
ent in  very  small  amounts  and  is  accompanied  by 
four  similar  proteins,  purifying  it  has  proved  dif- 
ficult. We  have  therefore  isolated,  cloned,  and 
sequenced  the  gene  for  the  a3  chain  of  basement 
membrane  collagen  and  have  used  the  sequence 
information  to  predict  the  primary  structure  and 
compare  this  protein  with  other  basement  mem- 
brane collagens. 

In  collaboration  with  Billy  Hudson  (Kansas 
City) ,  we  used  knowledge  of  the  primary  struc- 
ture to  identify  several  potential  antibody-bind- 
ing sites  (epitopes)  in  the  q;3  molecule.  We  syn- 
thesized short  peptides  and  used  them  to  test  the 
binding  of  some  of  these  sites,  which  we  local- 
ized to  within  a  small  region  of  1 2  amino  acids.  At 
least  one  of  the  peptides  has  very  high  affinity  for 
Goodpasture  antibodies  and  adsorbs  them  from 
patients'  serum.  Knowledge  of  the  epitope  struc- 
ture should  enable  us  to  develop  a  means  of  selec- 
tively adsorbing  Goodpasture  antibodies,  open- 


ing possibilities  for  a  new  treatment  modality.  In 
addition,  this  information  may  provide  clues  to 
the  development  of  autoimmunity  in  this 
disorder. 

One  of  the  major  projects  in  our  laboratory  is  a 
study  of  the  molecular  and  cellular  pathology  of 
autosomal  dominant  polycystic  kidney  disease 
(ADPKD),  one  of  the  commonest  causes  of  kid- 
ney failure  in  humans,  affecting  at  least  1  in 
1,000  of  the  population.  The  disease  is  an  enor- 
mous burden  to  families  and  the  community, 
since  the  majority  of  patients  develop  irreversible 
kidney  failure  in  middle  life  and  require  dialysis 
or  transplantation  for  survival. 

Having  previously  ascertained  that  the  majority 
of  the  inherited  mutations  in  ADPKD  lie  close  to 
the  tip  of  the  short  arm  of  chromosome  16,  we 
have  isolated  and  cloned  a  small  segment  of  DNA 
(550,000  base  pairs)  that  includes  the  mutated 
gene.  This  region  turns  out  to  be  extremely  gene 
rich,  and  we  have  already  isolated  22  genes  from 
within  it.  Since  we  have  not  been  able  to  detect 
any  large-scale  deletions  or  rearrangements  af- 
fecting any  of  the  22,  we  have  begun  to  examine 
the  sequence  of  these  genes  in  detail  and  to  look 
for  mutations  that  may  affect  only  one  or  two 
nucleotides. 

Jaap  IJdo,  a  Howard  Hughes  associate  in  our 
laboratory,  has  been  interested  in  the  structure  of 
telomeres,  the  ends  of  chromosomes.  He  has 
shown  that  banks  of  repetitive  sequence  reminis- 
cent of  human  telomeres  (TTAGGG)  are  also  pres- 
ent at  other  sites  within  the  human  genome.  One 
of  the  most  interesting  of  these  banks  is  embed- 
ded in  the  middle  of  the  long  arm  of  chromo- 
some 2.  Comparison  of  the  chromosome  banding 
pattern  of  humans  with  that  of  several  closely  related 
apes  suggests  that  this  region  of  the  chromosome 
contains  a  point  at  which  two  ancestral  ape  chromo- 
somes fused.  Dr.  IJdo  has  cloned  this  point  and 
shown  that  it  consists  of  a  head-to-head  telomere- 
telomere  fusion.  He  is  investigating  the  possibility 
that  this  point  of  fusion  is  also  responsible  for  a  rare 
form  of  chromosome  fragility  that  has  been  observed 
to  occur  at  or  very  near  this  site. 
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MY  laboratory  is  investigating  molecules  in 
the  extracellular  environment  of  neurons 
(conducting  nerve  cells)  that  direct  their  devel- 
opment in  vivo.  These  include  trophic  (nutri- 
tive) factors,  exemplified  by  nerve  growth  factor, 
and  molecules  in  the  extracellular  matrix  or  on 
the  surface  of  cells  that  serve  as  substrates  for  the 
growth  of  axons  (long  nerve  fibers) .  Such  mole- 
cules— e.g.,  laminin  and  N-cadherin — help  to 
regulate  neuronal  survival,  axon  growth,  and  syn- 
apse (nerve  junction)  formation. 

Neurons  require  contact  with  targets  to  survive 
during  development.  Experimentally  increasing 
or  decreasing  the  volume  of  target  tissue  corre- 
spondingly increases  or  decreases  neuronal  sur- 
vival. To  explain  these  target  influences,  it  has 
been  proposed  that  target  organs  synthesize  tro- 
phic factors  required  for  the  survival  of  the  inner- 
vating neurons.  Defects  in  the  synthesis  of  these 
factors  or  in  the  neuron's  ability  to  respond  to 
them  may  explain  some  neurodegenerative 
disorders. 

Work  on  trophic  factors  in  the  past  year  has 
focused  on  those  related  to  nerve  growth  factor. 
The  human  homologue  of  one  of  these,  brain- 
derived  neurotropic  factor,  was  isolated,  and 
NT-3,  a  novel  trophic  factor  in  the  nerve  growth 
factor  family,  was  identified.  Each  acts  on  distinct 
but  partially  overlapping  sets  of  neurons. 

The  key  to  understanding  trophic  factors  is  to 
identify  and  understand  the  actions  of  their  re- 
ceptors. One  of  the  proteins  that  functions  as  a 
receptor  for  the  family  of  nerve  growth  factor- 
related  trophic  factors  is  the  low-affinity  nerve 
growth  factor  receptor.  Antibodies  to  this  protein 
have  been  prepared  and  used  to  show  that  it  is 
distinct  from  a  second  protein  (or  set  of  proteins 
with  similar  sizes)  that  constitutes  a  distinct 
high-affinity  receptor  for  nerve  growth  factor. 
The  same  antibodies  have  been  used  to  show  that 
nerve  growth  factor-dependent  survival  and 
axon  growth  by  response  neurons  do  not  require 
the  low-affinity  receptor.  These  antibodies  are 
being  used  to  examine  the  role  of  this  class  of 
receptor  in  mediating  other  responses  to  nerve 
growth  factor. 


Our  laboratory  has  devoted  considerable  effort 
to  identifying  molecules  that  promote  the  growth 
of  neuronal  processes.  We  have  tried  to  identify 
both  the  molecules  that  axons  recognize  in  their 
environment  and  the  receptors  that  neurons  use 
for  binding  to  these  molecules.  Our  results  have 
shown  that  laminin,  an  adhesive  protein,  is  by  far 
the  most  active  of  the  glycoproteins  that  cells  se- 
crete into  the  extracellular  matrix.  Other  pro- 
teins that  have  similar  activities  include  fibronec- 
tin,  thrombospondin,  and  vitronectin. 

We  have  also  shown  that  neurons  utilize  a  fam- 
ily of  receptors,  the  integrins,  to  bind  to  laminin 
and  other  matrix  glycoproteins.  Distinct  recep- 
tors appear  to  mediate  neuronal  adhesion  and 
growth  cone  motility  on  laminin,  fibronectin, 
and  collagen.  In  the  past  two  years  we  have  puri- 
fied several  of  these  proteins  and  isolated  clones 
encoding  subunits  of  the  receptors.  Specific  anti- 
bodies to  these  subunits  have  been  prepared  and 
are  being  used  to  study  their  regulation.  Of  partic- 
ular interest,  we  have  identified  the  receptors 
used  by  both  peripheral  neurons  and  retinal  neu- 
rons to  interact  with  laminin,  a  heterotrimer  as- 
sembled from  three  diff^erent  gene  products — A, 
Bl,  and  B2.  Josh  Sanes  and  Eva  Engvall  have 
shown  that  at  least  two  different  A-like  genes  and 
two  different  Bl  -like  genes  exist  and  are  differen- 
tially expressed,  making  it  possible  to  assemble 
four  different  isoforms  of  laminin,  which  are  dif- 
ferentially distributed  in  embryos.  We  have 
shown  that  individual  integrin  heterodimers  dis- 
tinguish among  these  isoforms,  making  it  possi- 
ble for  cells  to  exhibit  different  responses  to  indi- 
vidual isoforms. 

Evidence  suggests  that  the  activity  of  integrin 
receptors  may  modulate  the  behavior  of  axons  in 
vivo.  In  studies  on  regulation  of  otf^fi^,  the  major 
laminin  receptor  in  the  neuroretina,  we  have 
shown  that  receptor  function  can  be  regulated  on 
several  levels.  First,  expression  of  the  genes  en- 
coding the  two  subunits  is  regulated.  Retinal  gan- 
glion cells  lose  responsiveness  to  laminin  when 
they  contact  their  synaptic  partners  in  the  optic 
tectum,  and  this  reflects  down-regulation  of  ex- 
pression of  the  gene.  Second,  other  neurons  in 
the  retina  modulate  responses  to  laminin  by  ex- 
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hibiting  changes  in  the  activity  of  integrin  recep- 
tors on  their  surfaces.  These  changes  occur  rap- 
idly and  can  be  modulated  by  external  and 
internal  agents.  Yet  a  third  level  of  regulation  ap- 
pears to  modulate  the  signals  conveyed  by  inte- 
grin receptor  binding  to  laminin  and  other  pro- 
teins. These  signals  can  be  modulated  by  trophic 
factors,  such  as  nerve  growth  factor. 

In  addition  to  proteins  secreted  into  the  extra- 
cellular environment  of  cells,  neurons  also  recog- 
nize and  use  for  axon  extension  integral  mem- 
brane proteins  on  the  surfaces  of  different  cell 
types.  Recent  work  in  our  laboratory  has  identi- 
fied some  of  the  major  proteins  that  neurons  use 
for  axonal  growth  on  central  and  peripheral  cells. 
Thus  neurons  have  been  shown  to  use  both  extra- 
cellular matrix  and  cell-cell  adhesion  molecules 
for  growth  of  axons  on  Schwann  cells,  the  major 
cell  type  with  which  they  interact  in  peripheral 
nerves.  The  cell  adhesion  molecules  include 
members  of  the  immunoglobulin  superfamily 
and  members  of  a  new  family  of  calcium-depen- 
dent adhesion  molecules,  the  cadherins. 


In  studies  on  interactions  of  neurons  with  other 
cells  in  the  brain,  it  became  clear  that  additional 
adhesion  molecules,  not  yet  identified  or  charac- 
terized, must  be  important.  We  have  used  molec- 
ular biological  methods  to  identify  two  of  these 
with  localized  expression  patterns  in  the  brain. 
One  of  these,  the  integrin  a^^^,  is  localized  on 
axons  in  many  of  the  major  axon  tracts  of  the 
brain.  Its  concentration  on  axons  in  these  tracts 
suggests  that  it  mediates  interactions  between 
these  axons.  The  second  novel  adhesion-promot- 
ing molecule  is  B-cadherin,  a  Ca^^-dependent  ad- 
hesion molecule  that  is  also  expressed  in  the  ner- 
vous system.  It  appears  to  be  concentrated  in 
specific  cells  in  several  areas  of  the  brain.  Both 
molecules  are  also  expressed  in  some  areas  out- 
side the  nervous  system. 

Future  work  will  focus  on  determining  how 
these  neuronal  receptors  act  to  promote  growth 
cone  motility,  how  they  are  regulated  during  de- 
velopment, and  what  their  importance  is  in  regu- 
lating the  growth  of  axons  during  development 
and  regeneration. 
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MULTICELLULAR  organisms  have  evolved 
complex  mechanisms  to  differentiate  them- 
selves from  their  environments  and  to  perceive 
and  defend  against  potential  threats  to  their  in- 
tegrity. The  immune  system  of  vertebrates  is  a 
sophisticated  defense  system  organized  to  deal 
with  attack  at  the  microbial  and/or  molecular 
level.  Recent  data  indicate  that  microbial  organ- 
isms similarly  have  evolved  mechanisms  to  en- 
hance the  effectiveness  of  their  interactions  with 
specific  recognition  elements  of  vertebrate  im- 
mune systems.  The  primary  focus  of  specific  rec- 
ognition by  the  immune  system  is  a  specialized 
subpopulation  of  lymphocytes,  termed  T  cells, 
that  is  responsible  for  discriminating  molecules 
perceived  as  self  from  those  that  are  foreign.  Sev- 
eral families  of  genes  are  directly  involved  in  this 
process,  including  genes  that  encode  the  T  cell 
receptor  for  antigen  and  those  of  the  major  histo- 
compatibility complex  (MHC). 

The  MHC  genes,  which  exist  as  a  linked  cluster 
on  a  single  chromosome,  are  intimately  involved 
in  the  definition  of  "self"  for  cells  of  the  immune 
system.  The  importance  of  the  MHC  in  self- 
nonself  discrimination  was  appreciated  long  be- 
fore its  essential  role  in  antigen  recognition  was 
defined.  As  suggested  by  its  name,  the  MHC  con- 
stitutes the  major  barrier  to  exchange  of  tissue 
grafts  between  members  of  a  vertebrate  species. 

T  cell  receptors  recognize  foreign  antigen 
solely  in  the  context  of  self  MHC  molecules, 
which  are  expressed  on  the  surfaces  of  cells  spe- 
cialized for  antigen  presentation.  Recent  data 
suggest  that  recognition  is  accomplished  by  the 
specific  binding  of  small  peptide  antigens  within 
a  groove  on  the  outer  face  of  MHC  molecules.  T 
cell  receptors  are  thought  to  be  selected  during 
differentiation  in  the  thymus  for  their  capacity  to 
bind  this  binary  complex  of  foreign  peptide  and 
self  MHC. 

Our  objectives  include  definition  of  the  genes 
and  gene  products  of  the  MHC  in  mice  and  hu- 
mans and  incorporation  of  such  information  into 
an  understanding  of  T  cell  recognition  of  foreign- 
ness.  The  relationships  between  the  structure  of 
MHC  molecules  and  their  biological  functions 


are  of  particular  interest.  Two  major  classes  of 
MHC  molecules  have  been  defined  by  their  dis- 
tinctive structural  features.  Molecules  of 
both  classes  are  important  in  self-nonself 
discrimination. 

T  cells  involved  in  induction  of  immune  re- 
sponses primarily  employ  class  II  MHC  mole- 
cules. Three  major  types  of  human  class  II  mole- 
cules are  recognized:  DP,  DQ,  and  DR.  Previously 
we  employed  cloned  T  cells  to  define  the  contri- 
butions of  each  type  to  the  recognition  of  foreign 
antigens  and  to  investigate  the  cellular  mecha- 
nisms involved  in  antigen  presentation.  A  DR  mol- 
ecule was  identified  with  unique  biochemical 
features  and  distinctive  defects  in  its  capacity  to 
function  in  antigen  presentation.  This  defect  was 
highly  associated  with  a  deficiency  in  activity  of 
linked  genes  controlling  an  enzyme  important  in 
the  biosynthesis  of  adrenal  steroids.  Because  an 
understanding  of  the  DR  molecule  should  pro- 
vide insight  into  general  structure-function  rela- 
tionships of  MHC  molecules,  we  sequenced 
cDNAs  encoding  its  constituent  polypeptides. 
The  biochemical  and  functional  anomalies  were 
associated  with  point  mutations  in  two  adjacent 
codons,  resulting  in  amino  acid  substitutions 
within  the  ;8-chain  of  the  DR  molecule.  These 
substitutions  are  hypothesized  to  affect  interac- 
tions between  the  ^-  and  a-chain  polypeptides 
and  may  affect  binding  of  foreign  antigenic  pep- 
tides. Studies  in  progress  are  designed  to  eluci- 
date the  molecular  mechanisms  accounting  for 
the  unique  properties  of  this  molecule. 

Our  studies  of  MHC  class  Il-dependent  T  cell 
stimulation  led  us  to  analysis  of  the  mitogenic 
activities  of  a  family  of  staphylococcal  exotoxins 
that  have  been  of  considerable  medical  interest  as 
the  etiologic  agents  of  staphylococcal  food  poi- 
soning and  toxic  shock  syndrome.  The  recent  ap- 
preciation of  unique  aspects  of  the  interaction  of 
these  bacterial  products  with  T  cell  receptors, 
and  their  capacity  to  thus  shape  the  T  cell  reper- 
toire, has  led  to  their  designation  as  "superanti- 
gens"  (Philippa  Marrack  [HHMI]  and  John 
Kappler  [HHMI],  National  Jewish  Center  for  Im- 
munology and  Respiratory  Medicine,  Denver). 
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We  initially  reported  that  MHC  class  II  molecules 
are  the  specific  cellular  receptors  for  a  prototype 
of  this  family  of  molecules,  staphylococcal  en- 
terotoxin  A  (SEA) . 

Interest  in  the  structural  basis  of  superantigen 
activities  led  us  to  assess  their  binding  affinities 
to  MHC  class  II  molecules  and  the  effects  of  bio- 
chemical modifications  on  binding  and  function. 
We  observed  a  direct  correlation  between  bind- 
ing affinity  as  a  linear  function  and  the  logarithm 
of  the  effective  dose  for  T  cell  stimulation.  Thus 
relatively  small  changes  in  binding  affinity  had 
marked  effects  on  enterotoxin  potency.  A  com- 
mon feature  of  members  of  the  enterotoxin  fam- 
ily is  a  small  intrachain  disulfide  loop  in  the  cen- 
tral region  of  the  molecule.  We  assessed  the  role 
of  this  loop  and  adjacent  amino  acids  on  func- 
tional activity  and  demonstrated  that  the  disul- 
fide bond  was  essential  to  T  cell  mitogenic  activ- 
ity but  had  little  effect  on  MHC  class  II  binding  or 
monocyte  stimulation.  We  concluded  that  the  in- 
ability of  these  modified  toxins  to  induce  T  cell 
proliferation  reflected  their  failure  to  interact  ef- 
fectively with  the  T  cell  receptor  for  antigen 
rather  than  with  class  II  molecules. 

For  several  years  we  have  also  studied  "non- 
classical"  MHC  class  I  genes  and  antigens  of  the 
mouse.  These  genes,  which  constitute  the  major- 
ity of  class  I  genes,  were  termed  nonclassical  be- 
cause they  had  not  been  thought  to  be  involved  in 
conventional  antigen  presentation  and  self- 
nonself  discrimination.  We  have  been  specifi- 
cally interested  in  the  maternally  transmitted  an- 
tigen (Mta).  Discovered  by  Kirsten  Fischer  Lin- 
dahl  (HHMI,  University  of  Texas  Southwestern 


Medical  Center  at  Dallas),  Mta  is  recognized  by 
cytolytic  T  cells  and  is  unique  in  its  mode  of  in- 
heritance. Offspring  of  mice  with  different  forms 
of  the  antigen  always  express  the  maternal  form. 
Both  nuclear  and  mitochondrial  genes  are  in- 
volved in  Mta  expression,  but  the  maternally  in- 
herited polymorphism  is  determined  by  a  mito- 
chondrial gene  product.  This  is  a  hydrophobic 
peptide  encoded  at  the  amino  terminus  of  the 
NDl  mitochondrial  gene.  We  have  investigated 
structural  properties  of  this  peptide  that  enable  it 
to  bind  and  be  presented  by  a  specific  nonclassi- 
cal class  I  gene  product,  Hmt.  The  essential  fea- 
ture of  this  peptide  is  a  biochemical  substitution 
(an  amino-terminal  formyl  residue)  that  is  char- 
acteristic of  the  first  amino  acid  of  mitochondrial 
proteins  but  is  not  found  on  proteins  synthesized 
from  nuclear  genes.  Proteins  synthesized  within 
mitochondria  share  this  property  with  proteins 
synthesized  by  bacteria.  We  therefore  investi- 
gated the  capacity  of  certain  bacterial  peptides  to 
bind  similarly  to  the  Hmt  class  I  molecule.  Three 
of  the  first  six  synthetic  bacterial  peptides  char- 
acterized displayed  this  property,  but  if  and  only 
if  they  also  had  the  terminal  formyl  residue.  Un- 
modified peptides  or  peptides  modified  with 
other  biochemical  radicals  were  unable  to  bind. 

These  data  suggest  that  the  Hmt  class  I  mole- 
cule may  represent  a  specialized  host  defense 
mechanism  for  recognition  of  proteins  of  bacte- 
rial origin.  As  such  they  may  provide  insight  into 
the  biological  functions  of  the  nonclassical  MHC 
class  I  antigens.  Additionally,  they  offer  the  first 
evidence  of  true  biochemical  specialization  in 
the  binding  of  antigenic  peptides  to  a  specific 
MHC  molecule. 
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MY  laboratory  is  interested  in  two  fundamen- 
tal problems.  Our  earliest  and  foremost  in- 
terest is  the  molecular  genetics  of  RNA  process- 
ing. For  this  subject  our  principal  experimental 
system  is  the  budding  yeast  Saccharomyces  cere- 
visiae,  which  is  amenable  to  both  genetic  and 
biochemical  attack.  Our  more  recent  interest  is 
the  molecular  genetics  of  behavior — in  particu- 
lar, circadian  rhythms.  This  problem  is  addressed 
in  the  fruit  fly  Drosophila  melanogaster,  which 
is  amenable  to  behavioral  as  well  as  biochemical 
and  genetic  approaches. 

A  major  objective  in  the  field  of  RNA  process- 
ing is  to  understand  the  rules  that  govern  pre- 
mRNA  splicing.  It  is  now  more  than  10  years 
since  splicing  was  discovered  in  mammalian  cells 
and  viruses.  This  discovery  revolutionized  the 
field  of  genetics,  since  it  refuted  a  major  tenet — 
that  genes  are  intact  units.  It  became  clear  that 
genes  are  divided  into  segments  and  that  the  inco- 
herent material  (the  introns)  is  discarded,  while 
the  retained  sense  material  needed  to  code  for 
proteins  (the  exons)  is  carefully  sewn  together. 
Neither  the  efficiency  nor  the  specificity  of  the 
process  is  well  understood. 

More  recently  it  emerged  that  all  eukaryotes 
(including  S.  cerevisiae)  undergo  the  same  splic- 
ing process  with  essentially  the  same  biochemi- 
cal machinery.  This  includes  a  myriad  of  protein 
factors,  most  of  which  remain  uncharacterized  if 
not  unidentified,  and  five  small  nuclear  ribonu- 
cleoprotein  particles  (snRNPs) .  Each  snRNP  con- 
tains one  or  two  small  molecules  of  RNA  and  per- 
haps 5-10  proteins.  The  pre-mRNA  substrate  and 
many  of  these  factors,  including  the  five  snRNPs, 
assemble  into  the  spliceosome,  a  large  particle 
within  which  the  cleavage  and  ligation  events  of 
splicing  take  place. 

We  are  particularly  interested  in  the  early 
events  of  the  spliceosome  assembly  process,  as  it 
seems  to  be  related  to  those  of  intron  recognition 
or  definition.  The  highly  ordered  assembly  pro- 
cess must  involve  recognition  of  an  appropriate 
pre-mRNA  substrate  to  initiate  the  splicing  pro- 
cess and  prevent  unproductive  transport  of  the 
unspliced  pre-mRNA  to  the  cytoplasm.  In  addi- 


tion, these  early  events  are  likely  related  to  the 
mysterious  prevention  of  exon  shuffling — i.e., 
exons  are  usually  sewn  together  in  the  correct 
order. 

We  have  focused  on  Ul  snRNP,  as  there  was 
reason  to  believe  that  it  features  prominently  in 
recognition  of  one  of  the  two  intron  splice  sites 
(the  5'  splice  site)  and  even  in  catalytic  specific- 
ity. Our  recent  results  indicate  that  Ul  snRNP  is 
indeed  very  important  to  the  early  stages  of  spli- 
ceosome assembly.  Surprisingly,  we  found  that  it 
participates  in  the  recognition  of  both  splice 
sites,  but  that  this  recognition  is  not  sufficient  to 
define  the  specificity  with  which  the  splicing  ma- 
chinery cleaves  the  5'  splice  site. 

Genetics  of  Behavior 

The  other  aspect  of  our  work  is  concerned  with 
circadian  rhythms.  The  choice  of  Drosophila  was 
dictated  by  such  factors  as  its  suitability  for  ge- 
netic analysis  and  the  large  amount  of  behavioral 
work  done  on  this  animal.  For  example,  it  was 
known  that  its  circadian  rhythms,  as  in  all  other 
systems  examined  to  date,  manifest  temperature 
compensation:  i.e.,  the  circadian  rhythm  periods 
(approximately  24  hours)  change  little  if  at  all 
with  temperature. 

The  period  {per)  gene  has  a  profound  influ- 
ence on  the  animal's  circadian  rhythms.  Alleles  of 
the  gene  speed  up,  slow  down,  or  apparently 
eliminate  the  animal's  circadian  rhythms.  At  least 
two  different  rhythms,  including  those  of  loco- 
motor activity  (the  animal's  analogue  of  our 
sleep-wake  cycle),  are  similarly  affected.  This 
work  suggested  that  the  per  gene  might  consti- 
tute an  entree  into  the  mysterious  world  of  the 
circadian  oscillator. 

We  have  focused  on  the  per  gene  and  its  prod- 
uct with  the  thought  that  it  might  code  for  a 
"clock  molecule."  This  possibility  is  even 
stronger  due  to  the  characterization  of  a  variety 
of  mutant  and  transformed  strains. 

Our  experiments  have  reinforced  the  sugges- 
tion that  the  gene  is  making  a  physiological  rather 
than  developmental  contribution  to  the  oscilla- 
tor— i.e.,  it  is  carrying  out  a  biochemical  func- 
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tion  necessary  to  the  running  of  the  oscillator 
rather  than  contributing  to  the  "building"  of  the 
oscillator  during  development.  Until  the  per 
gene  is  turned  on,  the  animal  cannot  tell  time. 
These  experiments  also  provide  some  indication 
as  to  where  the  circadian  oscillator  may  reside. 

We  have  also  discovered  that  the  per  gene  prod- 
uct cycles  in  a  circadian  fashion:  it  is  present  at 
relatively  high  levels  in  the  middle  of  the  day. 
Moreover,  the  per  mRNA  exhibits  a  similar  "be- 


havior"; the  cycling  of  the  per  protein  is  due,  at 
least  in  part,  to  a  cycling  of  its  mRNA.  A  variety  of 
experimental  approaches  indicate  that  the  per 
protein  feeds  back  to  influence  the  cycling  of  its 
own  mRNA.  Recent  evidence  suggests  that  a  sec- 
ond post-transcriptional  regulatory  event  is  also 
necessary  for  cycling  of  the  per  protein.  We  sus- 
pect that  this  important  feedback  loop  underlies 
the  behavioral  rhythms  that  are  observed  in  nor- 
mal and  mutant  fruit  flies. 
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OVER  the  past  year  the  central  research  focus 
in  this  laboratory  has  been  to  define  the  mo- 
lecular mechanisms  that  dictate  the  developmen- 
tal and  regulated  expression  of  neuroendocrine 
genes  and  to  begin  to  apply  emerging  principles 
to  analysis  of  neuronal  gene  expression. 

The  neuroendocrine  system  coordinates  the 
complex  pattern  of  regulation  necessary  to 
achieve  the  precise  temporal,  spatial,  and  homeo- 
static  patterns  of  gene  expression  required  by 
complex  eukaryotic  organisms.  Development  of 
the  central  nervous  system  and  endocrine  organs 
involves  precise  patterns  of  responses  to  morpho- 
gens  and  other  regulatory  signals  that  ultimately 
establish  the  intricate  patterns  of  neural  and  en- 
docrine phenotypes.  The  cloning  and  analysis  of 
specific  genes  encoding  regulatory  genes,  recep- 
tors, and  cell-specific  transcription  factors  have 
permitted  an  initial  definition  of  developmental 
and  regulatory  transcriptional  and  post-transcrip- 
tional  strategies  in  the  neuroendocrine  system. 

We  are  using  the  anterior  pituitary  gland  as  a 
model  to  investigate  the  molecular  basis  for  gen- 
eration of  specific  cell  phenotypes  in  an  organ. 
The  rat  genes  for  growth  hormone  and  prolactin 
(a  pituitary  hormone  that  stimulates  and  sustains 
lactation)  exhibit  precisely  restricted  expression 
in  somatotrophs  and  lactotrophs,  respectively. 
Prolactin  gene  expression  is  dictated  by  two  sepa- 
rate regions,  a  distal  enhancer  and  a  proximal  re- 
gion, each  containing  at  least  four  critical  ele- 
ments. The  two  regions,  each  capable  of  targeting 
tissue-specific  gene  expression,  act  synergisti- 
cally  to  generate  high  levels  of  prolactin  gene  ex- 
pression in  transgenic  mice. 

The  consensus  binding  site  for  cell-specific 
transcription-regulating  protein (s),  referred  to  as 
Pit-1 ,  was  characterized;  the  33-kDa  Pit- 1  protein 
was  purified;  and,  using  in  vitro  transcription  as- 
says, it  was  shown  to  exert  functional  effects. 
Pit-1  cDNAs  were  cloned  from  rat  pituitary  ex- 
pression libraries.  Bacterially  expressed  Pit-1 
bound,  specifically  and  with  high  affinity,  to  pro- 
lactin and  growth  hormone  promoters.  Addi- 
tional elements  and  factors  are  required  to 
achieve  the  full  physiological  levels  and  re- 


stricted patterns  of  expression  of  the  prolactin 
and  growth  hormone  genes. 

The  carboxyl  terminus  of  Pit- 1  contains  a  60- 
amino  acid  region  similar  to  the  homeodomain  of 
several  Drosophila  and  vertebrate  regulatory 
genes.  Pit-1  contains  a  second,  76-amino  acid 
sequence  with  striking  homology  to  other  tran- 
scription regulators.  These  two  regions,  together 
with  a  nonconserved  region  between  them,  are 
referred  to  as  the  POU  domain.  We  have  found 
that  the  76-amino  acid  POU-specific  domain 
functions  in  high-affinity  DNA  binding,  in  confer- 
ring site  specificity,  and  in  protein-protein  inter- 
actions. The  major  transcription-activating  do- 
main of  Pit-1  is  a  70-amino  acid  amino-terminal 
region  rich  in  serine  and  threonine  and  distinct 
from  recognized  motifs.  A  detailed  structure- 
function  analysis  has  suggested  that  the  determi- 
nants of  DNA  binding  by  POU-domain  proteins 
are  distinct  from  those  of  the  classical  homeodo- 
main proteins.  A  genetic  approach  was  utilized  to 
determine  the  functional  role  of  Pit- 1  during  or- 
ganogenesis. Based  on  the  demonstration  of 
disrupted  patterns  of  the  Pit  l  gene  in  genetic 
dwarf  mice,  it  appears  that  the  POU-domain  pro- 
tein acts  as  a  developmental  regulator  to  deter- 
mine patterns  of  commitment,  progression,  and 
proliferation  of  specific  cell  types  in  the  anterior 
pituitary  gland. 

We  devised  a  strategy  to  isolate  new  members 
of  the  POU-domain  gene  family.  Four  were  iden- 
tified from  cDNA  in  brain  and  testes.  Transcripts 
from  the  Brn-1  and  Brn-2  genes  exhibited  vir- 
tually identical  patterns  of  expression  in  the  cen- 
tral nervous  system,  although  Brn-1  was  clearly 
expressed  in  the  medullary  zone  of  the  kidney, 
and  Brn-2  was  not.  Brn-3  mRNA  was  present  pre- 
dominantly in  the  peripheral  nervous  system; 
Tst- 1  transcripts  were  present  in  testes  and  brain. 
Subsequently,  other  POU  domains  have  been 
identified  expressed  in  early  development  or 
during  organogenesis. 

Most  POU-domain  genes  were  widely  ex- 
pressed in  all  levels  of  the  neural  tube  (including 
the  retina)  during  early  development,  and  hybrid- 
ization in  the  ventricular  (proliferative)  zone  of 
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the  neuroepithelium  was  evident  for  all  four 
transcripts  at  all  levels.  The  time  course  of  ana- 
tomic restriction  in  the  developing  neural  tube 
was  distinct,  and  the  patterns  for  each  gene  prod- 
uct tended  to  reflect  the  adult  loci  of  expression. 
In  addition,  reactive  transcripts  for  some  of  these 
geftes  were  also  expressed  during  mammalian 
neurogenesis.  We  recently  identified  many  addi- 
tional family  members  in  the  brain  and  obtained 
evidence  that  at  least  some  members  of  this  fam- 
ily can  bind  to  specific  elements  in  distinct 
classes  of  neuronally  expressed  genes.  Two  POU- 
domain  genes  were  cloned  from  Drosophila 
cDNA  libraries  and  identified  in  neural  structures 
during  development.  These  proteins  form  stable 
heterodimers  and  act  to  alter  patterns  of  activa- 
tion of  neuron-specific  genes. 

Interactions  between  diverse  transcription  fac- 
tors generate  heterodimers  that  exert  distinct  pat- 
terns of  gene  activation.  Thus,  in  the  case  of  the 
retinoic  acid  receptor,  we  find  that  a  series  of  cell 
type-specific  coregulators  impose  novel  hierar- 
chies of  binding  site  preferences.  Unique  posi- 
tive and  negative  transcriptional  regulators  im- 
pose variable  patterns  of  gene  activation, 
potentially  contributing  to  the  refinement  of  phe- 


notypic  variance  required  in  the  central  nervous 
system. 

We  have  provided  evidence  that  a  gene  of  the 
neuroendocrine  system,  the  calcitonin/calci- 
tonin gene-related  peptide  (CGRP)  gene,  con- 
tains genomic  regions  that  represent  discrete 
hormone-encoding  domains.  The  ultimate  ex- 
pression of  these  domains  depends  on  alternative 
RNA-processing  events  that  differentially  include 
or  exclude  specific  exons  encoding  specific  com- 
ponents of  polypeptide  regulators  in  the  mature 
mRNA  products.  The  rat  and  human  calcitonin/ 
CGRP  genes  are  composed  of  six  exons.  More 
than  95  percent  of  the  mature  transcripts  in  thy- 
roid C  cells,  encoding  calcitonin,  are  produced 
by  splicing  of  the  first  three  to  four  exons.  CGRP 
mRNA  is  the  only  detectable  mature  transcript  in 
rat  neuronal  tissue.  This  appears  to  reflect  the 
actions  of  a  specific  regulatory  machinery  con- 
trolling post-transcriptional  RNA  splice  acceptor 
choice,  dependent  on  a  specific,  discrete  cis- 
active  element. 

In  summary,  these  investigations  have  estab- 
lished several  novel  aspects  of  transcriptional 
and  post-transcriptional  regulatory  strategies 
in  neuroendocrine  gene  expression  and  in 
organogenesis. 
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RESEARCH  in  our  laboratory  is  directed  to- 
ward studies  of  differentiation  and  gene  regu- 
lation in  the  developing  nervous  system.  Our  ap- 
proach involves  studying  genes  whose  mutations 
disrupt  neural  development.  During  the  past 
year,  we  have  focused  on  several  genes  important 
for  the  determination  of  cell  fates  in  the  develop- 
ing retina  of  the  fruit  fly  Drosophila. 

Two  very  different  but  not  exclusive  mecha- 
nisms can  account  for  the  selection  of  distinct 
developmental  pathways.  First,  cells  may  be  pro- 
grammed in  a  lineage-dependent  manner  by  the 
asymmetric  partitioning  of  determinants  during 
cell  division.  Different  developmental  pathways 
are  then  selected  in  the  daughter  cells  in  re- 
sponse to  the  different  localized  determinants.  Al- 
ternatively, cellular  diff^erentiation  may  occur  in 
a  lineage-independent  manner,  where  the  posi- 
tion that  a  cell  occupies  in  a  developing  field 
determines  its  fate.  In  this  case,  diffusible  sub- 
stances, such  as  hormones,  or  interactions  with 
adjacent  cells  are  the  primary  determinants  of 
cellular  differentiation.  Although  the  mecha- 
nisms used  to  read  and  interpret  such  positional 
information  are  largely  unknown,  short-range 
cellular  interactions  are  thought  to  be  of  princi- 
pal importance  in  a  wide  variety  of  developmen- 
tal phenomena. 

The  compound  eye  of  Drosophila  melanogas- 
ter,  which  is  an  attractive  system  to  study  the 
mechanisms  underlying  lineage-independent  de- 
velopmental decisions,  is  a  two-dimensional 
array  of  800  repeating  units,  or  ommatidia.  Each 
ommatidium  contains  8  photoreceptor  cells  and 
12  nonneuronal  accessory  cells.  Each  photore- 
ceptor cell  has  a  distinct  cellular  identity,  based 
on  both  its  position  within  the  ommatidium  and 
its  projection  pattern  to  the  optic  lobes  of  the 
brain.  The  stereotyped  arrangement  of  this  small 
number  of  nerve  cells,  together  with  the  dispen- 
sability of  the  visual  system  under  laboratory  con- 
ditions, makes  this  an  attractive  model  system  to 
study  genes  involved  in  the  specification  of  nerve 
cells. 

Assembly  of  ommatidia  begins  in  an  initially 


unpatterned  monolayer  of  epithelial  cells,  the 
eye  imaginal  disc.  Ommatidial  assembly  does  not 
occur  synchronously  throughout  the  disc,  but  in- 
stead begins  at  the  posterior  edge  and  progresses 
anteriorly.  Eye  discs  removed  from  larvae  just 
prior  to  pupariation  show  a  smoothly  graded  se- 
ries of  ommatidia  at  different  stages  of  develop- 
ment, covering  just  over  half  of  the  disc.  Examina- 
tion of  individual  cells  in  the  forming  ommatidia 
has  shown  that  the  photoreceptors  differentiate 
in  a  fixed  sequence,  beginning  with  the  central 
R8  photoreceptor  and  proceeding  pairwise  with 
R2  and  R5,  R3  and  R4,  Rl  and  R6,  and  finally  R7. 

The  fate  of  a  cell  within  a  developing  omma- 
tidium appears  to  be  governed  by  the  specific 
combination  of  signals  received  by  that  cell  from 
its  immediate  neighbors.  We  would  like  to  un- 
derstand how  such  signals  are  generated,  re- 
ceived, and  interpreted.  Our  approach  has  been 
to  study  mutations  that  specifically  disrupt  these 
processes,  as  illustrated  by  our  studies  of  the 
sevenless  gene. 

The  sevenless  gene  is  essential  for  the  develop- 
ment of  a  single  type  of  photoreceptor  cell.  In  the 
absence  of  proper  sevenless  function,  cells  that 
would  normally  become  R7  photoreceptors  be- 
come instead  nonneuronal  cells.  Previous  mor- 
phological and  genetic  analysis  has  indicated  that 
the  product  of  the  sevenless  gene  is  involved 
in  reading  or  interpreting  the  positional  informa- 
tion that  specifies  this  particular  developmental 
pathway.  We  have  isolated  and  characterized  the 
sevenless  gene.  Our  data  indicate  that  sevenless 
encodes  a  transmembrane  protein  with  a  tyrosine 
kinase  domain.  The  structural  analogies  between 
the  sevenless  protein  and  certain  hormone  recep- 
tors suggest  that  developmental  pathway  selec- 
tions dependent  on  cell-cell  interactions  may 
involve  molecular  mechanisms  similar  to  physio- 
logical or  developmental  changes  induced  by 
long-range  difi'usible  factors. 

To  investigate  the  role  of  the  sevenless  protein 
in  R7  development,  we  have  examined  the  pat- 
tern of  sevenless  expression  in  the  developing 
retina  and  have  studied  the  effects  of  experimen- 
tally altering  this  pattern.  By  transiently  express- 
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ing  sevenless  protein,  we  have  shown  that  there 
is  only  a  brief  period  during  eye  development 
when  the  sevenless  protein  is  required  for  the 
formation  of  the  R7  photoreceptor.  Our  results 
are  consistent  with  the  proposal  that  sevenless 
directly  reads  positional  information  required  to 
specify  the  R7  developmental  pathway. 

A  major  current  goal  is  to  determine  how  acti- 
vation of  the  sevenless  tyrosine  kinase  instructs  a 
ceil  to  become  an  R7  photoreceptor.  Defining 
the  biologically  relevant  substrates  of  tyrosine  ki- 
nase receptors  has  been  a  long-standing  and  diffi- 
cult problem.  We  have  used  two  strategies  in  a 
genetic  approach  toward  the  identification  of 
genes  whose  products  act  downstream  of  seven- 
less, including  those  that  might  be  direct  sub- 
strates for  the  sevenless  kinase. 

First,  we  have  looked  for  other  mutations  that 
give  the  same  phenotype  as  sevenless — transfor- 
mation of  the  R7  cell  into  a  nonneuronal  cell 
type.  In  this  way  we  isolated  the  seven  in-absen- 
tia  {sina)  gene.  Function  of  sina  is  required  only 


in  R7  for  correct  R7  cell  development.  The  sina 
protein,  which  has  a  potential  metal-binding  do- 
main, is  localized  in  the  nuclei  of  several  omma- 
tidial  precursor  cells,  including  R7,  and  sina  ex- 
pression in  R7  appears  before  R7  overtly  begins 
to  differentiate.  These  data  indicate  that  the  sina 
gene  product  is  necessary  at  a  stage  in  the  deter- 
mination of  R7  cell  fate  when  R7  receives  and 
interprets  developmental  signals  from  neighbor- 
ing cells,  and  possibly  acts  by  regulating  gene 
expression. 

Second,  we  utilized  a  crippled  sevenless  pro- 
tein, whose  activity  is  just  barely  adequate  to 
specify  R7  cell  development,  to  establish  a  highly 
sensitive  assay  for  other  components  of  this  signal 
transduction  pathway.  Using  this  assay  we  looked 
for  other  genes  in  which  a  50  percent  reduction 
of  the  level  of  their  protein  products  resulted  in  a 
failure  to  specify  R7  cells.  In  this  way  we  have 
identified  seven  genes  that  appear  to  encode 
products  that  act  to  interpret  the  signal  mediated 
by  the  sevenless  receptor. 
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UNDER  normal  circumstances,  blood  clots  oc- 
cur only  at  sites  of  vascular  injury,  and  un- 
necessary clots  are  dissolved  promptly.  Inappro- 
priate blood  clots  can  cause  devastating  illness, 
such  as  stroke  and  myocardial  infarction. 
Abnormal  thrombosis  also  complicates  many 
common  diseases,  including  certain  cancers  and 
infections. 

In  the  blood,  proteins  and  small  cells  called 
platelets  are  required  for  clot  formation.  The  en- 
dothelial cells  that  line  all  blood  vessels  and  cir- 
culating white  blood  cells  are  not,  however,  pas- 
sive bystanders  in  these  reactions,  but  actively 
promote  or  inhibit  clotting.  Compounds  that  are 
produced  during  inflammation  modulate  these 
cellular  activities. 

We  are  investigating  the  structure,  function, 
regulation,  and  evolution  of  proteins  that  control 
blood  coagulation.  Our  goal  is  to  understand  how 
these  opposing  tendencies — to  stimulate  or  to  in- 
hibit clotting — are  balanced  to  achieve  normal 
hemostasis  and  prevent  dangerous  thrombosis. 
These  studies  will  increase  our  knowledge  of  the 
interaction  between  blood  coagulation  and  in- 
flammation and  may  provide  a  foundation  for  the 
design  of  new  therapies  for  thrombotic  disorders. 

von  Willebrand  Factor 
and  von  Willebrand  Disease 

The  von  Willebrand  factor  (vWF)  is  a  blood 
protein  that  is  made  by  endothelial  cells  and  is 
required  for  normal  platelet  function.  vWF  also 
binds  to  and  stabilizes  blood  coagulation  factor 
VIII,  the  factor  that  is  deficient  in  classical  hemo- 
philia. The  structure  of  vWF  was  determined  indi- 
rectly by  cDNA  cloning:  vWF  contains  12  re- 
peated domains  that  belong  to  four  families  of 
ancestral  sequences.  Hereditary  deficiency  of 
vWF,  or  von  Willebrand  disease,  is  the  most  com- 
mon genetic  bleeding  disorder  of  humans.  Mild 
abnormalities  of  vWF  function  can  be  detected  in 
nearly  1  percent  of  the  population. 

We  determined  the  structure  of  the  human  vWF 
gene  and  also  of  a  related  pseudogene  that  has 
diverged  recently  from  the  authentic  vWF  gene. 
This  allowed  us  to  investigate  von  Willebrand 


disease  at  the  level  of  gene  sequence.  We  charac- 
terized deletions  of  the  vWF  gene  that  cause  se- 
vere von  Willebrand  disease  in  five  unrelated  pa- 
tients. These  patients  treat  transfused  vWF  as  a 
foreign  protein  and  make  inhibitory  antibodies  to 
it.  Deletions  in  the  vWF  gene  appear  to  predis- 
pose to  the  formation  of  such  antibodies. 

These  studies  were  extended  to  include  pa- 
tients with  variants  of  von  Willebrand  disease 
who  make  a  defective  vWF  molecule.  Severe 
bleeding  in  certain  variants  can  be  associated  ei- 
ther with  increased  or  decreased  binding  of  vWF 
to  a  specific  receptor  protein  on  platelets.  Among 
nine  such  unrelated  patients,  five  different  muta- 
tions were  characterized  in  a  small  vWF  domain 
that  appears  to  modulate  the  affinity  of  vWF  for 
this  platelet  receptor.  This  illustrates  the  impor- 
tance of  proper  balance  between  increased  and 
decreased  vWF  function  for  normal  hemostasis. 

A  newly  described  variant  of  von  Willebrand 
disease,  first  recognized  in  Normandy,  France,  is 
characterized  by  defective  binding  of  vWF  to 
blood  coagulation  factor  VIII.  In  such  patients 
factor  VIII  is  unstable,  and  this  results  in  a  second- 
ary factor  VIII  deficiency  that  mimics  classical 
hemophilia.  A  missense  mutation  was  identified 
in  the  factor  VIII  binding  site  of  vWF.  The  corre- 
sponding recombinant  vWF  exhibited  the  same 
defect  in  factor  VIII  binding  as  natural  vWF  Nor- 
mandy, confirming  that  this  mutation  causes  the 
disease.  The  genetic  defects  of  these  and  other 
such  patients  provide  insights  into  structure- 
function  relationships  of  vWF  and  may  suggest 
new  therapeutic  strategies  to  inhibit  or  augment 
vWF  function. 

Thrombomodulin  and  Thrombin 

Thrombomodulin  is  a  protein  of  the  endothe- 
lial cell  surface  that  binds  thrombin,  a  blood  clot- 
ting enzyme.  Because  of  its  effects  on  thrombin 
activity,  thrombomodulin  is  an  essential  natural 
anticoagulant.  Several  inflammatory  mediators 
decrease  the  expression  of  thrombomodulin  by 
endothelial  cells.  Understanding  this  process 
may  help  us  to  understand  the  abnormal  blood 
coagulation  that  accompanies  much  human  dis- 
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ease.  We  have  cloned  and  expressed  recombinant 
human  thrombomodulin  in  a  variety  of  cultured 
cell  lines,  and  the  recombinant  protein  appears 
to  be  structurally  and  functionally  normal.  These 
and  similar  cells  that  express  altered  forms  of 
thrombomodulin  are  being  used  to  investigate 
the^tructural  requirements  for  thrombomodulin 
cofactor  activity. 

Thrombin  is  a  protease  enzyme  that  is  required 
to  form  blood  clots  from  fibrinogen.  Thrombin 
also  can  inhibit  clotting  by  first  binding  to  throm- 
bomodulin and  then  digesting  certain  other 
blood  coagulation  factors.  Disruption  of  throm- 
bin's normal  balance  between  promoting  and  in- 
hibiting clotting  can,  in  principle,  cause  either 
bleeding  or  thrombosis.  We  have  constructed 
mutant  forms  of  thrombin  with  predominantly 
anti-  or  procoagulant  activities.  Such  mutant 
thrombins  help  to  define  the  structural  basis  for 
the  different  activities  of  thrombin.  In  addition, 
they  provide  reagents  to  test  the  physiological 
importance  of  specific  thrombin  activities,  and 
mutant  thrombins  may  also  be  found  to  have  ther- 
apeutic pro-  or  anticoagulant  properties. 

Tissue  Factor 

Tissue  factor,  a  cell  surface  protein  that  is 
found  on  many  cells  that  do  not  normally  contact 
the  blood,  is  the  most  important  physiological 
initiator  of  blood  coagulation.  When  blood  ves- 
sels are  damaged,  tissue  factor  is  exposed  to  and 
binds  to  blood  coagulation  factor  VII.  The  factor 
Vll-tissue  factor  complex  then  initiates  a  cascade 
of  reactions  that  cause  blood  to  clot.  Both  mono- 
cytes and  endothelial  cells  express  tissue  factor 


activity  in  response  to  many  stimuli,  and  this  con- 
tributes to  the  abnormal  thrombosis  that  accom- 
panies systemic  infections. 

We  isolated  cDNA  clones  for  human  tissue  fac- 
tor and  localized  the  gene  to  chromosome  1. 
These  clones  have  been  used  to  study  the  regula- 
tion of  tissue  factor  in  endothelial  cells.  Tumor 
necrosis  factor,  a  protein  made  during  inflamma- 
tion, causes  a  dramatic  but  transient  increase  in 
tissue  factor  activity.  This  appears  to  be  the  result 
of  activating  the  gene,  which  is  normally  silent  in 
endothelium.  A  second  level  of  control  can  be 
employed  to  amplify  this  response.  Tissue  factor 
mRNA  normally  is  degraded  very  rapidly,  but  it  is 
stabilized  by  some  agents  that  stimulate  endothe- 
lial cells,  and  this  may  contribute  to  the  induc- 
tion of  tissue  factor  activity  during  inflam- 
mation. We  are  currently  studying  the  structural 
basis  for  this  regulation  of  tissue  factor  mRNA 
degradation. 

The  systems  we  are  investigating  provide  abun- 
dant opportunities  to  answer  biological  ques- 
tions concerning  the  regulation  of  blood  coagula- 
tion and  to  approach  fundamental  questions 
related  to  signal  transduction,  gene  expression, 
and  protein  structure-function  relationships. 
These  studies  may  illustrate  how  several  proteins 
can  be  coordinately  regulated  to  promote  blood 
clotting  reactions  on  the  vascular  endothelium 
during  inflammation.  We  will  continue  to  ex- 
plore the  mechanisms  by  which  vWF,  thrombo- 
modulin, thrombin,  and  tissue  factor  are  regu- 
lated and  will  extend  this  work  to  other 
endothelial  cell  proteins  that  can  promote  or  in- 
hibit thrombosis. 
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DURING  the  development  of  higher  organ- 
isms, a  fertilized  egg  undergoes  many  divi- 
sions to  produce  a  multicellular  body  with  char- 
acteristic segments.  The  basic  pattern  of  the  body 
is  invariant  from  generation  to  generation  and  is 
dictated  by  a  genetic  blueprint  within  the  organ- 
ism's own  genome.  My  group  is  interested  in 
learning  about  the  genetic  blueprint  of  the  fruit 
fly  Drosophila  melanogaster. 

The  fruit  fly's  body  is  made  up  of  several  fused 
segments  in  the  head,  three  thoracic  segments 
with  wings  and  legs,  and  ten  abdominal  seg- 
ments. The  unique  characteristics  of  each  seg- 
ment are  determined  by  the  activities  of  the  genes 
called  homeotic.  When  these  genes  malfunction, 
a  body  segment  or  group  of  segments  transforms 
to  take  on  characteristics  of  another  segment.  For 
example,  some  mutations  in  the  homeotic  gene 
Ultrabithorax  cause  the  transformation  of  a  non- 
winged  segment  into  one  with  wings,  leading  to  a 
four-winged  instead  of  the  normal  two-winged 
fly.  Thus  homeotic  genes  can  be  thought  of  as 
master  regulators  that  trigger  the  genetic  circuits 
necessary  for  the  normal  body  pattern.  Our  goal  is 
to  understand  how  the  homeotic  genes  accom- 
plish this  task  at  the  molecular  level. 

Three  homeotic  genes — Ultrabithorax  {Ubx), 
abdominal- A  (abd-A),  and  Abdominal-B  (Abd- 
E) — are  responsible  for  determining  the  charac- 
teristics of  two  thoracic  and  eight  abdominal  seg- 
ments. They  are  located  in  a  chromosomal  region 
called  the  bithorax  complex  (BX-C) .  Flies  carry- 
ing mutations  in  any  one  of  these  BX-C  genes 
clearly  show  characteristic  transformations.  By 
noting  which  segments  are  transformed  in  these 
mutants,  we  can  infer  that  Ubx  is  active  in  two 
thoracic  and  eight  abdominal  segments,  abd-A  in 
the  second-through-eighth  abdominal  segments, 
and  Ahd-B  in  the  fifth-through-ninth  abdominal 
segments. 

To  learn  about  what  the  homeotic  gene  prod- 
ucts do,  we  have  focused  our  studies  on  abd-A 
and  Abd-B.  Since  each  of  the  abdominal  segments 
shows  unique  characteristics,  it  seemed  possible 
that  these  genes  encode  multiple  protein  prod- 
ucts that  are  each  responsible  for  determining  the 


identity  of  a  particular  segment.  Contrary  to  this 
expectation,  we  have  found  that  abd-A  and  Abd- 
B  encode  one  and  two  proteins,  respectively. 
Each  of  these  proteins  contains  a  stretch  of  almost 
identical  amino  acid  sequence,  known  as  the  ho- 
meodomain.  This  sequence  is  found  in  the  pro- 
teins from  many  organisms,  including  humans. 

As  has  been  shown  for  other  homeodomain- 
containing  proteins,  we  have  demonstrated  that 
abd-A  and  Abd-B  proteins  bind  to  DNA.  Further- 
more, we  have  shown  that  the  abd-A  protein  can 
repress  the  transcription  of  other  genes  in  cul- 
tured Drosophila  cells.  It  seems,  therefore,  that 
the  abd-A  and  Abd-B  proteins  accomplish  their 
tasks  by  regulating  the  transcription  of  other 
genes,  which  in  turn  contribute  to  unique  char- 
acteristics of  each  segment. 

We  are  now  extending  our  work  to  demon- 
strate that  the  homeotic  proteins  regulate  the 
transcription  process  in  developing  embryos  as 
well  as  in  tissue  culture  cells.  For  this  purpose  we 
are  studying  the  interaction  of  abd-A  with  a  tran- 
scription start  site,  or  promoter,  for  another  ho- 
meotic gene,  Antennapedia  (Antp).  Genetic  evi- 
dence suggests  that  Antp  expression  in  wild-type 
embryos  is  suppressed  in  the  first  abdominal  seg- 
ment by  Ubx  and  in  the  second-through-seventh 
abdominal  segments  by  abd-A.  From  the  genetic 
evidence,  however,  it  is  not  known  if  this  sup- 
pression is  direct  or  indirect.  We  have  now 
shown  that  purified  Ubx  and  abd-A  proteins  bind 
to  a  number  of  sites  near  the  promoter  of  the  Antp 
gene. 

To  test  if  these  in  vitro  binding  sites  are  indeed 
utilized  for  repression  of  Antp  in  embryos,  we 
have  mutagenized  each  binding  site,  rendering  it 
incapable  of  binding  the  homeotic  proteins. 
These  altered  DNA  sequences  are  being  intro- 
duced into  the  genome  of  the  fly  to  see  if  the 
normally  observed  repression  of  Antp  by  Ubx  or 
abd-A  protein  is  abolished.  If  so,  this  would  pro- 
vide the  first  conclusive  evidence  that  homeotic 
proteins  act  as  transcriptional  regulators  in  devel- 
oping embryos. 

We  are  also  interested  in  knowing  how  the  ho- 
meotic genes,  with  a  relatively  small  number  of 
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encoded  proteins,  can  specify  many  unique  body 
segments.  One  answer  lies  in  the  fact  that  these 
genes  are  expressed  in  different  but  overlapping 
sets  of  segments,  leading  to  different  combina- 
tions of  homeotic  proteins  in  the  various  seg- 
ments. For  example,  of  the  three  homeotic  pro- 
teins from  the  BX-C,  only  Ubx  proiein  is  detected 
in  the  second  and  third  thoracic  segments;  both 
Ubx  and  abd-A  proteins  are  present  in  the  sec- 
ond-through-fourth abdominal  segments;  and  all 
three  BX-C  proteins  are  present  in  the  fifth- 
through-eighth  abdominal  segments.  These  three 
combinations  would  of  course  define  only  three 
segmental  identities  if  all  cells  within  segments 
were  expressing  the  same  combination  of  the  ho- 
meotic genes. 

In  fact,  cells  within  a  segment  do  not  express 
the  same  combination.  We  find,  for  example,  that 
some  cells  within  the  fifth  abdominal  segment 
express  abd-A  protein  exclusively  or  predomi- 
nantly; other  cells  express  Ubx  protein;  and  yet 
other  cells,  Abd-B  protein.  In  the  sixth  abdomi- 
nal segment,  Ubx  and  abd-A  expression  appears 
similar  to  that  of  the  fifth  abdominal  segment,  but 
Abd-B  expression  is  different  from  that  of  the  ad- 
jacent segment.  This  kind  of  "mosaic"  expres- 
sion can,  in  theory,  specify  an  unlimited  number 
of  segmental  identities.  We  have  therefore  asked 
the  question.  Is  the  mosaic  expression  necessary 
for  correct  specification  of  segment  identities? 

To  answer  this,  we  constructed  genetically  al- 
tered flies  in  which  we  could  express  Abd-B  pro- 


tein at  will.  When  Abd-B  protein  is  expressed  in 
all  cells,  abolishing  the  normal  mosaic  pattern  of 
Abd-B  expression,  thoracic  and  abdominal  seg- 
ments exhibit  characteristics  of  the  eighth  abdom- 
inal segment.  Surprisingly,  we  do  not  observe 
completely  nonsensical  identities,  as  might  have 
been  expected  from  this  abnormal  situation.  One 
reason  for  this  result  could  be  that  the  eighth  seg- 
ment identity  requires  only  Abd-B  protein  (al- 
though Ubx  and  abd-A  proteins  are  present  at 
low  levels)  and  that  the  high  level  of  Abd-B  pro- 
tein in  other  parts  of  the  body  is  sufficient  to  sup- 
press the  effects  of  Ubx  and  abd-A,  leading  to  a 
repetition  of  the  eighth  abdominal  segment  iden- 
tities. This  suppression  of  Ubx  and  abd-A  activi- 
ties could  be  due  to  repression  of  Ubx  and  abd-A 
transcription  by  Abd-B,  or  alternatively,  the  Abd- 
B  protein  could  simply  override  the  effect  of  Ubx 
or  abd-A  protein  in  the  same  cell. 

To  distinguish  these  possibilities,  we  asked 
what  segmental  identity  would  result  if  we  engi- 
neered the  fly  so  that  both  Ubx  and  Abd-B  pro- 
teins are  expressed  in  all  cells.  Our  preliminary 
results  indicate  that  Ubx  protein  exerts  its  effect 
over  Abd-B  protein  in  anterior  body  segments, 
while  in  posterior  segments  the  reverse  applies: 
the  Abd-B  protein  predominates.  Since  the  level 
of  Ubx  or  Abd-B  proteins  should  be  the  same  in 
all  cells,  these  results  suggest  that  the  response 
depends  on  where  the  cell  is  located.  This  in  turn 
suggests  that  there  may  be  other  factors  in  addi- 
tion to  the  homeotic  genes  that  contribute  to  the 
correct  identities  of  body  segments. 


380 


Intracellular  Transport  of  Proteins 


Randy  W.  Schekman,  Ph.D. — Investigator 

Dr.  Schekman  is  also  Professor  of  Biochemistry  and  Molecular  Biology  at  the  University  of  California, 
Berkeley,  and  Adjunct  Professor  of  Biochemistry  and  Biophysics  at  the  University  of  California,  San  Fran- 
cisco. As  a  graduate  student,  he  studied  the  enzymology  of  DNA  replication  with  Arthur  Kornberg  at 
Stanford  University.  His  current  interest  in  cellular  membranes  developed  during  a  postdoctoral  period 
with  S.  J.  Singer  at  the  University  of  California,  San  Diego.  At  Berkeley,  he  developed  a  genetic  approach 
to  the  study  of  eukaryotic  membrane  traffic. 


RESEARCH  in  our  laboratory  is  devoted  to  a 
molecular  description  of  two  processes: 
polypeptide  translocation  from  the  cytosol  into 
the  endoplasmic  reticulum  (ER)  and  vesicular 
transport  among  organelles  of  the  secretory 
pathway. 

Genetic  Dissection  of  the  Secretory  Process 

A  genetic  approach  to  the  study  of  protein 
transport  in  eukaryotes  involved  the  isolation  of 
conditional  mutants.  We  isolated  a  series  of  se- 
cretory {sec)  mutants  in  the  yeast  Saccharo- 
myces  cerevisiae  that  are  temperature-sensitive 
for  cell  surface  growth,  division,  and  secretion. 
Most  of  the  mutants  accumulate  secretory  pro- 
teins in  an  intracellular  pool  that  can  be  released 
when  cells  are  returned  to  a  permissive  tempera- 
ture. Over  30  gene  products  have  been  impli- 
cated in  the  process  of  delivering  membrane  and 
secretory  proteins  to  the  cell  surface. 

A  combination  of  genetic  and  cytologic  evalua- 
tion of  the  sec  mutants  has  allowed  a  description 
of  the  secretory  pathway.  Protein  transport  in 
yeast  appears  to  be  mediated  by  the  same  organ- 
elles and  proteins  that  operate  in  mammalian 
cells.  Molecular  cloning  analysis  of  5.£'C  genes  has 
revealed  striking  structural  and  functional  homol- 
ogy with  corresponding  mammalian  genes. 

Protein  Translocation  Into 
the  Endoplasmic  Reticulum 

Protein  translocation  into  the  lumen  of  the  ER 
represents  the  initial  step  in  assembly  of  the  eu- 
karyotic cell  surface.  This  process  has  been  re- 
constituted with  detergent-solubilized  mem- 
brane proteins  and  purified  cytosolic  proteins, 
yet  the  mechanism  of  polypeptide  penetration  is 
unclear.  We  have  isolated  mutants  that  are  defec- 
tive in  translocation,  using  a  genetic  selection 
that  requires  secretory  polypeptides  to  be  re- 
tained in  the  cytosol. 

Four  SEC  genes  have  been  identified  whose 
gene  products  are  required  for  the  translocation 
of  a  wide  variety  of  secretory  and  membrane  pro- 
teins. Three  of  the  Sec  proteins  (Sec61p,  -62p, 
and  -63p)  are  found  in  a  complex  that  may  be 


isolated  from  detergent-solubilized  membranes. 
The  complex  includes  two  other  polypeptides  of 
unknown  structure.  This  complex  may  catalyze 
the  translocation  event  or  create  a  pore  in  the  ER 
membrane  through  which  secretory  polypep- 
tides are  transmitted. 

At  least  one  of  the  proteins  in  the  translocation 
complex  forms  a  direct  contact  with  the  nascent 
secretory  polypeptide.  A  protein  conjugate  con- 
sisting of  the  yeast  secretory  protein,  a-factor 
precursor,  linked  chemically  to  avidin,  engages 
but  jams  the  translocation  apparatus.  Treatment 
of  the  jammed  complex  with  a  cleavable  cross- 
linking  reagent  generates  a  covalent  connection 
between  a-factor  precursor  and  Sec61p.  The  re- 
quirements for  the  production  of  this  interme- 
diate suggest  that  the  contact  between  Sec6lp  and 
the  secretory  polypeptide  follows  an  earlier 
event  in  which  a-factor  first  binds  to  a  receptor 
on  the  cytosolic  surface  of  the  ER  membrane. 

A  fourth  gene,  SEC65,  required  for  transloca- 
tion, resembles  the  19-kDa  subunit  of  the  mam- 
malian signal  recognition  particle  (SRP).  sec65 
mutants  are  defective  in  the  translocation  of  mole- 
cules that  are  inserted  cotranslationally,  such  as 
yeast  invertase,  or  post-translationally,  such  as  a- 
factor  precursor.  This  implies  that  at  least  part  of 
a  yeast  SRP  panicipates  in  the  translocation  of 
molecules  uncoupled  from  ongoing  protein 
synthesis. 

Two  forms  of  the  70-kDa  heat-shock  protein 
family  (heat-shock  cognate,  orHscVO)  also  partic- 
ipate in  the  translocation  event.  Depletion  of  cy- 
tosolic Hsc70  by  genetic  manipulation  results  in 
the  accumulation  of  certain  secretory  and  mito- 
chondrial precursor  proteins.  A  direct  participa- 
tion of  these  molecules  in  the  translocation  event 
was  established  by  reconstitution  of  Hsc70- 
dependent  in  vitro  a-factor  precursor  assembly 
into  yeast  ER  vesicles.  Cytosolic  Hsc70  may  asso- 
ciate with  translocation  precursor  polypeptides 
and  prevent  them  from  folding  into  unfavorable 
structures  or  complexes. 

A  luminal  Hsc70,  homologous  to  mammalian 
BiP,  was  shown  by  Joe  Vogel  and  Mark  Rose  at 
Princeton  also  to  be  involved  in  translocation. 
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Luminal  Hsc70  may  exert  its  influence  through 
interaction  with  the  integral  membrane  complex 
of  Sec  proteins.  Sec63p,  which  is  a  member  of 
this  complex,  has  a  luminal  domain  with  se- 
quence homology  to  a  bacterial  heat-shock  pro- 
tein known  to  interact  with  the  Escherichia  coli 
homologue  of  Hsc70.  Luminal  Hsc70  may  con- 
trol the  assembly  and  disassembly  of  the  translo- 
cation complex  by  binding  to  Sec63p. 

Vesicle  Transport  Early  in  the 
Secretory  Pathway 

Subsequent  stages  in  the  secretory  pathway  in- 
volve protein  sorting  and  transport  from  the  ER  to 
the  Golgi  apparatus  and  from  there  to  the  cell 
surface.  Genes  required  for  each  of  these  steps 
are  being  evaluated  by  molecular  cloning  and  by 
development  of  cell-free  reactions  that  measure 
individual  steps  in  the  transport  process.  An  assay 
that  depends  on  Sec  proteins  has  been  reconsti- 
tuted in  vitro.  Yeast  a-factor  precursor  is  translo- 
cated into  the  ER  lumen  of  gently  lysed  yeast 
spheroplasts.  In  the  presence  of  soluble  proteins 
and  ATP,  the  precursor  is  transferred  to  the  Golgi 
apparatus.  This  system  allows  the  purification 
and  functional  characterization  of  Sec  proteins. 

Transfer  of  secretory  proteins  from  the  ER  to 
the  Golgi  apparatus  is  mediated  by  small  vesicle 
carriers.  There  are  two  categories  of  sec  mutants 
defective  in  this  limb  of  the  pathway:  mutant 
cells  that  accumulate  ER  tubules  at  a  restrictive 
temperature  (class  I;  sec  12,  -13,  -16,  and  -23) 
and  mutant  cells  that  also  accumulate  several 
thousand  60-nm  vesicles  (class  II;  sec  17,  -18, 
and  -22).  Genetic  epistasis  tests  indicate  that 
class  I  genes  must  execute  their  function  prior  to 
class  II  genes.  This  implies  that  class  I  gene  prod- 
ucts participate  in  the  production  of  the  60-nm 
vesicles  that  are  consumed,  by  fusion  with  the 


Golgi  apparatus,  through  the  action  of  class  II 
gene  products.  Genetic  interactions  among 
members  of  class  I  genes  and  of  class  II  genes 
suggest  that  the  Sec  proteins  in  each  group  act  in 
a  complex,  or  at  least  in  a  concerted  manner,  to 
perform  their  respective  roles  in  vesicle  budding 
or  fusion. 

Transport  of  a-factor  precursor  in  vitro  is  me- 
diated by  diffusible  vesicles.  Transport  vesicles 
contain  core-glycosylated  precursor  and  are  phys- 
ically separable  from  donor  ER  and  target  Golgi 
membranes.  Budding  of  vesicles  from  the  ER  re- 
quires a  crude  cytosol  fraction,  ATP,  Secl2p, 
Sec23p,  and  GTP.  Fusion  of  the  vesicles  to  the 
Golgi  compartment  is  measured  by  the  conver- 
sion of  core-glycosylated  precursor  to  a  more 
highly  glycosylated  form.  Enriched  transport  vesi- 
cles target  to  the  Golgi  compartment  and  then 
fuse  in  distinct  subreactions  that  require  cytosol, 
Ca^"^,  ATP,  and  only  a  subset  of  Sec  proteins.  Addi- 
tional proteins  in  the  cytosol  fraction  that  facili- 
tate vesicle  budding  are  being  purified. 

Vesicle  budding  in  the  secretory  pathway  was 
thought  to  be  mediated  by  clathrin,  a  structure- 
forming  protein  that  spontaneously  assembles 
into  coats  about  the  size  of  small  transport  vesi- 
cles. A  test  of  clathrin  function  was  performed  by 
molecular  cloning  and  disruption  of  the  chromo- 
somal genes  encoding  the  large  and  small  sub- 
units  of  the  protein.  Clathrin  is  important  but  not 
essential  for  cell  growth.  Deletion  mutations  that 
eliminate  clathrin  produce  sickly  cells  that  never- 
theless are  competent  for  most  aspects  of  protein 
transport.  A  specific  lesion  in  a-factor  precursor 
processing,  an  event  that  occurs  in  the  Golgi  ap- 
paratus, is  deficient  in  clathrin  mutant  cells.  Al- 
ternative structure-forming  proteins  that  serve 
more  essential  roles  in  secretion  may  be  discov- 
ered using  the  Sec  protein-dependent  vesicle- 
budding  reaction. 
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SEC  (secretory)  genes  regulate  the  flow  of  yeast  secretory  polypeptides  into  and  from  the  endo- 
plasmic reticulum  (ER)  and  through  the  Golgi  complex.  In  the  Golgi,  proteins  destined  for  secre- 
tion are  sorted  from  vacuolar  /lysosomal  proteins.  Mature  secretory  proteins  are  packaged  into 
vesicles  and  directed  to  the  inner  surface  of  the  growing  yeast  hud.  The  secretory  process  and  the 
SEC  gene  products  are  very  similar  in  yeast  and  mammalian  cells. 
Research  of  Randy  W.  Schekman. 
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Analysis  of  embryonic  rat  motor  neurons  demonstrates  that  these  cells  express  a  gene  encoding 
agrin.  The  agrin  molecule  is  secreted  by  the  nerve  cell  axons  as  they  contact  the  muscle  fibers.  The 
agrin  protein  plays  a  role  in  organizing  the  synapse  and  may  be  important  in  regeneration 
following  nerve  or  muscle  damage. 

From  Rupp,  F.,  Payan,  D.G.,  Magill-Solc,  C,  Cowan,  D.,  and  Scheller,  R.H.  1991.  Neuron 
6:811-823. 
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THE  nervous  system  is  composed  of  large  num- 
bers of  unique  cells  that  communicate  with 
each  other  via  the  regulated  release  of  chemical 
neurotransmitters.  These  synaptic  interactions 
govern  animal  behavior,  and  modulation  of  the 
efficacy  of  synaptic  communication  is  thought  to 
underlie  learning  and  memory.  We  are  interested 
in  understanding  the  molecular  mechanisms  of 
synaptic  formation  during  development  and  re- 
generation in  the  peripheral  nervous  system  after 
nerve  injury.  It  is  also  our  goal  to  contribute  to  an 
understanding  of  how  the  nerve  terminal  func- 
tions in  the  regulation  of  neurotransmitter 
release. 

Processing  and  Packaging  of  Neuropeptides 

Most  synapses  release  two  types  of  chemical 
messengers:  fast-acting  compounds,  or  classical 
transmitters,  and  slower-acting  messengers,  or 
neuromodulators.  Most  of  the  various  chemicals 
used  as  messengers  in  the  brain  are  neuropep- 
tides. These  molecules  are  synthesized  as  larger 
precursors  that  are  processed  to  smaller  active 
peptides.  One  interesting  neuropeptide  precur- 
sor is  expressed  in  an  identified  set  of  neurons, 
the  bag  cells,  in  the  marine  snail  Aplysia.  When 
these  neurons  fire,  they  release  a  set  of  neuropep- 
tides derived  from  a  single  precursor.  These  pep- 
tides act  on  neurons  and  peripheral  tissues  to  reg- 
ulate egg  laying,  a  stereotyped  behavior. 

Interestingly,  the  peptides  produced  on  the 
egg-laying  hormone  (ELH)  precursor  are  pack- 
aged in  two  types  of  vesicles.  These  vesicles  con- 
tain different  sets  of  peptides  and  are  differen- 
tially localized  within  the  neurons.  We  are 
interested  in  understanding  how  the  peptides  ini- 
tially synthesized  on  a  single  precursor  are  sorted 
into  different  vesicles.  We  are  also  interested  in 
understanding  the  physiological  significance  of 
the  differential  packaging  and  localization. 

When  the  ELH  precursor  is  transfected  into 
mammalian  pituitary  tumor  cells  (AtT-20  cells), 
ELH  is  packaged  with  the  endogenous  hormone, 
while  the  other  region  of  the  precursor  is  not 
stored  for  secretion.  Thus  the  AtT-20  cells,  like 
the  bag  cells,  differentially  route  the  two  regions 


of  the  ELH  prohormone.  As  a  result,  we  have  a  cell 
culture  system  for  investigating  the  mechanism 
of  differential  routing  through  the  secretory 
pathway.  We  have  generated  mutations  in  the 
first  processing  site,  which  is  thought  to  be  criti- 
cal for  the  sorting.  The  effects  of  these  mutations 
are  currently  being  analyzed. 

Mechanisms  of  Synaptic  Transmission 

When  the  action  potential  travels  down  the 
nerve  and  enters  a  release  zone,  changes  in  the 
membrane  potential  open  channels  that  allow  cal- 
cium to  enter  the  cell.  The  calcium  promotes 
transmitter  release  and  membrane  fusion.  The 
membrane  then  recycles,  forming  new  vesicles, 
which  are  then  replenished  with  chemical  trans- 
mitter. This  cycle  might  be  considered  the  funda- 
mental process  that  underlies  nervous  system 
function,  yet  little  is  known  about  the  molecular 
mechanisms  involved.  In  an  attempt  to  define  the 
molecular  mechanisms  that  regulate  membrane 
flow  in  the  nerve,  our  laboratory  and  others  have 
begun  to  characterize  the  proteins  associated 
with  the  critical  organelle  in  the  process,  the  syn- 
aptic vesicle.  For  these  studies,  we  use  mamma- 
lian brain  and  the  electric  organs  of  marine  rays. 
These  electric  organs  have  a  concentration  of  syn- 
apses approximately  100-fold  higher  than  that  of 
skeletal  muscle.  In  addition,  these  synapses  are 
homogeneous;  they  all  use  the  neurotransmitter 
acetylcholine. 

Purified  synaptic  vesicles  contain  about  20-50 
protein  bands  when  fractionated  on  acrylamide 
gels.  Genes  encoding  many  of  these  proteins  have 
been  characterized  and  the  primary  sequence  of 
the  molecules  determined.  Some  of  the  proteins 
show  interesting  homologies  to  other  molecules, 
and  others  are  turning  out  to  have  counterparts  in 
yeast  where  genetic  studies  of  membrane  traffick- 
ing have  provided  insight  into  the  secretory  pro- 
cess. It  has  also  become  apparent  that  many  of  the 
synaptic  vesicle  proteins  are  members  of  small 
gene  families.  Individual  members  of  these  gene 
families  are  differentially  expressed  through  the 
brain,  resulting  in  a  variety  of  combinations  of 
these  molecules  on  different  vesicles. 
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To  study  the  functions  of  these  proteins,  we  are 
using  DNA  transfection  and  antibody  techniques. 
Wild-type  and  mutated  forms  of  the  proteins  are 
being  transfected  into  AtT-20  and  PC  12  cells.  In 
addition,  some  of  the  proteins  have  homologues 
in  Drosophila.  We  have  constructed  vectors  con- 
taining wild-type  or  mutated  forms  of  the  pro- 
teins driven  by  heat-shock  promoters.  These  con- 
structs allow  us  to  transfer  the  gene  into  flies  and 
to  turn  on  expression  at  any  time  during  develop- 
ment and  in  the  adult. 

Synapse  Development 

Motor  neurons  in  the  spinal  cord  send  axons  to 
muscle  fibers  throughout  the  body.  When  axons 
contact  muscle  fibers,  a  highly  ordered  structure 
consisting  of  a  presynaptic  nerve  terminal  and  a 
postsynaptic  site  develops.  The  presynaptic  ter- 
minal comprises  an  active  zone  rich  in  synaptic 
vesicles  containing  neurotransmitter.  The  post- 
synaptic element  is  made  up  of  a  membrane  rich 
in  receptors  for  the  neurotransmitter  and  an  in- 
dentation in  the  membrane  called  the  junctional 
fold.  An  extracellular  matrix,  or  basal  lamina, 
surrounds  the  muscle  fiber,  including  the  space 
between  the  nerve  and  muscle. 

One  of  the  key  events  in  the  development  of 
the  neuromuscular  junction  is  the  redistribution 
of  neurotransmitter  receptors  that  occurs  when 


nerve  contacts  muscle.  Initially  receptors  for  the 
neurotransmitter,  in  this  case  acetylcholine,  are 
randomly  distributed  on  the  muscle  fiber.  When 
the  nerve  contacts  muscle,  neurotransmitter  re- 
ceptors aggregate  under  the  nerve  terminal  in  an 
appropriate  position  to  detect  the  chemicals  re- 
leased during  synaptic  transmission. 

Agrin,  a  component  of  the  extracellular  matrix, 
causes  acetylcholine  receptors  to  cluster  when 
added  to  muscle  fibers  growing  in  culture.  We 
have  isolated  recombinant  DNA  clones  encoding 
agrin  molecules  and,  through  an  analysis  of  the 
nucleotide  sequence,  have  defined  the  primary 
amino  acid  sequence  of  the  molecule.  When  we 
compare  the  predicted  agrin  sequence  with  the 
proteins  in  the  data  bank,  two  types  of  similarities 
are  revealed.  The  first  is  to  a  class  of  molecules 
that  inhibit  proteases,  and  the  second  to  a  protein 
motif  called  EGF  (epidermal  growth  factor)  re- 
peats. The  gene  is  expressed  in  embryonic  motor 
neurons  at  the  time  they  are  first  contacting  mus- 
cle fibers.  We  are  currently  attempting  to  express 
this  gene  in  cell  culture  to  study  the  roles  of  the 
agrin  molecule  in  the  formation  of  synapses. 

Since  agrin  is  stably  maintained  in  the  synaptic 
basal  lamina  after  nerve  or  muscle  damage,  it  may 
also  play  a  role  in  regeneration  events.  Under- 
standing the  mechanisms  of  peripheral  synapse 
regeneration  may  lead  to  procedures  that  could 
aid  in  central  nervous  system  regeneration. 
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BACTERIAL,  viral,  and  parasitic  infections  of 
the  gastrointestinal  tract  cause  an  estimated 
500  million  illnesses  and  5  million  deaths  each 
year  among  children  living  in  the  developing 
countries.  The  principal  mission  of  my  laboratory 
is  to  discover  how  these  infectious  agents  cause 
disease,  how  they  are  spread  between  individ- 
uals, and  how  this  information  can  lead  to  new 
tactics  for  prevention  and  treatment.  This  effort 
has  entailed  work  in  two  settings:  molecular  stud- 
ies in  our  laboratory  at  Stanford  University,  and 
epidemiological  investigations  at  a  field  labora- 
tory in  southeastern  Mexico,  where  infections  of 
this  kind  are  common. 

In  the  first  setting,  the  unit  of  analysis  is  the 
organism  itself.  We  try  to  determine  how  the  or- 
ganism attaches  to,  invades,  and  damages  human 
cells.  In  the  second  setting,  the  unit  of  analysis  is 
a  household  or  a  village.  In  this  context  we  seek 
to  understand  how  the  organism  is  transmitted 
within  the  community,  where  its  reservoirs  are, 
and  how  it  manages  to  survive  as  a  viable  entity  in 
the  real  world. 

From  studies  of  the  latter  kind,  we  have  learned 
that  the  survival  of  a  pathogenic  microorganism 
correlates  with  its  ability  to  occupy  different  habi- 
tats. Within  the  context  of  a  Third  World  village, 
these  habitats  include  contaminated  food  and 
well  water,  sewage,  and  the  gastrointestinal  tracts 
of  people  and  animals.  From  an  experimental 
point  of  view,  we  have  tried  in  the  last  year  to 
identify  molecular  and  genetic  events  that  tran- 
spire within  an  infectious  agent  as  it  enters  a  new 
habitat  and  adapts. 

The  results  of  these  studies  have  led  to  the  rec- 
ognition that  pathogenic  bacteria  seldom  attack 
the  gastrointestinal  tract  as  individual,  indepen- 
dent, well-separated  organisms.  Instead,  they  first 
seem  to  form  infectious  units  composed  of  multi- 
ple organisms  and  then  become  established  as  ad- 
herent colonies  on  the  surface  of  the  intestine. 
While  our  studies  have  focused  on  one  kind  of 
microbe — enteropathogenic  strains  of  Esche- 
richia coli  (EPEC) — we  have  discovered  that  this 
phenomenon  is  exhibited  by  many  other  bacte- 


rial species  that  selectively  infect  mucosal 
surfaces. 

Accordingly,  we  now  believe  that  the  capacity 
to  form  infectious  units  confers  a  selective  advan- 
tage on  bacterial  species  infecting  mucous  mem- 
branes, that  it  is  very  probably  an  essential  deter- 
minant of  virulence,  and  that  it  offers  a  logical 
point  of  attack  for  the  prevention  of  disease  by 
vaccination.  This  phenomenon  is  discussed  be- 
low as  it  pertains  to  EPEC  and  its  ability  to  cause 
gastroenteritis. 

Formation  of  Infectious  Units 

EPEC  are  a  common  cause  of  infantile  diarrhea 
in  Third  World  children.  When  small  bowel 
biopsies  of  infected  children  are  performed,  colo- 
nies of  EPEC  are  found  attached  to  the  underlying 
epithelia.  Similar  findings  are  also  seen  in  cul- 
tured epithelial  cells,  to  which  EPEC  readily  ad- 
here as  circumscribed  clusters  of  bacteria,  a  phe- 
nomenon known  as  localized  adherence.  From 
these  observations  we  know  that  the  bacteria 
within  the  colonies  not  only  interact  with  host 
cells  to  which  they  are  bound,  but  also  with  each 
other.  Scanning  electron  micrographs  revealed 
multiple,  ribbon-like  structures  coursing  be- 
tween the  bacteria,  and  fibers  also  appeared  to 
tether  individual  bacteria  to  the  epithelial  cell 
surface. 

Transmission  electron  micrographs  of  these  in- 
terbacterial  ribbons  showed  that  they  are  com- 
prised of  many  individual  filaments  that,  having 
laterally  aggregated  and  intertwined,  formed 
bundles  50-500  nm  wide  and  15-20  ^lm  long. 
These  tended  to  twist,  curl,  and  form  loops.  More- 
over, bundles  expressed  by  different  organisms 
appeared  to  form  three-dimensional  arrays  re- 
sembling a  fishing  net  or  a  meshwork,  in  which 
individual  bacteria  were  embedded. 

EPEC  did  not  express  these  bundles  while 
growing  on  most  laboratory  culture  media.  How- 
ever, when  transferred  to  chambers  containing 
cultured  epithelial  cells,  EPEC  evinced  bundles 
within  15  minutes,  and  by  30  minutes  colonies 
composed  of  tightly  adherent  bacteria  appeared 
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on  the  epithelial  cell  surfaces  as  on  the  intestinal 
tissue  of  an  infected  child.  Thus  the  expression  of 
bundles  by  EPEC  is  induced  by  physical  and 
chemical  variables  found  near  the  epithelial  cell 
surface,  where  the  bacteria  seem  to  colonize 
through  their  propensity  to  form  a  network  link- 
ing one  to  another.  We  have  elected  to  call  these 
structures  colony-forming  bundles,  or  CFB.  It 
now  seems  likely  that  the  colony  begins  to  form 
as  the  bacteria  descend  to  the  epithelial  cell  sur- 
face and  that  it  therefore  constitutes  the  EPEC 
infectious  unit. 

The  role  hypothesized  for  CFB  was  corrobo- 
rated by  two  additional  observations.  First,  dele- 
tion of  genes  from  EPEC  that  are  required  for  the 


production  of  CFB  yielded  mutant  bacteria  that 
were  unable  to  form  cell  surface  colonies.  Sec- 
ond, antibodies  to  CFB  substantially  reduced  the 
colony- forming  capacity  of  EPEC  strains.  This  lat- 
ter finding  suggests  the  use  of  a  CFB  vaccine  to 
prevent  diarrhea. 

We  determined  the  molecular  structure  of  the 
CFB  filaments.  Remarkable  similarities  were 
found  between  their  amino  acid  sequence  and 
the  sequences  of  fibrous  appendages  produced 
by  a  variety  of  pathogenic  bacteria  that  infect  mu- 
cous membranes,  including  the  agents  of  cholera 
and  gonorrhea.  Thus  the  CFB  appears  to  be  a 
shared  structural  innovation  by  bacterial  species 
that  live  on  mucous  membranes. 


Scanning  electron  micrograph  of  enteropathogenic  Escherichia  coli  (EPEC )  growing  on  an  epithe- 
lial cell  and  expressing  colony -forming  bundles. 
Research  of  Gary  K.  Schoolnik. 
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IN  the  initiation  of  an  immune  response  against 
a  foreign  protein  antigen,  the  antigen  is  taken 
up  by  a  speciahzed  antigen-presenting  cell  (APC) 
and  broken  down  into  small  peptide  fragments. 
These  fragments  are  then  thought  to  be  bound  by 
proteins  of  the  immune  system,  namely  major  his- 
tocompatibility class  II  molecules.  The  complex 
of  class  II  molecule  and  bound  peptide  fragment 
is  presumably  expressed  on  the  APC  surface, 
where  it  can  be  recognized  by  a  specific  T  lym- 
phocyte. The  recognition  event  activates  the  T 
lymphocyte,  which  then  initiates  the  immune 
response. 

Despite  a  large  body  of  circumstantial  evi- 
dence from  which  the  presence  of  class  II  mole- 
cule linked  to  a  foreign  peptide  on  the  APC  has 
been  inferred,  the  existence  of  this  complex  on 
the  surface  of  viable  cells  has  not  been  directly 
demonstrated. 

We  have  been  collaborating  with  Tom  and  Vi- 
vian Braciale  in  the  Department  of  Pathology  at 
Washington  University  School  of  Medicine  to  elu- 
cidate the  immune  response  against  influenza 
virus.  A  particular  human  T  cell  clone,  termed 
VI,  was  demonstrated  to  recognize  a  synthetic 
peptide,  HA  128-145  (corresponding  to  an  18- 
amino  acid  segment  of  the  influenza  viral  hemag- 
glutinin protein) ,  only  when  that  peptide  is  pre- 
sented on  an  APC  bearing  a  particular  class  II 
molecule,  HLA-DRwll. 

The  recognition  of  this  complex  by  the  T  cell 
clone  indicated  that  the  HA  128-145  peptide 
was  immunogenic.  We  therefore  used  the  HA  to 
immunize  a  rabbit  and  raise  an  antiserum,  anti- 
HA.  The  antiserum  recognizes  not  only  free  HA 
peptide  but  also  a  complex  of  the  peptide  bound 
to  HLA-DRwl  1  on  the  surface  of  an  APC. 

The  composition  of  this  complex  was  proved 
in  several  ways.  Anti-HA  did  not  recognize  a  mu- 
tant APC  that  expressed  no  class  II  molecules 
after  the  cells  were  exposed  to  HA,  demonstrating 
that  the  latter  did  not  bind  nonspecifically  to 
APCs.  Also,  anti-HA  did  not  bind  to  DRwl  1 -bear- 


ing APCs  that  had  not  been  exposed  to  HA,  nor  to 
APCs  bearing  several  DRs  that  had  been  exposed 
to  the  peptide. 

A  second  APC,  known  to  express  only  half  the 
amount  of  DRwl  1  as  did  the  first  APC  bearing 
DRwl  1 ,  when  exposed  to  HA  bound  only  half  the 
amount  of  anti-HA.  This  last  result  indicates  that 
binding  of  anti-HA  to  complexed  DRwll-HAs 
is  commensurate  with  the  level  of  DRwl  1 
expression. 

Further  proof  of  the  composition  of  the  com- 
plex was  achieved  biochemically.  The  HA  pep- 
tide was  tagged  with  radioactive  iodine  and  a 
photoreactive  group,  so  that  when  exposed  to  ul- 
traviolet light,  the  photoreactive  group  on  the 
peptide  would  form  a  covalent  bond  with  any 
molecule  to  which  the  peptide  had  bound. 
DRwl  1 -bearing  APCs  were  incubated  with  the 
modified  peptide,  exposed  to  ultraviolet  light, 
and  then  lysed.  The  cell  lysate  was  then  reacted 
with  either  the  anti-HA  antibody  or  a  monoclonal 
antibody  that  recognized  the  DRwl  1  molecules. 

Biochemical  analysis  of  the  molecules  reactive 
with  the  antibodies  demonstrated  by  several  crite- 
ria that  the  molecules  covalently  bound  to  the 
radioactive  iodinated  peptide  were  indeed  the 
DRwll  molecules.  Furthermore,  the  DRwl  1 
were  the  only  molecules  tagged  with  radioactive 
iodine,  demonstrating  that  no  other  molecules  of 
the  APC  were  capable  of  binding  peptide. 

A  series  of  experiments  was  done  with  meta- 
bolic inhibitors  to  ascertain  whether  the  HA 
128-145  peptide  followed  the  proposed  route 
of  intracellular  processing.  These  experiments 
demonstrated  that  metabolic  inhibition,  includ- 
ing poisoning  and  fixation  of  the  APC,  did  not 
decrease  the  number  of  DRwll -HA  complexes 
detected  by  anti-HA.  These  results  therefore  indi- 
cate that  HA  must  be  binding  directly  to  cell  sur- 
face DRwl  1  molecules.  It  is  of  interest,  however, 
that  a  temperature  of  4°C  totally  abrogated,  and 
18°C  significantly  inhibited,  the  HA  peptide's 
binding  to  DRwl  1  molecules.  Collectively  these 
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results  suggest  that  temperature  directly  affects 
the  ability  of  DRwl  1  molecules  to  bind  HA 
128-145. 

Because  these  results  suggested  that  the  pep- 
tide was  not  following  the  normal  antigen- 
processing  pathways,  we  turned  to  a  system 
where  we  knew  these  pathways  were  being  used. 
It  had  already  been  shown  that  the  T  cell  clone  VI 
could  recognize  naturally  processed  antigenic 
peptide  derived  from  an  intact  influenza  virion 
on  the  surface  of  a  DRwl  1 -bearing  APC  when  the 
APC  was  exposed  to  intact  virions.  We  therefore 
tested  whether  anti-HA  could  inhibit  this 
recognition. 

Anti-HA  indeed  inhibited  recognition  by  80 
percent.  This  result  established  that  anti-HA 
could  recognize  the  peptide  naturally  derived 


from  the  intact  virion.  In  addition,  we  have  re- 
cently synthesized  a  longer  peptide,  encompass- 
ing 45  amino  acids,  including  HA  128-145 
(which  we  believe  has  to  be  processed  by  a 
DRwl  1 -bearing  APC  to  be  recognized),  and  have 
demonstrated  that  anti-HA  will  bind  this 
complex. 

These  results  indicate  that  anti-HA  can  be  uti- 
lized to  identify  naturally  processed  hemaggluti- 
nin peptide,  as  well  as  the  compartments  within 
the  APC  where  processing  of  intact  antigen  to 
peptide  occurs.  These  studies  are  ongoing. 

Knowledge  of  such  antigen-processing  path- 
ways will  allow  us  to  manipulate  an  immune  re- 
sponse so  that  it  may  be  augmented  during  vacci- 
nations and  suppressed  in  immune  responses 
against  self  that  produce  autoimmune  disease. 
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STIMULATION  of  a  nerve  cell  can  result  in 
long-lasting  changes  in  its  biochemistry. 
These  are  produced  by  a  process  called  signal 
transduction.  The  stimulus,  delivered  by  either  a 
neurotransmitter  or  a  hormone,  acts  upon  spe- 
cific receptors  on  the  nerve  cell's  surface;  and 
these,  in  turn,  induce  the  formation  of  "second 
messengers"  inside  the  cell. 

Second  messengers  usually  work  by  binding  to 
proteins  that  might  be  called  secondary  effectors. 
In  many  instances  the  effectors  are  enzymes 
termed  protein  kinases.  When  activated,  protein 
kinases  catalyze  the  transfer  of  a  phosphoryl 
group  from  ATP  to  a  large  number  of  different 
cellular  proteins.  As  a  consequence  of  the  phos- 
phorylation reactions,  the  functions  of  these  cel- 
lular proteins  are  changed.  Thus  production  of 
the  second  messenger  alters  the  way  the  neuron 
behaves. 

How  long  the  nerve  cell  remains  changed  ini- 
tially depends  on  the  lifetime  of  the  second  mes- 
senger. The  secondary  effector  remains  activated 
as  long  as  there  is  enough  second  messenger 
within  the  cell.  After  brief  stimulation,  second 
messengers  are  formed  for  only  a  few  minutes, 
the  action  of  the  secondary  effector  is  brief,  and 
the  altered  state  of  the  neuron  persists  for  less 
than  an  hour.  If  the  stimulation  is  prolonged, 
however,  the  result  is  quite  different.  An  interest- 
ing and  important  phenomenon  occurs. 

In  such  cases,  the  activity  of  the  secondary  ef- 
fector enzyme  persists  even  after  formation  of  the 
second  messenger  stops.  One  might  say  fancifully 
that  the  enzyme  is  taught  to  behave  differently. 
Like  the  animal,  the  molecule  behaves  as  if  it  has 
learned.  This  phenomenon  underlies  the  en- 
hancement of  synaptic  transmission  believed  to 
be  the  basis  of  several  simple  kinds  of  learning 
and  memory. 

We  have  been  interested  in  the  mechanisms  by 
which  secondary  effector  enzymes  become  edu- 
cated. In  previous  years,  we  reported  progress  in 
understanding  the  mechanisms  for  three  of  the 
best  known  second  messenger  pathways.  After  dis- 
covering in  1985  the  phenomenon  for  the  Ca^V 


calmodulin-dependent  protein  kinase  (a  second- 
ary effector  enzyme  for  Ca^"^),  we  described 
similar  phenomena  for  the  cAMP-dependent  pro- 
tein kinase  (PKA)  and  for  protein  kinase  C.  This 
year  we  made  significant  advances  in  understand- 
ing how  PKA  is  made  to  persist  for  many  hours 
after  stimulation  ceases. 

For  all  of  these  protein  kinases  (which  are  dis- 
tantly related  in  evolution) ,  the  second  messen- 
ger activates  the  enzyme  by  releasing  an  inhibi- 
tion built  into  the  latter  molecule.  This  process  is 
reversible.  With  PKA  the  inactive  form  is  a  tetra- 
meric  complex  consisting  of  two  identical  regula- 
tory (R)  subunits  that  fit  precisely  together  with 
two  catalytic  (C)  subunits.  The  R  subunits,  upon 
binding  the  second  messenger,  cAMP,  release  the 
two  C  subunits  for  action. 

In  earlier  experiments  we  demonstrated  that 
the  ratio  of  R  to  C  changes  in  sensory  neurons  of 
the  marine  moUusk  Aplysia  californica  to  cause 
the  enhanced  protein  phosphorylation  that  oc- 
curs in  the  presynaptic  facilitation  of  sensory-to- 
motor  neuron  synapses  in  long-term  sensitization 
of  defensive  reflexes.  The  change  consists  of  a 
decrease  in  the  amount  of  R  subunit,  which  re- 
quires the  synthesis  of  new  protein. 

What  is  the  mechanism  for  producing  the 
change  in  the  R  to  C  ratio  observed  in  sensory 
neurons  of  Aplysia  trained  behaviorally  to  ex- 
hibit the  long-term  sensitization?  To  answer  this 
question,  we  cloned  cDNAs  encoding  both  R  and 
C  subunits  and  therefore  can  determine  the 
amounts  of  their  messenger  RNAs.  In  addition,  we 
can  raise  antibodies  to  measure  the  amounts  of 
subunit  protein  in  sensory  cells  before,  during, 
and  after  training.  We  find  that  the  PKA  changes 
after  long-term  training  because  R  subunits  in  the 
neurons  disappear.  Furthermore,  there  is  no 
change  in  the  amount  of  C  subunits,  nor  in  R  or  C 
subunit  messenger  RNAs. 

Disappearance  of  proteins  is  commonly  caused 
by  enzymes  that  digest  them.  In  the  sequence  of 
amino  acids  of  the  R  subunits  inferred  from  the 
cDNA  cloning  experiments,  there  is  an  amino- 
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terminal  domain  that  is  highly  enriched  in  pro- 
line, glutamine,  serine,  and  threonine  (PEST)  res- 
idues. This  sort  of  domain  has  been  recognized  as 
a  determinant  of  proteins  that  turn  over  rapidly. 
But  digestion  of  proteins  does  not  normally  re- 
quire the  synthesis  of  new  protein.  Why  then 
does  the  degradation  of  R  subunits  depend  on 
new  protein  synthesis? 

Preliminary  evidence  suggests  that  the  proteo- 
lytic mechanism  responsible  for  the  disappear- 
ance of  R  subunits  under  conditions  of  training 
depends  upon  ubiquitination,  a  complex  degra- 
dative  process  involving  enzyme-catalyzed  cou- 
pling of  a  small  protein,  ubiquitin,  to  the  protein 
to  be  destroyed. 

Since  ubiquitin  and  the  enzymes  required  for 
coupling  are  present  in  the  neuron  whether  or 
not  it  has  been  stimulated  for  long-term  facilita- 
tion, how  is  the  degradation  of  R  subunits  en- 


hanced in  stimulated  neurons?  Our  working  hy- 
pothesis is  that  the  new  protein  synthesized  as  a 
consequence  of  long-term  stimulation  is  a  factor 
that  targets  R  subunits  for  coupling  with 
ubiquitin. 

A  similar  hypothesis  has  recently  been  ad- 
vanced for  the  periodic  degradation  of  cyclin,  a 
protein  that  regulates  a  special  protein  kinase 
crucially  involved  in  cell  division.  In  early  frog 
embryos,  cyclin  is  rapidly  proteolyzed  every  30 
minutes  during  mitosis.  We  find  that  R  subunits 
of  PKA  in  early  Aplysia  embryos  also  are  degraded 
through  ubiquitin-mediated  proteolysis.  In  con- 
trast, the  R  subunit  of  mature  muscle  is  quite 
stable.  These  observations  that  R  subunits  are 
readily  down-regulated  in  growing  cells  and  in 
nervous  tissue  provide  suppon  for  the  popular 
idea  that  both  learning  and  development  employ 
similar  molecular  mechanisms. 
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Superimposed  images  of  chromosomes,  microtubules,  and  the  nuclear  envelope.  Optical  sections 
of  embryos  triply  stained  for  these  entities  were  obtained  at  0.2-^im  focus  intervals.  Selected 
optical  sections  are  displayed,  each  separated  by  1  nm  along  the  focus  direction  from  the  interior 
(lower  left)  to  the  surface  ( upper  right)  of  the  embryo.  Chromosomes,  microtubules,  and  nuclear 
envelope  are  displayed  in  blue,  green,  and  red,  respectively. 

Research  offohn  W.  Sedat,  David  A.  Agard,  and  Yashui  Hiraoka. 
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THE  three-dimensional  structure  of  chromo- 
somes, both  in  the  nucleus  and  during  cell 
division,  remains  a  major  unsolved  problem  in 
biology.  Our  laboratory,  in  collaboration  with 
David  Agard  (HHMI,  University  of  California,  San 
Francisco),  has  investigated  chromosome  struc- 
ture from  the  perspective  of  a  series  of  interlock- 
ing questions.  1)  What  are  the  levels  of  chromo- 
some architecture  in  the  intact  diploid  nucleus? 
How  does  the  three-dimensional  structure 
change  as  a  function  of  development,  or  progres- 
sion through  the  cell  cycle?  2)  What  is  the  chro- 
mosome architecture  of  a  given  gene  in  the  nu- 
cleus? Do  the  structural  attributes  reflect  the 
detailed  molecular  information?  3)  How  do  in- 
terphase chromosomes  condense  to  form  the  in- 
tricate mitotic  structure  at  cell  division?  The  fruit 
fly  Drosophila  melanogaster,  well  known  for  its 
genetics,  development,  and  biochemistry,  was 
chosen  as  a  model  biological  system.  Although 
the  initial  emphasis  is  structural,  molecular  ge- 
netics and  biochemistry  provide  functional 
correlations. 

Although  the  UCSF  three-dimensional  optical 
microscope  has  been  developed  to  the  point  that 
data  at  several  wavelengths  can  be  routinely  col- 
lected, even  as  a  function  of  time  (four-dimen- 
sional microscopy),  and  can  be  used  without 
computer  experience,  we  continue  to  perfect 
and  enhance  the  instrumentation.  We  have  im- 
proved the  time  resolution  for  data  collection  in 
the  four-dimensional  image  work,  permitting 
analysis  of  much  information  on  biological  struc- 
tures. We  continue  to  write  software,  with  exten- 
sive mathematical  analysis,  to  correct  systematic 
image  acquisition  problems,  to  display  results  in 
a  variety  of  formats,  and  to  model  and  analyze, 
often  quantitatively,  the  intricate  three-  or  four- 
dimensional  data. 

Four-Dimensional  Optical  Microscopy 

We  have  continued  to  study  the  structure  of  the 
cell  nucleus  in  living  Drosophila  embryos.  Nu- 
clei were  labeled  by  microinjection  of  fluores- 
cent histones  or  other  abundant  chromosomal 
proteins.  Nuclear  and  chromosome  structures 


were  followed  throughout  the  cell  cycle  during 
embryonic  development.  In  addition  to  discern- 
ing structural  changes,  we  could  now  infer 
function. 

This  approach  has  shown  several  discrete  chro- 
mosome sites  attached  to  the  nuclear  envelope. 
These  sites  are  the  last  to  decondense  during  telo- 
phase (the  stage  in  the  nuclear  division  cycle  that 
follows  chromosome  separation).  Similar  sites 
are  observed  at  the  beginning  of  prophase  (the 
stage  of  nuclear  division  involving  chromosome 
condensation) ,  with  condensation  proceeding  bi- 
directionally  away  from  these  centers. 

During  prophase,  a  remarkable  wave  of  chro- 
mosome compaction  started  at  the  centromeres 
(chromosome  regions  at  which  mitotic  spindles 
attach)  and  proceeded  toward  the  opposite  nu- 
cleus pole  that  contained  the  telomeres  (ends  of 
chromosomes) .  We  could  also  determine  that  the 
site  for  the  start  of  the  compaction  wave  coin- 
cided temporally  and  spatially  with  the  initial 
breakdown  of  the  nuclear  envelope  as  well  as  the 
site  of  the  mitotic  spindle  formation,  suggesting 
that  there  is  a  high  degree  of  temporal-spatial 
control  and  organization  in  the  mitotic  process. 

A  major  conclusion  is  that  the  structures  in 
real-time  living  nuclei  were  very  similar,  if  not 
identical,  to  the  three-dimensional  nuclear  struc- 
tures determined  in  our  ongoing  studies  at  higher 
resolution  in  fixed  embryos. 

Three-Dimensional  In  Situ  Hybridization 

We  have  analyzed  the  spatial  arrangement  of 
chromosomes  in  embryos  of  Drosophila.  One 
specific  biological  question  is  whether  homolo- 
gous chromosomes  (one  from  the  male  parent, 
the  other  from  the  female)  have  an  ordered  ar- 
rangement in  a  diploid  nucleus.  Our  previous  re- 
sults showed  that  homologous  chromosomes  are 
not  associated  from  prophase  to  anaphase  in  syn- 
cytial blastoderm  stage  embryos.  This  apparently 
contradicts  genetic  evidence  for  transvection, 
which  suggests  that  the  homologous  chromo- 
somes are  associated.  Thus  we  wished  to  test 
whether  homologous  chromosomes  are  asso- 
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ciated  in  interphase,  when  the  essential  processes 
of  gene  transcription  and  DNA  replication  occur. 

We  used  in  situ  hybridization  techniques  to 
determine  specific  chromosomal  regions  within 
a  nucleus.  The  location  of  the  hybridization  sig- 
nals is  revealed  by  labeling  these  embryos  with  a 
fluorescently  tagged  molecule.  High-resolution 
three-dimensional  optical  sectioning  microscopy 
of  such  embryos  reveals  the  location  of  the  nu- 
clear DNA  sequences. 

We  first  analyzed  the  nuclear  location  of  the 
histone  gene  cluster  that  lies  close  to  the  centro- 
meric  heterochromatin  on  chromosome  2L.  Our 
results,  using  a  histone  gene  probe,  revealed  that 
homologous  chromosomes  are  also  separated  in 
the  majority  of  interphase  nuclei  at  syncytial  blas- 
toderm-stage embryos  up  to  the  1 3th  nuclear  cy- 
cle. In  dramatic  contrast,  at  the  14th  nuclear  cy- 
cle when  cellularization  begins,  homologous 
chromosomes  are  associated  in  the  majority  of 
nuclei.  The  frequency  of  homologue  pairing 
reaches  the  maximum  (about  95  percent)  by  the 
time  of  gastrulation. 

Analysis  of  the  three-dimensional  location  of 
histone  hybridization  signals  showed  that  histone 
genes  are  located  on  the  nuclear  midline  at  the 
1 3th  nuclear  cycle  and  move  toward  the  centro- 
meric  cluster  on  the  apical  side  of  nuclei  at  the 
l4th  nuclear  cycle.  This  can  perhaps  be  ex- 
plained by  the  formation,  at  the  l4th  nuclear  cy- 
cle, of  heterochromatin,  which  process  may  be 
involved  in  initiating  the  pairing  of  homologous 
chromosomes.  Translocations  further  define  ho- 
mologue pairing,  with  strong  effects  on  homolo- 
gous chromosome  association. 

We  are  currently  using  a  wide  range  of  DNA 
probes  for  specific  chromosomal  sequences  to 
study  processes  of  homologue  association.  Our 
preliminary  results  suggest  that  frequency  and 
timing  of  association  vary  from  one  chromosomal 
locus  to  another.  For  example,  the  gene  en- 
grailed exhibits  homologue  association  at  the 
1 4th  nuclear  cycle  with  high  frequency,  but  the 
gene  Ubx  does  not.  A  generalized  picture  will  be 
obtained  once  a  larger  number  of  genetic  loci  are 
analyzed. 


A  Molecular  Dissection  of  the  Nuclear 
Periphery 

Recently  we  showed  that  the  lamin  proteins  of 
the  nuclear  envelope  (NE)  form  a  highly  discon- 
tinuous network  in  somatic  interphase  nuclei. 
Several  obvious  questions  arise.  First,  where  are 
the  other  known  components  of  the  nuclear  pe- 
riphery (pore  complexes,  chromatin)  relative  to 
this  network?  Second,  what,  if  anything,  occupies 
these  large,  lamin-empty  regions?  Third,  how  are 
these  structures  assembled  as  the  NE  reforms  dur- 
ing telophase? 

We  are  localizing  the  other  two  well-known 
components  of  the  nuclear  periphery  relative  to 
the  lamin  network,  using  monoclonal  antibodies 
directed  against  a  major  glycoprotein  component 
of  nuclear  pore  complexes  (GP-190)  and  the 
DNA-specific  stain  DAPl.  Chromatin  in  the  nu- 
clear periphery  displays  an  interesting  structural 
paradox  in  that  a  large  fraction  appears  to  be 
aligned  beneath  the  lamin  network,  but  with  very 
little  directly  contacting  lamins.  The  majority  of 
it  seems  to  be  at  a  distance  of  about  0.2  ^m.  This 
result  is  consistent  with  much  indirect  evidence 
of  a  strong  interaction  between  chromatin  and 
the  nuclear  lamina,  but  suggests  strongly  that  a 
direct  physical  contact  is  not  involved. 

We  are  injecting  lamins  and  lamin-specific 
monoclonal  antibody  Fab  fragments,  both  fluores- 
cently labeled,  into  early  Drosophila  embryos  to 
study  four-dimensional  lamin-NE  dynamics.  In 
these  experiments,  the  embryos  develop  nor- 
mally and  hatch  on  time.  We  observe  a  highly 
discontinuous  lamin  network  in  vivo,  with  inter- 
lamin  fiber  spacings  at  least  as  large  as  those  ob- 
served in  fixed  samples.  If,  however,  we  inject 
fluorescently  labeled  interphase  lamins,  a  very 
different  picture  results.  Arrested  nuclear  struc- 
tures leading  to  chromosomal/nuclear  aggregates 
are  seen.  These  studies  suggest  that  functional  as- 
says will  be  required  for  proper  interpretation  of 
the  biochemistry. 

This  general  methodology  has  a  number  of  po- 
tential applications  to  problems  of  cell  lineage, 
neural  architecture,  and  pattern  formation  in  de- 
velopment. We  are  pursuing  some  of  these  inter- 
ests in  collaboration  with  other  laboratories. 
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FAMILIAL  hypertrophic  cardiomyopathy 
(FHC)  is  a  heart  muscle  disorder  with  an  au- 
tosomal dominant  pattern  of  inheritance.  The 
disease  is  characterized  clinically  by  myocardial 
hypertrophy,  a  wide  spectrum  of  symptomatic  in- 
volvement, and  a  2-4  percent  annual  mortality 
rate  from  sudden  death,  which  can  occur  without 
warning.  Postmortem  examination  reveals  in- 
creased myocardial  mass  with  myocyte  and  myo- 
fibrillar disarray.  Diagnosis  is  based  on  typical 
clinical  features  and  the  two-dimensional  echo- 
cardiographic  demonstration  of  unexplained  left 
and/or  right  ventricular  hypertrophy.  In  the 
young,  diagnosis  is  often  complicated,  because 
hypertrophy  may  not  develop  until  after  adoles- 
cent growth  has  been  completed.  The  anatomical 
distribution  of  myocardial  hypertrophy  and  sever- 
ity of  symptoms  may  be  quite  variable,  even 
within  a  family. 

During  the  past  30  years,  the  cardiac  features  of 
FHC  have  been  extensively  reported,  but  the  eti- 
ology and  molecular  pathophysiology  have  re- 
mained speculative.  Last  year  we  reported  the  lo- 
calization of  a  gene  on  chromosome  1 4  that  can 
be  mutated  to  cause  FHC.  During  the  past  year  we 
have  demonstrated  that  this  is  the  cardiac  myosin 
heavy-chain  gene(s). 

Chromosomal  localization  of  the  disease  locus 
suggested  candidate  genes  that  might  be  responsi- 
ble for  FHC.  In  particular,  cardiac  myosin  heavy- 
chain  genes  (a  and  |8)  are  located  on  chromo- 
some 14  band  qll-13.  During  the  past  year  we 
defined  the  structure  of  these  genes  in  affected 


individuals  from  two  unrelated  families  (A  and 
B).  Affected  members  of  family  A  have  a  point 
mutation  in  exon  1 3  of  their  gene  that  changes 
an  arginine  residue  to  a  glutamine  residue.  Be- 
cause this  arginine  residue  is  highly  conserved 
during  evolution  and  because  no  normal  individ- 
uals have  this  mutation,  we  suggested  that  this 
point  mutation  is  responsible  for  the  disease  in 
affected  members  of  this  family. 

Affected  members  of  family  B  have  an  unusual 
cardiac  myosin  heavy-chain  gene  in  their  ge- 
nomes. These  individuals  have  a  third  cardiac 
myosin  heavy-chain  gene  on  one  chromosome. 
This  third  gene  is  a  hybrid  derived  from  exons 
1-27  of  the  a  gene,  and  the  remainder  of  the 
gene  is  derived  from  the  /3  gene.  This  hybrid  gene 
is  not  found  in  the  genomes  of  unaffected  individ- 
uals. From  these  studies  we  conclude  that  muta- 
tion of  these  genes  can  cause  FHC. 

We  have  also  demonstrated  that  FHC  is  a  geneti- 
cally heterogeneous  disorder,  by  studying  the  in- 
heritance of  FHC  in  two  other  unrelated  families 
(C  and  D).  In  these  families  the  disease  is  not 
genetically  linked  to  the  cardiac  myosin  heavy- 
chain  genes  on  chromosome  1 4 .  We  propose  that 
their  disease  is  due  to  defects  in  other  genes  that 
encode  products  that  either  affect  the  expression 
of  cardiac  myosin  heavy  chains  or  that  interact 
with  cardiac  myosin  heavy-chain  genes  to  pro- 
duce functional  cardiac  muscle.  We  hope  that 
these  studies  will  lead  to  a  better  understanding 
of  how  cardiac  muscle  functions  and  how  defects 
in  cardiac  muscle  proteins  lead  to  the  clinical 
symptoms  found  in  FHC  patients. 
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ALTHOUGH  there  has  been  an  explosion  of 
discoveries  over  the  last  several  decades 
concerning  the  structure  of  the  brain  at  the  cellu- 
lar and  molecular  levels,  v^^e  do  not  yet  under- 
stand how  the  nervous  system  enables  us  to  see 
and  hear,  to  learn  skills  and  remember  events,  to 
plan  actions  and  make  decisions.  The  long-range 
goal  of  this  laboratory  is  to  explain  how  neural 
systems  effect  these  complex  behaviors.  Our  gen- 
eral approach  is  to  use  what  we  know  about  the 
structure  and  function  of  identified  neurons  and 
neuronal  networks  to  construct  computational 
models  at  several  levels  of  investigation. 

At  the  biophysical  level,  the  computational 
mechanisms  are  based  on  chemical  and  electrical 
signals  within  and  between  neurons.  Among  the 
more  important  of  these  are  the  signals  at  syn- 
apses, which  carry  information  between  neurons. 
On  cortical  pyramidal  neurons,  most  of  the  syn- 
apses occur  on  thorn-like  structures  protruding 
from  the  dendritic  shaft,  called  dendritic  spines. 
The  function  of  these  spines  is  unknown,  but  is 
likely  to  be  important  because  they  are  found  on 
neurons  in  most  species. 

We  have  recently  completed  a  study  of  the  ef- 
fectiveness of  inhibitory  synapses  on  dendritic 
spines.  About  10  percent  of  the  spines  on  cortical 
pyramidal  neurons  have  both  excitatory  and  in- 
hibitory inputs.  It  had  been  thought  that  this  ar- 
rangement allowed  the  inhibitory  synapse  to 
"veto"  the  excitatory  one  when  they  were  simul- 
taneously activated.  In  our  computer  simulations 
of  these  events,  however,  we  have  found  that  the 
inhibitory  synapses  were  only  effective  at  reduc- 
ing the  magnitude  of  the  excitatory  postsyn- 
aptic potential  (EPSP)  under  very  restricted 
circumstances. 

The  reason  is  that  the  volume  of  a  spine  is  so 
small  that  even  tiny  conductance  changes  in  the 
membrane  allow  enough  ions  to  enter  and  leave 
the  spine  to  change  the  intracellular  concentra- 
tions significantly.  This  would  tend  to  reduce  the 
forces  driving  these  ions  across  the  membrane 
and  thus  reduce  the  dependent  synaptic  currents. 
For  our  simulations,  we  infer  that,  to  be  effec- 


tive, the  inhibitory  synapses  found  on  cortical 
spines  must  be  mediated  by  through  GABAg 
receptors. 

At  the  neuronal  level,  we  are  exploring  with 
computational  models  the  effects  of  mechanisms 
that  couple  the  synaptic  signals  in  the  dendrites 
to  the  spike-triggering  region  of  the  neuron, 
which  is  located  near  the  soma.  According  to  the 
traditional  view  of  dendritic  processing,  current 
injected  into  dendrites  from  synaptic  activity  is 
passively  conveyed  to  the  cell  body  by  the  den- 
drites' cable  properties.  In  recent  years  voltage- 
dependent  calcium  currents  have  been  identified 
in  dendrites  that  could  boost  the  coupling  be- 
tween synaptic  activity  and  spike  generation. 
This  is  particularly  interesting  because  an  in- 
crease in  the  excitability  of  the  neuron  often  ac- 
companies the  long-term  potentiation  (LTP)  of 
excitatory  synapses,  a  rapid  and  persistent  eleva- 
tion of  the  EPSP. 

Could  the  increase  in  excitability  seen  during 
LTP  result  from  a  change  in  voltage-dependent 
calcium  currents?  To  test  this  hypothesis,  we 
used  simulations  of  intradendritic  LTP  experi- 
ments. The  shape  of  our  simulated  neuron  was 
based  on  the  detailed  branching  pattern  of  a  hip- 
pocampal  neuron  that  had  been  injected  with  a 
dye  and  reconstructed  in  a  computer.  We  found 
that  a  small  addition  of  voltage-sensitive  calcium 
channels  to  the  dendritic  membrane  made  a  previ- 
ously subthreshold  input  suprathreshold,  with  no 
significant  change  in  the  EPSP. 

Our  most  recent  work  concerns  the  effect  of 
these  changes  in  dendritic  excitability  on  LTP 
specificity.  We  simulated  two  sets  of  synaptic  in- 
puts, one  stimulated  and  the  other  control.  Po- 
tentiation at  the  stimulated  input  had  little  effect 
on  the  control  input  if  the  latter  was  electrically 
closer  to  the  cell  body,  or  if  the  two  inputs 
were  segregated  on  different  primary  dendritic 
branches.  Specificity  is  degraded  if  the  control 
input  is  farther  away  than  the  stimulated.  These 
results  show  that  modulation  of  dendritic  excit- 
ability is  a  plausible  mechanism  for  EPSP-spike 
potentiation. 

We  are  testing  this  computational  model  of 
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EPSP-spike  potentiation  in  two  ways.  First,  we 
have  looked  for  experimental  conditions  that 
would  favor  EPSP-spike  potentiation  over  synap- 
tic potentiation.  Second,  we  are  undertaking  di- 
rect measurements  of  intracellular  free  calcium 
in  hippocampal  neurons.  The  properties  of  the 
hippocampal  network  and  its  capacity  for  infor- 
mation storage  depend  on  the  properties  of  the 
synapses  and  dendrites.  It  is  therefore  important 
to  determine  these  properties  before  models  of 
the  hippocampus  are  attempted. 

LTP  can  be  induced  at  some  synapses  by  simul- 
taneous postsynaptic  depolarization  and  binding 
of  glutamate  to  the  NMDA  (7V-methyl-D-aspartate) 
subtype  of  glutamate  receptor.  This  is  usually  ac- 
complished by  stimulating  the  synapses  at  a  high 
frequency.  Is  it  possible  to  induce  LTP  if  the  den- 
drite is  instead  depolarized  by  invasion  from  the 
cell  body  during  an  action  potential?  We  have 
tested  this  hypothesis  in  the  hippocampus  by 
pairing  antidromically  elicited  action  potentials 
with  low- frequency  synaptic  stimulation.  By  it- 
self, neither  the  antidromic  spikes  nor  the  synap- 
tic stimulation  was  sufficient  to  cause  LTP.  When 
these  two  stimuli  were  paired,  however,  there 
was  potentiation  of  the  population  spike,  al- 
though the  EPSP  did  not  seem  to  be  increased 
consistently.  This  may  be  due  to  a  specific  poten- 
tiation of  the  EPSP-spike  potentiation. 

The  increase  in  the  population  spike  upon  re- 
peated pairing  of  synaptic  and  antidromic  stimu- 
lation is  reversibly  blocked  by  the  application  of 
AP5,  which  is  an  antagonist  of  the  NMDA  recep- 
tor. This  suggests  that  the  depolarization  caused 
by  the  action  potential  does  invade  the  dendrites 
and  is  acting  through  the  same  mechanisms  that 
occur  when  the  cell  body  is  depolarized  by 
current  through  a  recording  electrode.  It  is  inter- 
esting that  Donald  Hebb  pointed  out  in  1949  that 
the  firing  of  postsynaptic  action  potentials  simul- 
taneously with  presynaptic  activity  might  be  an 
essential  condition  for  synaptic  plasticity.  This  is 
the  first  suggestion  that  action  potentials  might  in 
fact  have  a  role  to  play  in  inducing  LTP.  Much 
more  work  needs  to  be  done  to  confirm  this 
interpretation. 


If  calcium  currents  in  the  dendrites  of  hippo- 
campal pyramidal  neurons  are  indeed  responsi- 
ble for  altering  the  coupling  between  synapses 
and  the  cell  body,  then  it  should  be  possible  to 
verify  this  directly  by  measuring  the  intracellular 
calcium  ion  concentration.  Calcium  ions  are 
known  to  be  central  to  mechanisms  of  plasticity 
in  the  hippocampus  and  other  regions  of  the 
brain,  but  little  is  known  of  the  time  course  or 
spatial  distribution  of  these  changes.  Calcium- 
sensitive  fluorescent  dyes  are  first  injected  into 
single  pyramidal  neurons  in  hippocampal  slices. 
The  changes  in  Ca^^  concentration  in  both  the 
dendrites  and  soma  of  the  neuron  are  then  moni- 
tored with  a  confocal  fluorescence  microscope, 
which  allows  these  changes  to  be  detected  with 
high  spatial  and  temporal  resolution.  The  Ca^"^ 
elevations  seen  in  the  dendrites  were  faster  than 
those  seen  in  the  soma,  but  of  shorter  duration. 
This  would  be  expected,  as  their  volume  is  much 
smaller.  We  are  continuing  to  develop  these  tech- 
niques and  increase  the  sensitivity  of  our  record- 
ings so  that  we  may  be  able  to  correlate  physiolog- 
ical changes  with  changes  in  the  spatial 
distribution  of  intracellular  calcium. 

The  projections  between  areas  of  the  hippo- 
campus have  a  complex  three-dimensional  geom- 
etry that  is  difficult  to  delineate.  Anatomists  have 
studied  them  by  injecting  tracer  into  one  area  and 
serially  reconstructing  the  terminal  arborization 
of  filled  neurons.  In  collaboration  with  David 
Amaral  of  the  Salk  Institute,  we  have  examined 
such  a  data  set  from  injections  in  area  CA3  project- 
ing into  CAl  of  the  rat  hippocampus.  We  then 
used  a  mathematical  model  of  these  projections 
to  reconstruct  a  three-dimensional  map  of  the 
pathway. 

By  this  technique  we  have  uncovered  an  aspect 
of  hippocampal  structure  previously  unseen:  a  se- 
ries of  stripe-like  modulations  along  the  longitu- 
dinal axis.  We  are  now  attempting  to  demonstrate 
these  modulations  in  direct  experiments.  We  in- 
tend to  create  a  realistic  model  of  neuronal  net- 
works in  the  hippocampus  from  our  recon- 
structed pathways. 
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SEX  determination  is  a  developmental  process 
of  substantial  fundamental  and  practical  inter- 
est. In  a  number  of  vertebrates,  sex  is  determined 
by  environmental  factors.  For  example,  some 
reptiles  and  amphibians  have  a  temperature- 
dependent  sex-determining  mechanism.  In 
various  fishes,  sex  is  determined  through  social 
cues  and  interactions.  In  a  number  of  nonverte- 
brate  species,  the  somatic  sexual  phenotype  and 
the  actual  gonadal  sex  can  be  dissociated.  In 
mammals,  however,  the  presence  or  absence  of  a 

Y  chromosome  is  the  absolute  determinant  of  sex- 
ual differentiation.  The  presence  of  one  or  more 
Y-encoded  genes  is  necessary  and  sufficient  to  ini- 
tiate differentiation  of  the  embryonic  gonad  into 
a  testis.  FoIIow^ing  this  commitment,  a  variety  of 
secondary  events  result  in  the  elaboration  of  hor- 
mones that  produce  the  full  male  phenotype. 

It  is  generally  believed  that  mammalian  X  and 

Y  chromosomes  have  evolved  from  a  common  an- 
cestral homologous  chromosome  pair.  However, 
a  gene  or  genes  must  exist  on  the  Y  chromosome 
that  does  not  have  a  functional  counterpart  on  the 
X  to  effect  testicular  differentiation.  To  ensure 
this  state  of  affairs,  there  must  have  been  a  sup- 
pression of  genetic  recombination  between  the 
ancestral  X  and  Y  chromosomes  during  evolu- 
tion. Such  suppression  likely  occurred  through 
the  introduction  of  gross  rearrangements  in  ei- 
ther the  X  or  Y  chromosome  to  make  possible  the 
subsequent  divergence  of  DNA  sequences. 

In  modern-day  sex  chromosomes,  however, 
several  areas  of  persistent  X-Y  homology  can  be 
identified.  The  first  of  these  is  a  region  approxi- 
mately 3  megabases  long  in  which  absolute  se- 
quence identity  between  the  X  and  Y  chromo- 
some distal  short  arms  has  been  demonstrated. 
This  region  facilitates  pairing  and  appropriate 
segregation  of  the  X  and  Y  chromosomes  during 
male  meiosis.  The  frequent  meiotic  recombina- 
tion occurring  between  the  X  and  Y  within  this 
segment  ensures  the  homogenization  of  DNA  se- 
quences in  this  region. 

We  have  documented  the  role  of  this  segment 
of  the  X  and  the  Y  chromosomes  in  sex  chromo- 
some pairing  through  studies  of  a  patient  who 


had  a  small  deletion  of  the  needed  pairing  se- 
quences from  his  X  chromosome.  As  a  result  of 
this  abnormality,  his  sex  chromosomes  were  un- 
able to  align  with  each  other  during  meiosis.  It  is 
possible  that  more  subtle  abnormalities  of  this 
pairing  region  result  in  some  cases  of  male  steril- 
ity or  predispose  to  improper  sex  chromosome 
segregation  and  consequent  X  or  Y  aneuploidy. 

We  and  others  have  identified  other  regions  of 
X-Y  chromosome  similarity  wherein  sequences 
are  not  absolutely  identical.  Through  the  map- 
ping and  sequencing  of  these  regions  and  compar- 
ative studies  in  a  number  of  primate  species,  we 
have  developed  a  model  for  the  recent  evolution 
and  divergence  of  the  X  and  Y  chromosomes.  This 
model  suggests  that  at  least  one  pericentric  inver- 
sion of  the  Y  chromosome  has  occurred,  probably 
within  the  past  40  to  60  million  years  of  primate 
evolution.  As  a  consequence,  some  DNA  se- 
quences are  represented  on  the  long  arm  of  the  Y 
chromosome  that  have  homologues  on  the  short 
arm  of  the  X. 

Many  of  the  sequences  on  the  Y  chromosome 
appear  to  have  undergone  degeneration  and  to 
have  accumulated  mutations  that  render  them 
molecular  fossils  with  no  apparent  function. 
However,  we  have  recently  studied  the  structure 
and  expression  of  the  human  amelogenin  genes, 
which  are  located  in  the  same  general  area.  The 
amelogenin  X  and  Y  genes  are  approximately  90 
percent  similar  in  their  DNA  sequences,  but  are 
predicted  to  encode  proteins  of  varying  amino 
acid  composition.  Through  studies  of  RNA  pro- 
duction in  developing  tooth  buds,  we  have 
shown  that  both  the  X  and  Y  genes  are  functional 
and  can  be  expressed  in  males,  leading  to  the  pro- 
duction of  two  different  amelogenin  proteins. 
Thus  not  all  of  the  sequences  involved  in  the  Y 
chromosome  pericentric  inversion  have  under- 
gone degeneration. 

Another  practical  aspect  of  the  X-Y  homology 
is  that  it  can  occasionally  be  the  site  of  aberrant 
recombinational  events  producing  X/Y  translo- 
cations. We  have  studied  a  number  of  these  pa- 
tients in  some  detail  and  have  shown  that  this 
cytogenetic  abnormality  is  in  fact  the  result  of 
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unequal  exchange  within  this  region  of  homol- 
ogy between  the  X  and  Y  chromosomes. 

Another  area  of  substantial  interest  in  the  labo- 
ratory relates  to  deletions  in  X  chromosome- 
encoded  genes.  Duchenne  muscular  dystrophy  is 
a  human  genetic  disorder  resulting  from  abnor- 
rnalities  in  an  X-encoded  gene  called  dystrophin. 
The  majority  of  patients  with  Duchenne  dys- 
trophy have  deletions  of  part  or  all  of  the  dystro- 
phin gene. 

Patients  with  X-linked  ichthyosis,  a  heritable 
skin  disorder,  have  deficient  activity  of  the  en- 
zyme steroid  sulfatase  (STS).  This  gene  is  en- 
coded in  150  kilobases  of  X  chromosomal  DNA, 
and  in  90  percent  of  patients  with  this  relatively 
frequent  cutaneous  disorder,  the  entire  gene  is 
deleted.  Our  studies  have  established  that  such 
deletions  result  from  recurrent  de  novo  events  on 
the  X  chromosome.  These  involve  the  unequal 
exchange  of  DNA  catalyzed  by  a  low-copy- 
number  repetitive  element  that  flanks  the  STS 
gene. 

Thus  recombination  between  similar  but  non- 
identical  elements  appears  to  be  responsible  for 
the  genesis  of  these  common  deletional  events. 
We  are  currently  trying  to  determine  whether  this 
process  occurs  with  a  high  degree  of  frequency 
elsewhere  within  the  genome. 

The  phenomenon  of  X  chromosome  inactiva- 
tion  is  another  area  of  interest  in  our  laboratory. 
Since  all  normal  female  mammals  have  two  X 
chromosomes  and  males  have  only  a  single  X, 
there  is  a  fundamental  dosage  inequity.  This  is 
normalized  for  most  genes  by  functional  inactiva- 


tion  of  one  of  the  X's  in  female  somatic  cells  very 
early  in  embryogenesis.  Previously,  we  have 
shown  that  X  chromosome  inactivation  is  inte- 
grally related  to  DNA  methylation.  Currently  we 
are  investigating  a  model  of  X  chromosome  inac- 
tivation in  vitro  involving  the  use  of  differentiat- 
ing teratocarcinoma  cells.  We  hope  to  identify 
proteins  and  other  gene  products  that  are  impor- 
tant in  counting  the  number  of  X  chromosomes 
present  and  in  initiating  the  X  inactivation 
process. 

There  is  also  a  small  group  of  genes  that  appear 
to  escape  the  process  of  X  inactivation  even  when 
they  are  situated  on  otherwise  inactive  X  chro- 
mosomes. Our  previous  studies  have  shown  that 
these  genes  escape  inactivation  in  a  position-in- 
dependent fashion.  We  are  currently  studying  the 
regulatory  signals  involved  in  X  inactivation  and 
those  unique  features  that  render  some  genes  re- 
sistant to  this  phenomenon.  We  have  cloned  and 
characterized  the  promoter  of  the  STS  gene  as  an 
example  of  this  class  of  genes  and  have  found 
what  appears  to  be  a  novel  promoter  structure. 

We  are  trying  to  identify  other  genes  that 
escape  X  inactivation  so  that  we  can  compare  reg- 
ulatory regions  between  them  and  identify  DNA 
sequences  that  are  critical  for  initiating  and  prop- 
agating X  inactivation  signals  along  the  chromo- 
some. The  studies  described  should  provide  a 
better  understanding  of  the  evolution  of  the  sex 
chromosomes  and  insight  into  the  pathogenesis 
of  the  many  human  disorders  that  are  due  to  mu- 
tations or  structural  and  numerical  abnormalities 
of  these  chromosomes. 
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ADENOVIRUSES  are  widespread  in  nature. 
Humans  are  first  infected  when  very  young. 
Generally  the  infection  results  in  cold-like  symp- 
toms, and  the  episode  resolves  without  compli- 
cation. Injected  into  a  rat  or  hamster,  some  hu- 
man adenoviruses  induce  a  variety  of  benign  or 
malignant  tumors.  Adenoviruses  are  grouped  in 
the  class  of  DNA  tumor  viruses,  since  they  are 
tumorigenic  under  certain  conditions  and  con- 
tain DNA  chromosomes. 

Human  adenoviruses  can  be  propagated  easily 
in  cultured  human  cells.  When  human  cells  are 
infected,  the  approximately  30  viral  genes  are 
expressed,  the  viral  chromosome  is  replicated, 
and  individual  DNA  molecules  are  packaged  into 
virus-coded  protein  shells  to  generate  virus  parti- 
cles. Since  viral  gene  expression  is  tightly  regu- 
lated and  occurs  at  high  levels  compared  with 
that  of  most  cellular  genes,  adenoviruses  are  a 
useful  probe  for  the  study  of  gene  expression 
control. 

During  the  past  year,  much  of  our  effort  has 
focused  on  transcriptional  control  of  viral  gene 
expression.  The  first  viral  gene  to  be  expressed 
after  infection  of  a  cell  is  the  El  A  gene,  which 
encodes  a  protein  that  activates  expression  of  ad- 
ditional viral  genes  at  the  level  of  transcription. 
Expression  of  these  viral  genes  can  be  induced  by 
treatment  of  cells  with  cAMP. 

Most  viral  genes  activated  by  the  ElA  protein 
contain  a  small  sequence  termed  a  cAMP  re- 
sponse element  (CRE)  within  the  DNA  sequences 
of  the  transcriptional  control  region.  Cyclic  AMP 
is  a  key  player  in  a  regulatory  cascade  that  can 
induce  transcription  of  a  variety  of  cellular  genes. 
Increased  cAMP  levels  lead  to  activation  of  the 
CRE-binding  (CREB)  protein,  which  in  turn 
binds  to  the  CRE  sequence. 

The  ElA  gene  product  and  cAMP  act  in  synergy 
to  induce  transcription  of  target  genes.  We  have 
not  detected  a  change  in  the  level  of  CREB  activ- 
ity under  induced  conditions.  However,  a  second 
transcription  factor,  AP  I,  is  strongly  induced. 
AP  I  binds  to  a  DNA  sequence  termed  a  TPA 


response  element  (TRE).  The  TRE  sequence 
differs  from  the  CRE  by  only  1  base  pair,  and  AP- 1 
binds  almost  as  well  to  CRE  as  to  TRE  sequences. 
Thus  our  current  hypothesis  is  that  the  ElA  gene 
product  plus  cAMP  acts  to  raise  the  activity  level 
of  AP  I,  which  binds  to  both  CRE  and  TRE  se- 
quences and  thus  contributes  to  the  activation  of 
viral  genes. 

The  induction  of  AP-1  activity  occurs  at  two 
levels.  The  first  is  a  rapid  modification  of  the  fac- 
tor, causing  it  to  display  altered  physical  proper- 
ties. The  second  level  requires  active  transcrip- 
tion of  the  products  of  two  proto-oncogenes, 
c-fos  and  junB.  Together  these  products  consti- 
tute one  form  of  AP- 1 .  It  is  likely  that  the  modi- 
fied form  of  AP- 1  not  only  induces  expression  of 
adenovirus  genes  but  also  of  its  own  constituents, 
activating  an  autoregulatory  loop.  We  are  now 
working  to  identify  the  constituents  of  the  modi- 
fied AP-1  activity  and  to  delineate  the  role  that 
the  ElA  product  plays  in  the  modification 
process. 

The  induction  of  AP-1  activity  by  ElA  plus 
cAMP  is  transient.  A  second  protein,  encoded  by 
the  adenovirus  E4  gene,  causes  AP-1  activity  to 
return  to  basal  levels  several  hours  after  treat- 
ment of  infected  cells  with  cAMP.  Thus  the  in- 
duction of  AP  I  is  tightly  regulated  in  the  in- 
fected cell.  The  E4  protein  also  causes  a 
reduction  in  the  level  of  phosphorylation  of  the 
c-fos  component  of  AP- 1 .  We  are  presently  work- 
ing to  determine  whether  this  change  in  phos- 
phorylation is  functionally  related  to  the  reduc- 
tion in  AP-1  activity  and  to  elucidate  the 
mechanism  by  which  the  reduction  occurs. 

The  ElA  protein,  in  addition  to  inducing  a  posi- 
tively acting  transcription  factor,  AP- 1 ,  can  also 
inhibit  the  activity  of  a  negatively  acting  factor, 
which  we  have  termed  YY- 1 .  We  first  identified 
the  binding  site  for  YY-1  in  the  P5  transcriptional 
control  region  of  adeno-associated  virus,  a  defec- 
tive virus  that  depends  on  a  variety  of  adenovirus 
gene  products  for  its  replication.  The  adenovirus 
ElA  protein  activates  expression  of  the  P5  con- 


403 


Adenovirus  as  a  Model  for  Oncogenesis  and  Control  of  Gene  Expression 


trol  region.  In  the  absence  of  ElA  protein,  YY-1 
represses  P5  transcription.  In  its  presence  the  re- 
pression is  relieved,  and  the  control  region  be- 
comes transcriptionally  active. 

To  study  the  YY-1  factor,  we  prepared  some 
from  cultured  human  cells  and  obtained  a  short 
amitib  acid  sequence  from  the  purified  protein. 
This  sequence  was  used  to  design  a  short  probe 
DNA,  which  enabled  us  to  identify  and  isolate  a 
cDNA  clone  encoding  the  protein.  Protein  was 
expressed  from  the  clone  and  shown  to  bind  spe- 
cifically to  the  YY-1  recognition  site.  Sequence 
analysis  of  the  clone  revealed  that  YY-1  is  a  68- 
kDa  protein  with  a  zinc  finger  DNA-binding 
motif. 

We  are  presently  studying  the  mechanisms  by 
which  YY- 1  represses  transcription  and  by  which 
the  adenovirus  ElA  protein  relieves  the  repres- 
sion. We  also  wish  to  determine  whether  ElA 
protein  mediates  its  effects  on  the  positive-acting 
AP- 1  and  the  negative-acting  YY- 1  factors  through 
the  same  or  different  mechanisms.  That  is,  does 
the  ElA  protein  mediate  just  one  biochemical  re- 
action that  affects  AP-1  and  YY-1  differently,  or 
does  it  carry  out  multiple,  physiologically  dis- 
tinct biochemical  processes? 

In  addition  to  the  ElA  protein,  which  functions 
to  induce  transcription,  we  have  studied  the  ade- 
novirus 55-kDa  protein  induced  by  ElB.  This 
protein  also  controls  the  accumulation  of 
mRNAs,  but  it  acts  after  transcription.  It  simulta- 
neously blocks  the  accumulation  of  cellular 
mRNAs  and  enhances  accumulation  of  viral 


mRNAs.  Further  studies  indicated  that  the  pro- 
tein's discrimination  of  viral  from  cellular 
mRNAs  is  based  on  the  site  of  synthesis.  If  an  RNA 
is  transcribed  from  the  viral  chromosome,  its  ac- 
cumulation is  enhanced;  if  it  is  transcribed  from  a 
cellular  chromosome,  its  accumulation  is 
blocked. 

We  determined  the  localization  of  the  ElB  55- 
kDa  protein  within  the  infected  cell,  using  a  tech- 
nique termed  immunogold  electron  microscopy. 
The  protein  was  localized  to  the  periphery  of 
spherical  bodies  previously  shown  to  be  sites  of 
viral  DNA  replication  and  transcription. 

Localization  of  the  ElB  protein  at  the  replica- 
tion-transcription centers  is  consistent  with  its 
ability  to  enhance  accumulation  of  viral  mRNAs. 
Its  location  makes  it  available  to  these  mRNAs 
shortly  after  their  synthesis  and  processing.  But 
how  does  this  localization  prevent  cytoplasmic 
accumulation  of  host  cell  mRNAs?  It  seems  likely 
that  the  protein  interacts  with  and  draws  to 
the  periphery  of  viral  replication-transcription 
centers  a  cellular  factor  that  functions  to  move 
mRNAs  from  their  site  of  synthesis  and  processing 
to  the  nuclear  pore  for  transport  out  of  the  nu- 
cleus. Relocation  of  a  cellular  factor  required  for 
intranuclear  movement  of  RNAs  could  explain 
how  the  ElB  protein  can  inhibit  accumulation  of 
RNAs  transcribed  from  cellular  locations  and  si- 
multaneously stimulate  accumulation  of  tran- 
scripts derived  from  the  viral  chromosome.  We 
are  presently  searching  for  cellular  factors  that 
interact  with  this  protein. 
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EACH  day  humans  produce  billions  of  blood 
cells,  which  enter  the  circulation  from  their 
sites  of  origin  in  the  bone  marrow.  The  majority 
are  red  cells  (erythrocytes) ,  which  transport  oxy- 
gen, and  the  remainder  are  white  cells  (leuko- 
cytes), which  play  a  vital  role  in  preventing  in- 
fection by  bacteria,  viruses,  and  other  parasites. 
White  cells,  unlike  the  continuously  circulating 
red  cells,  migrate  from  small  blood  vessels  into 
other  organs,  where  they  participate  in  the  im- 
mune and  inflammatory  reactions  that  character- 
ize host  defense.  Different  classes  of  white  cells 
carry  out  specialized  functions:  macrophages  and 
granulocytes  ingest  and  kill  microorganisms, 
whereas  lymphocytes  recognize  foreign  antigens 
and  produce  antibodies  to  combat  them. 

To  maintain  constant  numbers  of  cells  in  the 
circulation,  the  process  of  blood  cell  develop- 
ment (hematopoiesis)  must  be  subject  to  exqui- 
sitely sensitive  regulatory  controls.  Diseases  af- 
fecting blood  cell  production  include  those  in 
which  insufficient  numbers  of  red  and  white 
cells  enter  the  circulation  (anemias  and  leuko- 
penias); those  in  which  the  numbers  of  normal 
white  cells  are  increased  (leukocytosis,  seen  in 
systemic  infections);  and  those  in  which  cells  are 
abnormal  (dysplasias)  or  cancerous  (leukemias) . 

Hematopoiesis  is  regulated  by  a  group  of  pro- 
tein growth  factors,  collectively  termed  colony- 
stimulating  factors  (CSFs)  or  interleukins.  These 
polypeptides  are  produced  by  resident  stromal 
cells  in  the  bone  marrow  as  well  as  by  circulating 
blood  cells  (the  term  interleukin  literally  mean- 
ing between  white  blood  cells) .  First  recognized 
through  their  ability  to  stimulate  immature  bone 
marrow-derived  (myeloid)  precursor  cells  to 
form  colonies  composed  of  differentiated  blood 
cell  elements,  CSFs  were  named  for  the  types  of 
colonies  they  produced.  For  example,  M-CSF  (or 
CSF-1)  specifically  induces  macrophage  colo- 
nies, G-CSF  supports  granulocyte  development, 
and  GM-CSF  stimulates  the  proliferation  and  mat- 
uration of  both  cell  types. 

CSFs,  now  produced  in  previously  unobtain- 
able quantities  by  genetic  engineering  tech- 
niques, have  become  part  of  the  clinical  arma- 


mentarium and  have  proved  particularly  useful  in 
reversing  certain  bone  marrow  failures  and  in 
heightening  impaired  host  defenses  against 
infection. 

Signal  Transduction  by  the  CSF-1  Receptor 

CSF-1  (M-CSF)  supports  the  groMT;h,  matura- 
tion, and  survival  of  macrophage  precursors  in 
the  bone  marrow  and  potentiates  the  effector 
functions  of  mature  macrophages  during  the  in- 
flammatory response.  Its  diverse  physiologic 
actions  are  mediated  through  its  binding  to  the 
CSF- 1  receptor  (CSF- 1 R) ,  a  cell  surface  glycopro- 
tein. The  receptor  consists  of  an  extracellular  li- 
gand-binding  portion,  joined  through  a  single 
membrane-spanning  segment  to  an  intracellular 
kinase  domain  capable  of  phosphorylating  other 
cellular  proteins  on  tyrosine  residues.  Receptor- 
mediated  phosphorylation  modifies  the  biochem- 
ical behavior  of  several  effector  proteins,  which 
then  relay  signals  to  the  cell  nucleus  that  effect 
changes  in  gene  expression,  DNA  synthesis,  and 
cell  division. 

CSF-1  induces  dimerization  of  its  receptor  at 
the  cell  surface,  activating  CSF- 1 R  protein  kinase 
activity  and  leading  to  the  cross-phosphorylation 
of  receptor  subunits  on  tyrosine.  The  autophos- 
phorylation  of  CSF-IR  triggers  its  association 
with  other  cellular  enzymes,  whose  biochemical 
activities  in  signal  transduction  are  modified  by 
their  binding  to  the  receptor,  their  phosphoryla- 
tion on  tyrosine,  or  both.  Mutant  CSF-IR  mole- 
cules lacking  particular  autophosphorylation 
sites  are  defective  in  some  aspects  of  signaling 
but  not  others,  suggesting  that  the  combinatorial 
actions  of  enzymes  that  associate  with  the  recep- 
tor can  in  part  determine  the  specificity  of  the 
CSF-1  response  in  different  cell  types. 

CSF- 1 R  is  normally  restricted  in  its  expression 
to  macrophages  and  their  precursors;  it  is  not  de- 
tected on  cells  of  other  hematopoietic  lineages. 
However,  introduction  of  the  gene  encoding 
CSF-IR  into  cells  that  depend  upon  other  growth 
factors  enables  them  to  respond  to  CSF- 1 .  Expres- 
sion of  CSF-IR  in  immature  myeloid  cells  allows 
CSF-1  to  replace  their  interleukin-3  requirements 
for  growth  and  viability. 
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By  contrast,  transduction  of  CSF-IR  into  imma- 
ture pre-B  lymphoid  cells  not  only  relieves  their 
normal  dependence  on  interleukin-7,  but  can  re- 
program  the  fate  of  responding  cells,  enabling 
them  to  trans-differentiate  to  mature  macro- 
phages in  response  to  CSF- 1 .  Physiologic  targets 
of  the  CSF- 1 R  kinase  must  therefore  be  expressed 
more  ubiquitously  than  the  receptor  itself  and, 
depending  on  cell  context,  can  modify  genetic 
programs  governing  both  cell  proliferation  and 
fate. 

Transforming  Potential  of  CSF-IR 

CSF- 1 R  is  encoded  by  the  FMS  proto-oncogene , 
which  can  be  converted  by  mutation  to  an  onco- 
gene that  induces  tumor  formation.  Mutations  in 
the  extracellular  domain  of  CSF-IR  activate  the 
receptor  kinase  in  the  absence  of  CSF- 1 ,  leading 
to  sustained,  unregulated  signals  for  cell  growth. 
By  mutagenizing  segments  of  human  FMS  and 
screening  "libraries"  of  mutated  genes  for  their 
ability  to  induce  cell  transformation,  we  local- 
ized particular  subdomains  within  the  CSF- 1 R  ex- 
tracellular domain  where  "activating  mutations" 
occur.  Postulating  that  similar  genetic  alterations 
might  contribute  to  leukemogenesis,  we  are  us- 
ing polymerase  chain  reaction  techniques  to  as- 
say for  the  presence  of  activating  mutations  in  the 
FMS  genes  of  myeloid  tumor  cells. 

CSF- 1 -Responsive  Genes  Required 
for  Cell  Division 

CSF- 1 R  regulates  genes  that  collectively  govern 


macrophage  growth,  survival,  differentiation, 
and  effector  functions.  When  proliferating  macro- 
phages complete  cell  division  (mitosis),  they 
enter  a  10-  to  12-hour  gap  phase  (Gl)  in  their 
cell  cycle  before  synthesis  of  chromosomal  DNA 
is  reinitiated  (S  phase).  CSF-1  is  required 
throughout  Gl  for  the  cells  to  enter  S  phase,  but 
once  they  begin  to  replicate  DNA,  they  can  com- 
plete S  phase  and  divide  in  the  absence  of  the 
growth  factor.  The  requirement  for  persistent 
CSF- 1  R-mediated  signals  throughout  G 1  implies 
that  the  expression  of  growth  factor-responsive 
genes  is  temporally  regulated  over  a  relatively 
long  period.  Conceivably,  CSF- 1 -responsive  gene 
products  synthesized  early  in  Gl  feed  back  to 
modulate  receptor-mediated  signals,  so  that  the 
genes  that  commit  cells  to  DNA  synthesis  are  ulti- 
mately induced. 

Genetic  data  accumulated  through  studies  of 
yeasts  indicate  that  a  class  of  cell  cycle  genes 
called  Gl  cyclins  might  act  to  prime  cells  for 
DNA  synthesis.  We  recently  isolated  genes  from 
CSF-1 -stimulated  mouse  macrophages  that  have 
predicted  structural  motifs  reminiscent  of  known 
cyclins  and  whose  expression  is  regulated  in  Gl . 
Related  genes  are  expressed  in  other  cell  types, 
suggesting  the  presence  of  a  novel  cyclin  gene 
family.  If  we  can  demonstrate  that  expression  of 
these  cyclins  governs  G 1  progression  in  mamma- 
lian cells,  we  may  be  mechanistically  able  to  link 
early  steps  in  CSF- 1  -mediated  signal  transduction 
with  later  events  required  for  DNA  synthesis. 
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THE  ability  of  neurons  to  regulate  and  alter  the 
strength  of  their  synaptic  connections  is 
thought  to  play  a  crucial  role  during  learning  and 
development.  Changes  in  synaptic  function  can 
be  brief,  lasting  seconds  to  minutes,  or  pro- 
longed, lasting  days  to  months.  One  important 
mechanism  by  which  neurons  regulate  synaptic 
transmission  is  by  regulating  the  amount  of  neuro- 
transmitter released  from  the  presynaptic  cell. 
Since  this  release  is  triggered  by  the  influx  of  cal- 
cium ions  into  the  presynaptic  terminal,  it  has 
been  widely  proposed  that  modulation  of  release 
results  from  the  modulation  of  presynaptic  cal- 
cium levels.  However,  there  is  little  direct  evi- 
dence to  support  this  hypothesis. 

For  the  past  several  years  we  have  focused  our 
attention  on  the  up-  and  down-modulation  of  neu- 
rotransmitter release  from  the  terminals  of  Aply- 
sia  sensory  neurons  in  response  to  the  mod- 
ulatory transmitter  serotonin  (5-HT)  and  the 
neuropeptide  FMRFamide,  respectively.  Electro- 
physiological recording  techniques  have  shown 
that  5-HT  increases  action  potential  duration  by 
closing  a  particular  potassium  channel  (the  S 
channel)  and  increases  an  inward  calcium 
current  that  is  sensitive  to  block  by  dihydropyri- 
dine  drugs  (the  L-type  calcium  channel) .  In  con- 
trast, FMRFamide  increases  the  opening  of  the  S 
potassium  channel,  decreasing  action  potential 
duration,  and  decreases  current  flow  through  a 
class  of  calcium  channels  not  blocked  by  dihy- 
dropyridines  (the  N-type  channels) . 

In  a  recent  study,  Brian  Edmonds,  Nicolas  Dale, 
Marc  Klein,  and  Eric  Kandel  (HHMI,  Columbia 
University)  have  shown  that  only  calcium  that 
enters  through  the  N-type  calcium  channels  is 
able  to  trigger  transmitter  release.  This  conclu- 
sion was  based  on  the  finding  that  application  of 
dihydropyridines  to  block  the  L-type  calcium 
channels  has  little  effect  on  transmitter  release. 
Thus  presynaptic  facilitation  with  5-HT  is 
thought  to  result  from  an  indirect  increase  in  cal- 
cium influx  via  the  N  channels  into  the  sensory 
neuron  presynaptic  terminals  due  to  the  increase 


in  action  potential  duration  (which  allows  the 
N-type  calcium  channels  to  remain  open  lon- 
ger). Presynaptic  inhibition  with  FMRFamide  is 
thought  to  result  from  a  direct  decrease  of  cal- 
cium influx  via  the  N-type  calcium  channels. 

Do  5-HT  and  FMRFamide  modulate  calcium  in- 
flux during  an  action  potential  in  the  presynaptic 
terminals  of  sensory  neurons?  To  what  extent  is 
such  modulation  due  to  changes  in  calcium  in- 
flux via  N-type  as  opposed  to  L-type  calcium 
channels?  Why  doesn't  calcium  entry  via  the  L- 
type  channels  contribute  to  transmitter  release? 

To  address  such  questions  we  have  used  the 
fluorescent  dye  fura-2,  developed  by  Roger  Tsien 
(HHMI,  University  of  California,  San  Diego), 
which  binds  calcium  and  reports  intracellular  cal- 
cium concentration,  to  study  the  effects  of  5-HT 
and  FMRFamide  on  intracellular  calcium  levels  in 
response  to  action  potential  stimuli.  In  a  previous 
study  we  showed  that  the  action  potential- 
induced  calcium  transient  was  increased  by  5-HT 
and  decreased  by  FMRFamide  in  sensory  cell  bod- 
ies, growth  cones,  and  neurites.  Over  the  past 
year  we  have  investigated  whether  such  modula- 
tory changes  occur  at  presynaptic  terminals,  by 
coculturing  sensory  neurons  with  postsynaptic 
motor  neurons.  Pseudocolor  images  of  intracel- 
lular calcium  levels  were  obtained  at  presumed 
presynaptic  sensory  neuron  terminals  at  regions 
of  contact  between  sensory  and  motor  neurons. 
The  intracellular  calcium  transient  induced  by 
action  potentials  was  markedly  enhanced  by  5- 
HT.  These  results  show  that  presynaptic  calcium 
transients  are  modulated  at  defined  regions  of 
contact  between  sensory  and  motor  neurons. 

Why  is  it  that  calcium  influx  via  the  L-type 
channels  does  not  contribute  to  transmitter  re- 
lease? One  possibility  is  that  during  an  action  po- 
tential, the  L-type  channels  contribute  relatively 
little  to  total  calcium  influx.  Alternatively,  these 
channels  may  carry  substantial  amounts  of  cal- 
cium into  the  cell,  but  the  channels  may  be  local- 
ized in  regions  of  the  cell  far  from  the  site  of 
synaptic  contacts. 
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Measurements  of  calcium  transients  in  re- 
sponse to  action  potentials  show  that,  in  the  ab- 
sence of  5-HT,  dihydropyridines  have  a  relatively 
small  effect,  reducing  calcium  entry  by  less  than 
1 0  percent.  Thus  the  lack  of  effect  of  dihydropyr- 
idines under  normal  conditions  could  simply  be 
explained  by  their  small  contribution  to  total  cal- 
cium influx.  However,  in  the  presence  of  5-HT, 
application  of  dihydropyridines  causes  a  substan- 
tial decrease  in  the  amount  of  calcium  influx, 
without  affecting  transmitter  release.  This  leads 
us  to  conclude  that  the  L-type  calcium  channels 
must  be  segregated  from  release  sites. 

How  much  of  the  increase  in  the  calcium 
transient  with  5-HT  is  due  to  modulation  of  the 
L-type  calcium  channels  and  how  much  is  due 
to  the  increased  calcium  influx  via  the  N-type 
calcium  channels  as  a  consequence  of  the  in- 
crease in  action  potential  duration?  To  answer 
this  question  we  compared  the  increase  in  the 
calcium  transient  in  response  to  5-HT  in  the  ab- 


sence and  presence  of  dihydropyridines.  About 
two-thirds  of  the  increase  is  due  to  the  increase 
in  the  L  current,  while  one-third  is  due  to  cal- 
cium influx  via  the  N-type  channels,  which  pre- 
sumably contribute  to  the  increase  in  transmit- 
ter release. 

These  results  provide  direct  evidence  in  sup- 
port of  the  hypothesis  that  alterations  in  calcium 
influx  at  presynaptic  terminals  is  involved  in 
modulating  the  strength  of  synaptic  transmission. 
However,  the  results  also  raise  several  questions 
currently  under  investigation.  To  what  extent 
does  the  change  in  calcium  account  quantita- 
tively for  the  change  in  transmitter  release?  Can 
we  visualize  local  clusters  of  L-type  and  N-type 
calcium  channels  and  show  that  only  the  N-type 
channels  are  present  at  presynaptic  release  sites? 
To  what  extent  are  long-term  changes  in  synaptic 
transmission  due  to  changes  in  calcium  entry? 
What  role  do  the  L-type  channels  play  in  cellular 
physiology? 
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Pseudocolor  image  of  intracellular  calcium 
concentration  in  a  presynaptic  process  of  an 
Aplysia  sensory  neuron.  A:  Left  panel  shows 
that,  at  rest,  resting  calcium  is  low.  After  stimu- 
lating a  brief  train  of  action  potentials,  there  is 
a  rise  in  intracellular  calcium. 

B:  After  application  of  the  facilitatory  trans- 
mitter serotonin  (5-HT),  the  same  train  of  ac- 
tion potentials  gives  rise  to  a  much  larger  in- 
crease in  calcium.  Scale  applies  to  all  panels. 

Research  of  Steven  A.  Siegelbaum. 
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OUR  goal  is  to  understand  how  the  cell  re- 
sponds to  regulatory  signals.  We  have  fo- 
cused on  two  processes:  the  transmission  of  sig- 
nals across  the  membrane,  and  the  regulation  of 
gene  expression.  The  spirit  of  the  work  is  reduc- 
tionist, rather  than  descriptive;  we  aim  to  under- 
stand the  key  steps  of  these  processes  in  terms  of 
basic  physical  and  chemical  principles.  To  do 
this,  we  must  visualize  the  detailed  structure  of 
the  relevant  macromolecules  and  the  complexes 
they  form  with  other  molecules.  The  best  way  to 
do  this  is  to  crystallize  these  complexes  and  de- 
termine their  structure  by  x-ray  diffraction. 

The  stereochemical  underpinning  for  the 
transmission  of  regulatory  signals  through  cell 
membranes  is  a  broad  and  varied  subject.  We 
have  begun  by  focusing  on  the  clinically  impor- 
tant question  of  inflammation.  When  the  signals 
are  transmitted,  phospholipases  are  stimulated  to 
attack  phospholipids,  the  main  substance  of 
membranes.  The  breakdown  products  are  used  to 
supply  precursors  or  signals  for  subsequent  reac- 
tions inside  the  cell.  The  enzyme  phospholipase 
A2  (PLA2)  is  thought  to  be  responsible  for  pro- 
ducing arachidonic  acid,  which  is  the  precursor 
of  most  of  the  small  compounds  that  mediate 
inflammation. 

Recently  we  defined  the  elusive  mechanism  by 
which  PIA2  hydrolyzes  phospholipids  and  re- 
leases arachidonic  acid.  We  worked  out  the  mech- 
anism by  solving  the  crystal  structures  of  enzyme- 
inhibitor  complexes  in  which  the  inhibitor  was 
designed — by  Michael  Gelb  of  the  University  of 
Washington — to  simulate  the  crucial  catalytic  in- 
termediate, or  "transition,"  state.  Enzymes  can 
speed  reactions  by  stabilizing  such  states,  and 
these  crystal  structures  show  clearly  how  PLA2 
does  this.  We  now  have  three  crystallographically 
independent  structures  of  these  complexes,  and 
all  show  the  same  characteristic  relationship  be- 
tween the  enzyme's  active  surface  and  the  transi- 
tion-state analogue. 

These  structures  have  additional  and  special  in- 
terest to  a  wide  scientific  audience.  We  have  de- 
fined, in  atomic  detail,  the  mechanism  of  a  cal- 


cium ion's  regulatory  activity  at  the  cell  surface. 
Moreover,  we  have  shown  how  a  soluble  protein 
interacts  with  the  face  of  the  cell  membrane.  Be- 
sides answering  general  scientific  questions, 
these  studies  afford  an  immediate  practical  gain. 
Our  structures  may  provide  a  basis  for  the  ratio- 
nal design  of  therapeutic  agents  that  block  an  in- 
appropriate or  exceptionally  severe  inflamma- 
tory response.  To  this  end,  we  have  recently 
obtained  suitable  crystals  of  PLA2  found  in  the 
inflamed  joints  of  patients  with  acute  rheumatoid 
arthritis  and  the  serum  of  women  with  toxic 
shock  syndrome.  The  crystal  structure  of  this  en- 
zyme should  provide  a  starting  point  for  the  ratio- 
nal design  of  drugs  aimed  at  these  and  related 
inflammatory  diseases. 

Ultimately  most  regulatory  signals  control  the 
expression  of  genes,  turning  some  off  and  others 
on.  Much  of  this  occurs  by  regulating  transcrip- 
tion (the  synthesis  of  messenger  RNA) .  An  essen- 
tial element  of  transcriptional  regulation  is  to  tar- 
get the  regulatory  proteins  to  the  genes  they  are 
designed  to  control.  For  example,  the  estrogen 
receptor,  which  is  an  activator  of  transcription,  is 
obviously  targeted  to  different  genes  than  its 
counterpart,  the  testosterone  receptor.  This  is 
usually  accomplished  by  a  recognition  process 
whereby  the  regulatory  protein  binds  to  a  spe- 
cific DNA  sequence  associated  with  the  genes  to 
be  controlled.  Recognition  involves  the  forma- 
tion of  a  strong  interaction  between  the  protein 
and  the  correct  DNA  sequence  and  weak  interac- 
tions with  other  DNA.  We  have  focused  our  atten- 
tion on  the  basic  chemistry  responsible  for  stabi- 
lizing specific  protein-DNA  interactions. 

Our  results  in  this  effort  are  most  advanced  in 
the  case  of  the  trp  repressor-operator  complex, 
which  regulates  the  expression  of  genes  responsi- 
ble for  tryptophan  biosynthesis  in  Escherichia 
colt.  The  structure  was  refined  to  unusually  high 
resolution  (1.9  A).  This  refinement,  coupled  to 
the  fact  that  the  same  structure  is  found  in  four 
independent  representations,  provides  us  with  a 
model  of  exceptional  clarity  and  detail. 

The  chemistry  of  the  interface  was  unex- 


409 


Chemical  Details  of  Cellular  Regulation 


pected.  Most  of  the  direct  interactions  are  hydro- 
gen bonds  that  appear  to  lash  the  protein  to  the 
sugar  phosphate  backbone  on  both  sides  of  the 
DNA's  major  groove.  In  some  instances  a  single 
phosphate  accepts  four  hydrogen  bonds.  An  inti- 
mate specific  fit  is  partly  accomplished  by  a  de- 
formation of  the  DNA,  which  we  believe  to  be 
allowed  specifically  by  the  target's  sequence  at 
little  or  no  cost  in  internal  energy.  We  refer  to  this 
sequence-dependent  deformability  as  "indirect 
read-out"  of  the  sequence. 

Most  surprising  is  the  array  of  fixed  water  mole- 
cules (20  per  complex)  that  help  mediate  the 
specific  interaction.  Ten  of  these  are  also  found 
in  the  uncomplexed  repressor  protein  and  there- 
fore represent  noncovalent  extensions  of  the  pro- 
tein's "recognizable"  surface.  Four  of  the  ten 
fixed  water  molecules  provide  six  highly  specific 
hydrogen  bonds  to  bases  that  define  the  trp  opera- 
tor as  the  target  for  the  trp  repressor.  Recent  mu- 
tational studies  designed  to  test  the  functional 
significance  of  these  specific  water-mediated 
contacts  support  the  idea  that  water  molecules 
can  mediate  molecular  recognition.  The  obliga- 
tory involvement  of  water  in  molecular  recogni- 
tion has  also  been  reported  recently  for  highly 
specific  interactions  of  proteins  with  sugars  and 
RNA  molecules. 

Our  most  recent  work  has  focused  on  under- 
standing DNA  recognition  in  higher  cells.  We  are 
trying  to  determine  the  crystal  structures  of  a  vari- 
ety of  regulatory  protein-DNA  complexes  that 
control  metabolism,  grovvT:h,  differentiation,  and 
the  expression  of  viral-specific  genes.  Some  of 


these  complexes  closely  resemble  the  products 
of  oncogenes. 

We  have  just  determined  the  crystal  structure 
of  the  DNA-binding  domain  of  the  glucocorticoid 
(Cortisol)  receptor  in  a  complex  with  its  DNA  tar- 
get. The  ability  of  this  protein  and  each  of  its 
close  homologues  to  bind  to  its  specific  target 
sequence  lies  at  the  heart  of  the  selective  expres- 
sion of  genes  in  response  to  each  of  the  different 
steroid  hormones.  Despite  their  discrimination  in 
targeting  the  correct  regulatory  sequence,  the 
DNA-binding  domains  of  all  the  steroid  receptor 
"superfamily"  (thyroid  hormone  included)  have 
a  highly  conserved  sequence  and  presumably 
very  similar  structures.  We  found  that  these  DNA- 
binding  domains  contain  two  intriguing  sub- 
structures called  zinc  fingers,  found  in  many 
genetic  regulatory  proteins.  Mutagenesis  experi- 
ments done  elsewhere  have  indicated  where  to 
focus  our  attention,  and  we  can  now  account  for 
the  mechanism  by  which  the  glucocorticoid  re- 
ceptor can  discriminate  between  its  target  DNA 
and  those  of  the  estrogen  and  thyroid  receptors. 

Work  on  other  systems  is  not  quite  so  far  along. 
We  hope  to  grow  better  DNA-protein  crystals 
containing  the  binding  domain  of  the  "leucine 
zipper  proteins,"  another  important  type  of  tran- 
scription factor  first  characterized  by  Steven 
McKnight  (HHMI  at  the  Carnegie  Institution). 
Work  also  continues  with  Laimonis  Laimins 
(HHMI  at  the  University  of  Chicago)  on  strongly 
diffracting  crystals  of  the  specific  DNA  complex 
of  a  regulator  protein,  E2,  which  can  trigger  the 
transforming  activity  of  the  potentially  onco- 
genic papilloma  virus. 
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MY  research  interests  are  focused  on  the 
analysis  of  transcriptional  regulatory  cir- 
cuits that  turn  genes  on  or  off  during  the  growth 
and  differentiation  of  B  lymphocytes,  cells  that 
produce  antibodies.  The  B  cell  lineage  is  a  very 
useful  model  for  exploring  the  molecular  basis  of 
differential  gene  activity  in  mammalian  develop- 
ment. We  are  seeking  answers  to  the  following 
questions:  What  is  the  nature  of  the  genetic  ele- 
ment(s)  linked  to  a  target  gene  that  controls  its 
transcriptional  activity?  What  is  the  nature  of  the 
regulatory  protein (s)  that  recognizes  this  genetic 
element?  How  does  the  regulatory  protein  on 
binding  near  its  target  gene  modulate  the  activity 
of  the  enzyme  complex  that  transcribes  the  gene? 
What  is  the  mechanism  by  which  a  growth  or  de- 
velopmental signal  is  transduced  by  the  regula- 
tory protein (s)  to  effect  gene  activity?  How  does 
the  structure  of  the  chromatin  within  which  the 
target  gene  is  packaged  influence  the  function  of 
the  regulatory  protein (s)? 

The  heavy-  and  light-chain  genes  encoding  the 
immunoglobulin  (Ig)  molecule  are  selectively 
transcribed  in  B  cells.  These  genes  are  assembled 
from  gene  segments  through  an  ordered  series  of 
somatic  recombination  events  that  occur  in  a  de- 
veloping B  cell.  The  heavy-chain  gene  locus  is  the 
first  to  undergo  recombination  and  transcrip- 
tional activation,  thereby  defining  the  pre-B  cell 
developmental  state.  Subsequently  one  of  two 
light-chain  gene  loci  is  recombined  and  ex- 
pressed, resulting  in  the  development  of  a  mature 
B  cell.  Ig  genes  contain  multiple  cis-acting  tran- 
scriptional regulatory  elements  that  restrict  their 
expression  to  appropriate  stages  in  the  B  lineage. 

Previous  work  by  various  research  groups  has 
resulted  in  the  identification,  characterization, 
and  cloning  of  a  B  cell-specific  regulatory  pro- 
tein, Oct-2.  This  protein  recognizes  the  octanu- 
cleotide  sequence  ATTTGCAT,  which  confers  B 
cell  specificity  to  Ig  gene  promoters.  The  same 
sequence  motif  is  also  a  functional  component  of 
the  heavy-chain  and  /c-light-chain  gene  en- 
hancers. Oct-2  can  activate  transcription  of  a  re- 
porter gene  linked  to  an  Ig  promoter  in  a  non-B 


cell.  Thus  Oct-2  appears  to  be  both  necessary  and 
sufficient  for  regulating  the  activity  of  Ig  pro- 
moters. The  deduced  amino  acid  sequence  of 
Oct-2  reveals  a  region  of  similarity  that  is  shared 
with  three  other  regulatory  proteins — Pit-1, 
Oct-1 ,  and  unc-86.  This  region  is  termed  the  POU 
box  and  includes  a  subdomain  related  to  the 
homeobox. 

Current  research  focuses  on  the  regulatory 
functions  of  Oct-2  in  B  cell  development,  as  well 
as  the  mechanism  underlying  lineage-restricted 
expression.  Oct-2  is  likely  to  represent  a  member 
of  a  regulatory  gene  hierarchy  that  controls  the 
development  of  B  cells  from  pluripotent  hemato- 
poietic stem  cells.  By  studying  the  regulation  of 
Oct-2  expression,  we  can  identify  and  isolate 
genes  interacting  with  Oct-2  in  the  hierarchy. 
The  murine  gene  encoding  Oct-2  has  been  iso- 
lated in  the  laboratory  and  is  being  analyzed  for 
regulatory  sequences  that  control  its  expression 
in  developing  B  cells. 

The  activity  of  the  Oct-2  gene  is  regulated  dur- 
ing B  cell  differentiation.  The  Oct-2  gene  is  ex- 
pressed at  low  levels  in  pre-B  cells.  Oct-2  ex- 
pression is  transcriptionally  up-regulated  upon 
pre-B  cell  differentiation,  and  higher  Oct-2  pro- 
tein levels  correlate  with  activation  of  the  Ig  k- 
light-chain  gene  locus.  Increased  expression  of 
Oct-2  in  pre-B  cells  is  induced  by  signaling  with 
the  B  cell  mitogen,  bacterial  lipopolysaccharide, 
as  well  as  the  lymphokine  interleukin-1  (IL-1). 
Transforming  growth  factor-jS,  an  inhibitor  of  k 
gene  induction  in  pre-B  cells,  blocks  the  up- 
regulation  of  Oct-2  but  not  the  activation  of  NF- 
kB,  another  regulatory  protein  that  has  been  im- 
plicated in  the  control  of  the  activity  of  the  k 
locus.  We  propose  a  model  in  which  the  con- 
certed action  of  increased  levels  of  Oct-2  and  ac- 
tivated NF-kB  controls  the  proper  stage-specific 
expression  of  the  k  locus.  Our  results  suggest  that 
NF-kB  may  also  be  involved  in  regulating  Oct-2 
transcription  during  pre-B  cell  differentiation. 
Therefore  this  transition  in  B  cell  ontogeny  may 
be  controlled  by  a  cascade  of  transcriptional 
regulators. 
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TT  w  fcMATOPOIESIS  refers  to  a  complex  develop- 
JL  JLmental  process  through  which  pluripotent 
stem  cells  in  mammalian  fetal  liver  or  adult  bone 
marrow  give  rise  to  several  types  of  terminally 
differentiated  blood  cells.  The  immune  system  in- 
corporates many  of  these  cell  types,  which  in- 
clude B  and  T  lymphocytes,  granulocytes,  mono- 
cytes, and  killer  cells,  to  protect  an  organism 
from  infection  by  a  variety  of  means.  Thus  an  ef- 
fective immune  response  depends  on  a  properly 
functioning  hematopoietic  pathway. 

When  defects  arise  in  this  pathway,  the  effects 
are  often  severe.  In  some  cases,  precursor  cells 
cannot  develop  to  maturity,  resulting  in  immuno- 
deficiency. In  other  cases,  uncontrolled  prolifera- 
tion of  developing  cells  results  in  leukemias  and 
lymphomas.  To  understand  the  basis  of  these  de- 
fects, we  need  to  explain  at  the  molecular  level 
the  regulation  of  the  hematopoietic  pathway. 

The  primary  objective  of  our  research  is  to 
identify  and  characterize  regulatory  proteins  for 
one  stage  of  hematopoiesis:  the  maturation  path- 
way for  B  lymphocytes.  These  cells  produce  anti- 
body molecules  and  therefore  play  a  central  role 
in  an  immune  response.  In  numerous  laboratories 
a  variety  of  approaches  have  been  employed  to 
define  molecules  that  regulate  B  cell  maturation. 

One  approach  has  been  to  identify  components 
of  the  bone  marrow  microenvironment  (which 
surrounds  precursor  B  cells)  essential  for  cell 
growth  and  differentiation.  One  such  component 
is  a  soluble  growth  factor  called  interleukin-7, 
which  is  secreted  by  bone  marrow-derived  cells 
and  promotes  proliferation  of  specific  pre-B  cell 
types.  Additional  growth  factors  and  stromal  cell 
membrane  proteins  have  been  identified  that  may 
promote  further  stages  of  B  cell  development. 

A  second  approach  has  been  to  characterize 
genes  that  obstruct  the  differentiation  process. 
These  genes  include  several  viral  oncogenes  and 
their  cellular  homologues.  The  Y-abl  oncogene, 
for  example,  is  leukemogenic  in  specific  pre-B 
cell  types,  preventing  them  from  further  differ- 
entiation. The  c-myc  oncogene  also  facilitates 
pre-B  cell  transformation,  and  one  study  has 
shown  that  c-myc  transformation  was  prevented 


by  artificially  altering  the  B  cell  differentiation 
pathway.  Further  studies  of  the  structure  and 
function  of  these  oncogenes  are  providing  insight 
into  the  control  of  differentiation. 

We  are  utilizing  a  third  approach  to  define  B 
cell  regulatory  molecules.  We  are  identifying 
proteins  that  directly  activate  or  inactivate  genes 
expressed  at  specific  stages  of  differentiation. 
The  regulation  of  gene  expression  appears 
usually  to  be  carried  out  by  modulation  of  the 
rate  of  initiating  transcription  (RNA  synthesis). 
Transcriptional  analysis  as  a  means  of  studying  B 
cell  differentiation  has  already  been  employed 
successfully  and  extensively  with  the  genes  that 
encode  the  antibody,  or  immunoglobulin  (Ig), 
molecules.  In  fact,  transcription  of  these  Ig  genes 
has  been  studied  in  more  detail  than  that  of  al- 
most any  other  cellular  gene. 

These  studies  have  revealed  at  least  two  impor- 
tant regulators  of  B  cell  differentiation.  Both  regu- 
latory proteins,  OCT-2  and  NF-/cB,  bind  to  DNA 
sequences  in  the  control  regions  for  Ig  genes  and 
play  a  role  in  specific  activation  of  Ig  RNA  synthe- 
sis in  developing  B  cells.  Besides  these  two  pro- 
teins, several  other  DNA-binding  proteins, 
currently  less  well  characterized,  have  been 
identified  that  also  may  play  a  role  in  regulating 
Ig  transcription. 

The  analysis  of  Ig  transcription  and  the  identifi- 
cation of  OCT-2  and  NF-/cB  have  contributed  sig- 
nificantly to  our  understanding  of  B  cell  differen- 
tiation. Surprisingly,  however,  there  has  been 
little  effort  toward  characterizing  the  transcrip- 
tional regulation  of  several  other  genes  known  to 
be  expressed  specifically  in  early  B  cells.  We 
therefore  have  chosen  to  extend  the  analysis  of  B 
cell  differentiation  by  focusing  on  the  transcrip- 
tional regulation  of  a  new  gene  that  is  regulated 
in  a  precise  fashion  during  B  cell  ontogeny: 
the  terminal  deoxynucleotidyltransferase  (TdT) 
gene.  This  gene  encodes  a  DNA  polymerase  that 
appears  to  play  a  role  in  generating  diversity 
within  the  antibody  molecules. 

We  chose  to  analyze  TdT  expression  for  four 
reasons.  First,  the  gene  is  tightly  regulated,  being 
expressed  exclusively  in  immature  lymphocytes. 
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Many  other  genes  expressed  in  early  lymphocytes 
are  also  expressed  in  other  tissues.  Second,  un- 
like the  Ig  genes,  which  are  permanently  acti- 
vated during  B  cell  differentiation,  the  TdT  gene 
is  only  transiently  activated  and  then  remains  off 
in  the  mature  cell.  This  expression  pattern  pro- 
vides the  opportunity  to  analyze  mechanisms  for 
transcriptional  activation  as  well  as  inactivation 
during  lymphopoiesis.  Third,  the  TdT  gene  is  not 
only  expressed  in  early  B  cells,  but  also  in  devel- 
oping T  lymphocytes,  which  are  responsible  for 
the  cell-mediated  immune  response.  It  will 
therefore  be  interesting  to  analyze  common  regu- 
latory proteins  and  pathways  for  these  two  dis- 
tinct lymphoid  lineages.  And  finally,  until  the  re- 
cent isolation  of  two  new  genes,  called  RAG-1 
and  RAG-2,  TdT  was  the  only  gene  available 
whose  product  was  thought  to  play  a  specific  role 
in  the  gene  rearrangement  process  responsible 
for  antibody  diversity.  Because  the  rearrange- 
ments are  likely  to  be  key  events  that  drive  B  and 
T  lymphocyte  differentiation,  it  is  important  to 
investigate  the  regulation  of  the  rearrangement 
machinery  in  order  to  understand  the  control  of 
lymphopoiesis. 

Our  analysis  of  TdT  regulation  has  suggested, 
quite  surprisingly,  that  the  TdT  control  mecha- 
nisms are  fundamentally  very  different  from  Ig 
control  mechanisms.  Moreover,  the  TdT  tran- 
scriptional control  mechanisms  may  be  represen- 
tative of  those  for  a  variety  of  genes  expressed  in 
early  B  and/or  T  cells. 

Both  the  general  structure  of  the  transcrip- 
tional control  region  within  the  TdT  locus 
(called  the  promoter)  and  the  lymphocyte-spe- 
cific DNA-binding  proteins  for  TdT  transcription 
appear  to  be  unique  when  compared  with  our 
knowledge  of  Ig  transcription.  The  TdT  promoter 
does  not  contain  a  TATA  box,  a  common  DNA  se- 
quence element  found  in  the  Ig  promoters  and  in 
most  other  promoters  that  have  been  character- 
ized in  detail.  Moreover,  the  lymphocyte-specific- 
ity of  the  TdT  promoter  appears  to  be  regulated  in 
part  by  a  novel  lymphocyte-specific  DNA-binding 
protein,  called  LyF- 1 ,  rather  than  by  the  OCT-2  or 
NF-kB  proteins. 

Most  importantly,  these  features  of  the  TdT 
promoter  are  found  in  the  promoters  for  a  num- 


ber of  other  genes  activated  in  early  B  and/or  T 
cells.  These  include  the  B  cell-specific  X5  and 
VpreB  genes  and  the  two  distinct  promoters  for 
the  T  cell-specific  Ick  gene.  All  of  these  pro- 
moters lack  TATA  boxes  and  contain  at  least  two 
binding  sites  for  LyF- 1 . 

Our  studies  currently  are  focused  on  under- 
standing the  two  unique  qualities  of  TdT  tran- 
scription described  above:  1)  the  ability  to  direct 
accurate  RNA  synthesis  in  the  absence  of  a  TATA 
element  and  2)  the  apparent  use  of  the  LyF-1  pro- 
tein to  control  the  lymphocyte  specificity  of  TdT 
expression. 

In  place  of  the  TATA  element,  which  typically 
is  located  30  nucleotides  upstream  of  the  tran- 
scription start  site,  the  TdT  promoter  contains  a 
distinct  element  that  overlaps  the  start  site.  This 
element,  which  we  call  an  initiator  (Inr),  is  like 
the  TATA  element  in  that  it  is  important  for  pro- 
moter function  and  also  pinpoints  the  RNA  start 
site  to  a  specific  nucleotide.  We  recently  have 
found  functional  Inr  elements  in  many  different 
genes,  even  though  a  precise,  consensus  DNA  se- 
quence has  not  been  detected. 

We  have  also  found,  through  analysis  of  syn- 
thetic promoters,  that  the  TATA  element  is  domi- 
nant over  the  Inr  element.  In  a  promoter  contain- 
ing both  TATA  and  Inr,  the  TATA  element  directs 
the  location  of  the  transcription  start  sites.  Other 
studies  are  providing  further  insights  into  the 
functioning  of  the  Inr  element  and  its  relation- 
ship to  TATA. 

The  LyF-1  protein  appears  to  play  a  role  in  the 
lymphocyte  specificity  of  TdT  transcription.  It  is 
found  predominantly  in  lymphoid  cells  and  binds 
tightly  to  two  sites  in  the  TdT  promoter  that  are 
important  for  lymphocyte-specific  transcription. 
We  have  purified  this  protein  in  order  to  demon- 
strate that  it  can  bind  to  promoters  for  the  other 
lymphocyte-specific  genes  mentioned  above. 
Moreover,  the  purified  protein  is  being  used  to 
reach  one  of  our  primary  goals:  isolation  of  the 
gene  encoding  LyF- 1 .  This  gene  will  allow  us  to 
understand  better  the  mechanism  by  which  LyF- 1 
regulates  TdT  expression.  Ultimately,  our  contin- 
ued analyses  of  LyF- 1  and  of  TdT  expression  are 
likely  to  broaden  our  knowledge  of  the  control  of 
hematopoiesis. 
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THE  major  aim  of  my  laboratory  is  to  extend 
the  understanding  of  early  development  of 
the  mouse,  both  by  studying  cell  lineages  and  by 
generating  mutations  that  affect  the  embryo. 

In  the  study  of  cell  lineages,  we  are  particularly 
interested  in  determining  the  origin  of  germ  cells 
and  in  establishing  the  time  of  allocation  of  cells 
to  specific  lineages  once  the  embryo  has  been 
implanted  in  the  uterus.  In  these  studies,  em- 
bryos are  infected  at  various  stages  of  develop- 
ment vv^ith  retroviruses  transducing  the  /3-galacto- 
sidase  (|8gal)  gene  of  Escherichia  coli.  The  gene, 
used  as  a  tracer,  permits  histochemical  analysis  of 
infected  cells  and  their  progeny,  w^hich  in  turn 
permits  the  establishment  of  cell  lineage  rela- 
tionships between  jSgal  cells  in  a  clone. 

Following  implantation,  the  embryo  under- 
goes a  series  of  cleavages  accompanied  by  exten- 
sive cell  mixing.  Gastrulation  begins  at  day  7  of 
development  with  the  formation  of  the  primitive 
streak.  We  have  observed  frequent  infection  of 
postimplantation  embryos  with  (8gal  viruses  ex- 
pressing the  gene  under  the  control  of  the  viral 
promoter  or  of  an  internal  phosphoglycerate  ki- 
nase 1  promoter.  Infection  of  embryos  at  day  7 
and  8  of  development  suggests  that  extensive  cell 
mixing  stops  at  this  time,  since  embryos  with  sin- 
gle clusters  of  /3gal-expressing  cells  are  readily 
observed. 

The  second  aspect  of  our  research  concerns 
the  identification  of  developmentally  regulated 
genes  in  the  embryo.  Despite  the  mapping  of  nu- 
merous mutations,  both  spontaneous  or  induced 
by  radiation  or  chemicals,  it  has  been  difiicult  to 
determine  the  molecular  defect  on  mouse  chro- 
mosomes that  causes  a  given  phenotype.  For  this 
reason,  insertion  mutagenesis,  in  which  DNA  in- 
troduced into  the  germline  can  cause  a  mutation 
by  disrupting  or  affecting  the  expression  of  a 
gene,  is  an  attractive  research  approach,  with  the 
transgene  serving  as  a  tag  for  the  molecular  clon- 
ing of  the  affected  gene.  The  DNA  can  be  intro- 
duced into  the  embryo  by  retroviral  infection  or 
direct  microinjection  into  the  zygote  pronuclei, 
or  by  using  embryonic  stem  (ES)  cells,  which  can 
be  infected  or  electroporated  and  then  selected 


and  introduced  into  embryos  to  colonize  the 
germline. 

Over  the  past  year  we  have  developed  a 
method  for  generating  large  numbers  of  muta- 
tions in  mice.  The  method  is  useful  because,  on 
the  average,  only  1  out  of  15  transgenic  strains 
exhibits  an  overt  mutant  phenotype.  To  avoid  ex- 
tensive breeding  of  useless  strains,  we  devised  a 
screen  based  on  "promoter  traps."  A  reporter 
gene  is  placed  downstream  of  a  DNA  splice  ac- 
ceptor, and  the  construct  is  then  introduced  into 
ES  cells.  Expression  of  the  reporter  gene  can  only 
originate  from  a  flanking  cellular  promoter. 
Therefore,  transgenic  mice  derived  from  selected 
ES  cells  can  be  used  both  to  trace  the  activity  of 
the  tagged  gene,  by  expression  of  the  reporter, 
and  to  mutate  the  strain. 

We  have  used  as  reporters  both  the  /3gal  gene 
and  a  fusion  protein  encoding  two  enzymes,  (8gal 
and  neomycin  phosphotransferase.  This  fusion 
protein,  /Sgeo,  allows  direct  selection  for  pro- 
moter trap  events.  Thirty- four  transgenic  lines 
have  now  been  generated  using  a  retroviral  pro- 
moter trap  vector,  and  they  are  being  examined 
both  for  patterns  of  expression  and  for  pheno- 
type. Among  the  first  13  strains  analyzed,  some 
have  been  found  in  which  the  promoter  trap 
caused  embryo  death  and  some  in  which  no  overt 
phenotype  is  associated  with  the  mutation.  This 
suggests  that  a  certain  proportion  of  genes  are 
unessential  for  normal  development.  To  identify 
the  mutated  genes,  we  have  started  to  clone  the 
sites  of  viral  integration. 

ES  cells  can  be  used  also  to  select  for  mutations 
in  specific  genes.  Efforts  in  the  laboratory  have 
focused  initially  on  the  gene  encoding  c-src,  a 
proto-oncogene  that  is  a  tyrosine  kinase.  It  is 
thought  to  play  a  role  in  the  cell  cycle  and  is 
highly  expressed  in  neurons  and  platelets.  Con- 
structs designed  to  delete  the  c-src  activity  were 
introduced  into  ES  cells  by  electroporation,  and 
homologous  recombinant  clones  were  isolated 
by  screening  with  the  polymerase  chain  reaction 
(PGR)  technique.  In  a  given  construct,  which  in- 
cludes about  8  kilobases  of  homology,  the  fre- 
quency of  such  targeted  events  is  about  1  in  1 00. 
Southern  blot  analysis  has  demonstrated  that 
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these  are  authentic  homologous  recombination 
events. 

ES  cells  have  been  used  to  generate  germline 
chimeras.  Animals  heterozygous  for  the  src  muta- 
tion exhibit  no  phenotype,  whereas  homozygous 
animals  survive  to  birth  but  are  retarded  relative 
to  their  littermates  and  die  in  the  first  few  weeks. 
Further  analysis  of  these  animals  reveals  an  osteo- 
clast defect  and  resulting  osteopetrosis.  This  re- 
sult is  surprising,  for  it  implies  that  src  may  not 
be  essential  for  cell  types  in  which  it  is  most 
highly  expressed. 

In  many  cell  types,  other  tyrosine  kinases  re- 


lated to  c-src  may  play  overlapping  roles.  We  are 
particularly  interested  in  the  c-yes  and  fyn  genes. 
To  study  these,  we  have  generated  mutations  in 
ES  cells  by  homologous  recombination  in  c-yes 
and  are  designing  similar  experiments  with  fyn. 
The  frequency  of  homologous  recombination  at 
the  c-yes  locus  has  been  about  1  in  30,  and  the 
yes  chimeras  will  be  bred  to  ascertain  contribu- 
tion to  the  germline.  The  resulting  mutant  mice 
may  be  examined  for  the  effect  of  loss  of  yes  and 
crossed  to  src-deficient  mice  to  examine  the  in- 
terplay between  different  members  of  the  src 
gene  family. 
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EGGS  have  always  fascinated  biologists.  The 
union  of  egg  and  sperm  initiates  the  complex 
developmental  processes  that  ultimately  produce 
a  new  multicellular  animal.  However,  the  roles 
played  by  egg  and  sperm  in  embryonic  develop- 
ment are  by  no  means  equivalent.  Even  nonspe- 
cific treatments  such  as  pricking  with  a  needle 
will  stimulate  many  types  of  eggs  to  develop  in 
the  absence  of  sperm.  Without  an  egg,  however, 
neither  sperm  nor  any  other  cell  can  even  begin 
the  complex  processes  that  lead  to  an  embryo  and 
ultimately  to  an  adult.  In  fact,  eggs  have  under- 
gone extensive  and  intricate  preparations  that  al- 
low them  to  support  and  direct  embryogenesis.  If 
the  mystery  of  a  new  life  could  be  condensed  into 
a  single  cell,  then  that  cell  would  surely  be 
an  egg. 

Not  surprisingly,  egg  cells  are  very  different  in 
structure  as  well  as  in  biological  capacity  from 
other  cells.  Most  chicken  cells  weigh  less  than  1 
millionth  of  an  ounce,  whereas  a  single  chicken 
egg  makes  a  nice  breakfast.  However,  in  addition 
to  size,  eggs  diverge  in  many  important  ways  from 
other  cells.  The  genes  within  the  egg's  chromo- 
somes function  differently.  These  alterations  are 
so  extensive  that  egg  chromosomes  appear  in  the 
microscope  quite  unlike  those  in  other  tissues.  As 
a  result  of  these  differences,  in  eggs  the  products 
of  many  genes  accumulate  and  are  stored  in  spe- 
cial forms  that  can  be  utilized  at  precisely  appro- 
priate times  during  embryonic  development.  In- 
deed the  presence  of  so  much  stored  material  is 
one  of  the  reasons  for  the  enormous  size  of  many 
eggs. 

A  great  deal  remains  to  be  learned  about  the 
structure  of  eggs.  How  many  gene  products  are 
specially  stored  there,  and  what  are  their  specific 
functions  later  in  development?  Is  each  product 
located  in  a  particular  place?  What  are  the  special 
mechanisms  that  allow  gene  products  to  be  pro- 
duced and  stockpiled  in  the  appropriate  manner? 
The  sheer  complexity  of  an  egg,  with  its  tens  of 
thousands  of  specific,  highly  organized  compo- 
nents, has  until  recently  prevented  all  attempts  to 
describe  egg  structure  in  molecular  detail,  much 
less  to  understand  the  logic  that  allows  this  struc- 


ture to  begin  development  into  an  even  more  in- 
tricate adult  organism.  However,  during  the  last 
1 0  years  or  so,  the  advent  of  powerful  new  meth- 
ods in  molecular  biology,  such  as  gene  cloning 
and  gene  transfer,  in  combination  with  Drosoph- 
ila  (fruit  fly)  genetics,  has  begun  to  unravel  the 
fascinating  secrets  stored  inside  an  eggshell. 

Using  Genetics  to  Study  Eggs 

It  is  now  possible  to  study  structures  as  com- 
plex as  eggs,  because  both  genetics  and  molecu- 
lar biology  allow  us  to  take  eggs  apart  gene  by 
gene.  Each  component  of  an  egg  is  specified  by  a 
gene  carried  on  the  chromosomes.  Much  research 
on  eggs  utilizes  the  fruit  fly,  since  over  70  years 
of  genetic  studies  have  revealed  far  more  about 
its  genes  than  those  of  any  other  complex  organ- 
ism, including  the  human.  A  genetic  mutation  in- 
activates one  specific  gene  among  the  20,000  to 
50,000  in  the  fruit  fly.  If  that  gene  specifies  an 
egg  component,  then  the  mutant  fly  will  produce 
defective  eggs.  Frequently,  however,  the  eggs 
can  still  partially  function.  For  example,  in  a  par- 
ticular mutant,  development  might  begin  nor- 
mally, but  then  stop  due  to  a  lack  of  stored  food. 
By  studying  what  goes  wrong  with  the  mutant 
eggs,  biologists  can  learn  much  about  what  the 
product  of  that  particular  gene  does  normally. 
For  example,  in  this  case  it  would  be  concluded 
that  the  gene  was  involved  in  producing  the  food 
stored  in  the  egg.  To  understand  eggs  it  will  be 
necessary,  at  a  minimum,  to  create  mutations  and 
carry  out  such  studies  on  all  20,000  fruit  fly 
genes,  one  by  one. 

If  researchers  were  limited  to  looking  at  egg 
defects  in  a  microscope,  progress  would  still  be 
slow.  Many  of  the  defective  eggs  would  not  con- 
tain any  detectable  problems;  they  just  wouldn't 
work.  It  is  at  this  point  that  the  ability  to  clone 
molecularly  the  gene  under  study  becomes  es- 
sential to  make  further  progress.  Cloning  is  sim- 
ply a  way  of  purifying  an  individual  gene  so  that 
its  DNA structure  can  be  determined  in  the  labora- 
tory. If  someone  could  invent  a  piece  of  filter 
paper  that  would  allow  only  one  specific  gene  to 
pass  through  while  the  other  20,000  stayed  be- 
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hind,  then  gene  cloning  could  be  replaced  by  a 
coffeemaker. 

Gene  purification  has  two  major  benefits.  First, 
the  structure  of  the  gene's  product,  a  specific 
protein  that  actually  becomes  part  of  the  egg,  can 
now,be  determined  by  DNA  sequencing.  Second, 
specific  antibodies  can  be  prepared  that  bind 
only  to  the  product  of  that  particular  gene.  When 
these  antibodies  are  used,  eggs  that  appeared  nor- 
mal in  the  microscope  can  suddenly  be  seen  to 
have  specific  defects  that  are  responsible  for  their 
inability  to  function  normally.  Such  molecular  bi- 
ological studies  allow  us  to  understand  in  much 
greater  detail  where  particular  gene  products  are 
located  in  a  egg,  where  they  go  as  development 
begins,  and  what  role  they  are  likely  to  be  playing 
in  the  process  of  development. 

Covert  Signals 

One  project  in  our  research  group  concerns  an 
unusual  role  played  by  the  egg's  nucleus  during 
oogenesis.  The  nucleus  of  a  cell  contains  its 
genes  and  is  the  site  where  gene  activity  is  con- 
trolled. Normally,  when  a  gene  becomes  active,  it 
copies  itself  and  releases  the  copies  into  the  cyto- 
plasm, where  they  instruct  the  cell  how  to  make  a 
particular  protein.  However,  the  nucleus  doesn't 
usually  target  these  copies  to  any  particular  re- 
gion of  the  cell;  they  diffuse  away  from  the  nu- 
cleus in  all  directions  through  the  cell  cytoplasm. 
This  year  we  have  carried  out  several  studies  sug- 
gesting that  Drosophila  egg  nuclei  play  a  more 
sophisticated  role  during  certain  key  times  in 
oogenesis. 

The  oocyte  nucleus  always  moves  from  a  cen- 
tral location  in  the  egg  to  a  spot  near  the  upper 
surface  of  the  egg.  To  learn  if  this  migration 
might  be  important,  we  destroyed  the  nucleus 
with  a  laser  microbeam  just  before  it  moved  up- 


ward, and  allowed  the  eggs  to  continue  develop- 
ing. If  the  nucleus  simply  broadcasts  products 
generally  throughout  the  egg,  its  loss  should 
cause  only  generalized  defects,  and  possibly 
arrest  further  egg  development.  Instead,  the  up- 
per surface  of  each  treated  egg  was  specifically 
affected;  it  now  continued  to  develop  as  though  it 
were  the  lower  surface.  (The  two  surfaces  can  be 
distinguished  by  differences  in  their  curvature 
and  in  the  eggshell.)  This  led  us  to  suspect  that 
the  egg  nucleus  was  moving  to  the  upper  surface 
in  order  to  send  specific  information  to  that  re- 
gion. The  most  likely  recipients  of  this  informa- 
tion were  some  of  the  cells  that  surround  the  en- 
tire surface  of  the  egg  as  it  develops,  called 
follicle  cells.  The  egg  nucleus  appeared  to  be 
giving  special  instructions  only  to  the  follicle 
cells  that  lie  just  above  the  upper  surface.  With- 
out these  instructions  these  cells  were  unable  to 
carry  out  functions  that  were  necessary  to  pro- 
duce a  normal  egg  upper  surface. 

Once  we  suspected  that  the  oocyte  nucleus  and 
the  follicle  cells  were  communicating,  we  sought 
to  identify  specific  genes  that  might  be  involved 
in  transferring  developmental  information.  Sev- 
eral genes  were  known  previously  whose  absence 
causes  defects  in  egg  development  very  similar  to 
those  produced  by  our  laser  treatments.  For  exam- 
ple, the  gene  torpedo  encodes  a  type  of  receptor 
protein  found  in  all  the  follicle  cells  that  may 
receive  the  signal  from  the  oocyte  nucleus.  We 
have  subsequently  identified,  cloned,  and  begun 
to  study  a  new  gene,  zeppelin,  that  is  also  re- 
quired for  upper  surface  development.  This  gene 
appears  to  be  expressed  preferentially  in  the  up- 
per follicle  cells.  It  now  appears  feasible  to  iden- 
tify most  if  not  all  the  genes  used  by  the  oocyte  to 
signal  the  upper  follicle  cells  and  to  use  them  to 
learn  exactly  what  the  transmitted  message  con- 
sists of  and  how  it  is  received  and  implemented. 
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Schematic  of  the  three-dimensional  structure  of  basic  fibroblast  growth  factor,  with  the  putative 
receptor  binding  site  highlighted  in  orange. 

From  Zhang,  J.,  Cousens,  L.S.,  Barr,  PJ.,  and  Sprang,  S.R.  1991.  Proc  Natl  Acad  Sci  USA 
88:3446-3450. 
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CELLS  communicate  with  one  another  by  se- 
creting growth  factors,  cytokines,  or  hor- 
mones and  by  responding  to  factors  produced  by 
other  cells.  These  chemical  messengers  act  by 
changing  patterns  of  gene  expression  within  the 
target  cell,  thereby  altering  its  metabolic  or  devel- 
opmental program.  One  of  the  major  unsolved 
problems  in  biology  is  to  define  the  mechanisms 
by  which  cells  specifically  recognize  these  chem- 
ical messengers  and  to  understand  how  the  mes- 
sages are  transduced  intracellularly. 

In  our  laboratory,  we  are  using  x-ray  crystallo- 
graphic  techniques  to  define  the  molecular  na- 
ture of  the  interactions  between  specific  growth 
factors  and  their  cognate  receptors.  These  recep- 
tors are  themselves  protein  molecules  distributed 
on  the  outer  surface  of  the  plasma  membrane 
surrounding  the  target  cell.  The  goal  of  our  re- 
search is  to  learn  how  receptors  specifically  rec- 
ognize growth  factors  and  how  the  receptor  mol- 
ecules' structure  and  chemical  properties  are 
changed,  if  at  all,  as  a  consequence  of  this 
interaction. 

Tumor  Necrosis  Factors 

One  family  of  factors  we  study,  called  tumor 
necrosis  factors  (TNF),  are  produced  by  cells  of 
the  immune  system.  Macrophages — white  blood 
cells  responsible  for  engulfing  cellular  debris — 
produce  large  quantities  of  TNF-a  (also  known  as 
cachectin)  when  stimulated  by  toxins  carried  on 
the  surface  of  bacteria,  as  would  occur  in  the 
event  of  infection.  The  avid  attachment  of  this 
cytokine  to  receptors  present  on  a  variety  of  cells 
in  the  body  triggers  a  series  of  events  that  dramati- 
cally alter  fat  and  triglyceride  metabolism  and 
mobilize  neutrophils.  These  events  mediate  in- 
flammation, endotoxic  shock,  and  the  wasting 
phenomenon  cachexia,  from  which  the  cytokine 
takes  one  of  its  names.  TNF-a  also  induces  a  num- 
ber of  enzymes  that  effect  the  destruction  and  re- 
modeling of  tissues. 

A  related  cytokine,  TNF-/?,  is  produced  by  T 
lymphocytes.  It  has  many  properties  in  common 
with  TNF-a,  including  the  ability  to  bind  to  the 
same  receptors,  but  the  two  cytokines  are  only  30 


percent  identical  in  amino  acid  sequence.  It  is  of 
interest  to  learn  how  such  distantly  related  mole- 
cules can  interact  with  the  same  receptor. 

The  three-dimensional  structure  of  TNF-a,  de- 
termined by  graduate  student  Michael  Eck,  re- 
vealed the  molecule  to  be  composed  of  three 
identical  protein  subunits  packed  about  a  three- 
fold axis  of  symmetry.  We  speculated  that  TNF 
may  be  capable  of  binding  three  receptors  at  once 
and  that  the  TNF-induced  aggregation  of  recep- 
tors might  trigger  the  cascade  of  molecular  events 
within  the  cell.  The  solution  of  the  structure  of 
TNF- (8  is  still  in  progress,  but  initial  results  indi- 
cate that  it  has  the  same  overall  subunit  tertiary 
structure  and  trimeric  quaternary  structure  as 
TNF-a.  Of  particular  interest,  once  more  struc- 
tural information  is  available  about  the  nature  of 
the  TNF-receptor  interactions,  will  be  to  deter- 
mine whether  different  or  similar  chemical 
groups  are  used  by  the  two  cytokines  in  recogniz- 
ing the  TNF  receptor.  The  proteins  used  in 
our  crystallographic  studies  were  provided  by 
George  Kuo  of  Chiron,  Inc.  (TNF-a),  and  Mark 
Ultsch,  Bart  DeVos,  and  Anthony  Kossiakofif  of 
Genentech,  Inc.  (TNF-/?). 

Recently  several  researchers  have  succeeded  in 
isolating  the  gene  encoding  the  receptors  that  en- 
gage TNF-a  and  -0.  Two  different  types  of  TNF 
receptor  were  found.  Surprisingly,  their  amino 
acid  sequences  bear  little  identity  to  each  other. 
Both  receptors,  however,  contain  a  structural 
"motif"  composed  of  four  inexact  copies  of  a  re- 
peat rich  in  the  sulfur-containing  amino  acid  cys- 
teine. The  cysteine  residues  may  form  crosslinks 
to  give  each  repeat  a  common  and  distinctive 
three-dimensional  structure.  Preliminary  data 
from  other  laboratories  suggest  that  each  trimer 
of  TNF-a  interacts  with  three  receptor  molecules. 

Collaborating  with  Bruce  Beutler  (HHMI,  Uni- 
versity of  Texas  Southwestern  Medical  Center  at 
Dallas),  we  have  initiated  experiments  to  express 
the  extracellular  domains  of  TNF  receptors  in 
both  mammalian  and  bacterial  cells,  which  we 
will  then  crystallize  in  both  the  presence  and  ab- 
sence of  TNF-a  and  TNF-/?.  We  expect  that  the 
three-dimensional  structures  will  show  how  the 
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two  receptors,  though  different  in  amino  acid  se- 
quence, are  able  to  recognize  the  same  cytokines. 
We  hope  also  to  learn  whether  receptor-cytokine 
engagement  induces  significant  changes  in  the 
receptor  structure  or  the  interactions  among  re- 
ceptors that  might  mediate  their  internalization 
by  cells. 

Fibroblast  Growth  Factor 

Basic  fibroblast  growth  factor  (bPGF) ,  one  of  a 
group  of  seven  structurally  related  proteins,  trig- 
gers the  induction  of  mesoderm  formation  in 
early  embryogenesis.  In  adult  animals,  the  same 
factor  initiates  processes  of  wound  repair  and  vas- 
cular regeneration.  It  is  present  in  the  extracellu- 
lar matrix,  bound  to  heparin-containing  proteo- 
glycans, where  it  stimulates  the  movements  of 
cells  to  sites  of  tissue  generation  or  repair.  The 
intracellular  domain  of  the  receptor  for  bFGF  is  a 
tyrosine  kinase,  an  enzyme  that  initiates  a  cascade 
of  intracellular  reactions  culminating  in  altered 
patterns  of  gene  expression  within  the  cell. 

The  three-dimensional  structure  of  bFGF  has 
been  determined  with  high  resolution  by  gradu- 
ate student  Jiandong  Zhang,  using  protein  sam- 
ples provided  by  Lawrence  Cousens  of  Chiron, 
Inc.  We  found  that  the  molecule  is  structurally 
related  to  members  of  the  interleukin- 1  family 
that  stimulate  the  growth  of  white  blood  cells. 
The  structure  reveals  probable  sites  for  receptor 
and  heparin  binding  at  separate  but  neighboring 
locations  on  the  molecule's  surface  and  also  sug- 
gests how  phosphorylation  of  bFGF  might  alter  its 
affinity  for  the  receptor. 

We  are  collaborating  with  Phillip  Barr  of 
Chiron,  Inc.,  and  Lewis  Williams  (HHMI,  Univer- 
sity of  California,  San  Francisco)  to  cocrystallize 
bFGF  with  its  receptor's  extracellular  domain.  At 


least  five  distinct  receptors  of  bFGF  have  been 
identified,  all  of  which  bear  extracellular  do- 
mains related  to  the  immunoglobulin  family.  We 
may  find  that  the  complexes  between  FGF  and 
receptor  dimers  (which  form  when  receptors  in- 
teract with  the  factor)  are  structurally  similar  to 
those  between  antibodies  and  antigens. 

G  Proteins 

The  signals  generated  by  the  engagement  of  ex- 
tracellular messenger  molecules  with  their  re- 
ceptors are,  in  many  cases,  transmitted  across  the 
plasma  membrane  to  members  of  the  G  protein 
family.  For  example,  when  epinephrine  binds  to 
the  i8-adrenergic  receptor,  an  intracellular  pro- 
tein called  Gsa  is  induced,  to  bind  the  nucleotide 
GTP  and  discard  a  pair  of  regulatory  subunits. 
The  GTP-bound  Gsa  is  then  an  activator  of  mem- 
brane-bound enzyme  that  catalyzes  the  synthesis 
of  the  intracellular  second  messenger  cyclic 
AMP.  Gsa  eventually  hydrolyzes  the  GTP,  allow- 
ing it  to  rebind  its  inhibitory  subunits  and  return 
to  an  inactive  state  in  which  it  can  no  longer  stim- 
ulate cyclase. 

We  are  undertaking  crystallographic  studies  of 
Gsa  to  learn  how  this  protein  may  interact  with 
other  components  of  the  signal  transduction  sys- 
tem in  which  it  operates.  The  initial  steps  toward 
a  structural  determination  of  Gsa  have  now  been 
taken  in  our  laboratory  by  research  associate 
David  Coleman,  who  has  crystallized  Gsa  and  a 
nonhydrolyzable  GTP  analogue  in  complex  with 
a  protein  expressed  in  the  laboratory  of  Alfred 
Gilman  (University  of  Texas  Southwestern  Medi- 
cal Center  at  Dallas) .  Our  goal  is  to  understand 
the  series  of  GTP-dependent  structural  changes 
within  Gsa  that  account  for  its  regulatory 
properties. 
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THE  production  and  regulated  release  of  insu- 
lin is  essential  for  normal  growth  and  the  uti- 
lization of  food.  A  deficiency  of  insulin,  or  de- 
fects in  its  action,  can  lead  to  diabetes,  a  disease 
characterized  by  high  blood  sugar  and  a  variety  of 
possible  complications,  including  blindness, 
heart  disease,  stroke,  and  increased  susceptibility 
to  certain  infections.  Diabetes  affects  2-3  per- 
cent of  the  population  of  the  developed 
countries. 

Control  of  diabetes  can  be  achieved  through 
various  combinations  of  diet,  oral  hypoglycemic 
agents,  and/or  insulin  injections,  depending  on 
type  and  severity.  Such  therapies,  however,  are 
often  less  than  fully  satisfactory;  they  may  only 
retard  the  progress  of  complications.  Abetter  un- 
derstanding of  insulin  biosynthesis,  its  regulated 
release  into  the  bloodstream,  and  its  action  on 
receptors  to  regulate  metabolism  and  growth  is 
vitally  important  to  the  development  of  new  ap- 
proaches to  the  therapy  of  this  disease. 

Insulin  is  made  in  the  islets  of  Langerhans — 
small  clusters  of  cells  in  the  pancreas.  Islet  cells 
also  secrete  other  hormones  that  appear  to  regu- 
late digestion  or  nutrient  metabolism.  These  in- 
clude glucagon,  somatostatin,  amylin,  and  pancre- 
atic polypeptide.  The  islet  hormones,  like  many 
other  regulatory  peptides  in  the  body,  are  derived 
from  larger  precursor  molecules  (proteins)  called 
preprohormones.  These  precursors  contain  addi- 
tional portions  that  serve  as  markers  to  guide  them 
along  special  intracellular  pathways,  where  they 
are  concentrated  into  storage  vesicles  and  pro- 
cessed into  their  smaller,  biologically  active 
forms.  These  are  then  secreted  into  the  blood- 
stream in  varying  proportions  to  meet  physiologi- 
cal needs.  One  goal  of  our  research  is  to  learn 
more  about  how  prohormones  like  proinsulin  are 
sorted  from  other  proteins,  concentrated  into  se- 
cretory granules,  and  processed  into  active  hor- 
mones by  specialized  protein-splitting  enzymes. 

Insulin  secretion  from  /3-ceIls  in  response  to  ele- 
vated plasma  glucose  is  a  complex  electrochemical 
process  resembling  the  transmission  of  nerve  im- 
pulses or  the  contraaion  of  the  heart.  In  the  insulin- 
producing  (S-cells,  a  specialized  sensor  mechanism 


couples  the  metabolism  of  glucose  to  ion  channels 
in  the  plasma  membrane.  Upon  depolarization  of 
the  membrane,  these  channels  allow  the  selective 
entry  of  calcium  into  the  cell,  triggering  the  release 
of  secretory  granule  contents. 

Certain  oral  hypoglycemic  agents  used  to  treat 
diabetes — the  sulfonylureas — appear  to  stimulate 
insulin  secretion  by  inhibiting  a  specialized  potas- 
sium channel  in  the  (S-cell  membrane,  which  then 
initiates  electrical  depolarization  of  the  cell.  We 
are  trying  to  learn  more  about  the  structure  of  this 
and  other  important  ion  channels  to  understand 
both  their  normal  functions  and  their  possible 
malfunction  in  some  forms  of  diabetes. 

We  are  also  studying  mutations  that  affect 
(pro)insulin  structure.  Some  of  these  mutations, 
occurring  in  families  with  mild  diabetes,  give  rise 
to  abnormal  insulin  molecules  with  greatly  re- 
duced biological  activity,  as  reflected  in  reduced 
binding  to  insulin  receptors.  Certain  other  muta- 
tions in  the  insulin  gene  primarily  affect  the  con- 
version of  proinsulin  to  insulin,  leading  to  other 
familial  disorders  known  as  hyperproinsulinemias. 

Insulin  acts  on  tissues  by  binding  to  a  large, 
complex  receptor  protein  on  the  surface  of  cells. 
This  activates  a  tyrosine  kinase  that  alters  many 
intracellular  processes  through  a  cascade  of 
phosphorylations.  Binding  of  insulin  to  the  re- 
ceptor also  leads  to  its  uptake  and  degradation  in 
the  liver  and  other  tissues.  This  process,  known  as 
receptor-mediated  endocytosis,  may  also  play  a 
role  in  the  generation  of  some  biological  re- 
sponses to  insulin  and  in  mediating  the  rapid  re- 
moval of  insulin  from  the  circulation.  By  studying 
defective  or  modified  precursor,  hormone,  and/ 
or  receptor  molecules,  we  hope  to  learn  more 
about  normal  islet  hormone  production  and  ac- 
tion and  about  derangements  in  these  processes 
that  can  lead  to  diabetes  or  other  diseases. 

Our  current  areas  of  investigation  include  the 
following: 

•  mechanisms  of  intracellular  sorting  and  pro- 
teolytic processing  of  hormone  precursors, 

•  mechanisms  regulating  insulin  secretion, 

•  mutant  human  insulin  or  insulin  receptor 
genes  and  their  functional  significance, 
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•  structure  and  function  of  the  human  insuHn 
receptor, 

•  new  hormones  of  the  islets — e.g.,  the  re- 
cently discovered  amylin  peptide,  and 

•  evolution  of  insulin  and  insulin-like  growth 
factojs  and  their  related  receptors. 

Identification  of  Prohormone-converting 

Enzymes 

Using  recently  developed  methods  of  specific 
DNA  amplification,  we  have  identified  two 
cDNAs  (PC2  and  PC3)  from  neuroendocrine  cells 
that  encode  proteolytic  domains  similar  in  struc- 
ture to  those  of  Kex2,  a  yeast  prohormone- 
processing  enzyme.  We  are  attempting  to  learn 
more  about  the  expression  and  function  of  these 
proteases  in  i8-cells  and  to  delineate  their  proteo- 
lytic properties.  Our  approach  is  to  express  the 
proteases  in  a  variety  of  cells  in  vitro,  along  with 
prohormones  such  as  proinsulin  and/or  POMC 
(proopiomelanocortin) . 

Studies  on  Potassium  Ion  Channels  of  jS-Cells 

Regulation  of  insulin  secretion  involves  the  co- 
ordinated control  of  ion  channels  in  the  /3-cell 
membrane.  We  have  recently  isolated  and  charac- 
terized cDNA  and  genomic  clones  encoding  a 
voltage-dependent  channel  isoform  expressed 
in  human  islets  and  in  a  human  insulin-producing 
tumor.  This  channel,  designated  hPCNl,  con- 
sists of  613  amino  acids  and  is  related  to  the 
Shaker  family  of  Drosophila  channels.  Two 
other  human  channel  isoforms  that  we  have 
isolated,  hPCN2  and  hPCN3,  are  homologous  to 
hPCNl,  with  55  and  65  percent  amino  acid  se- 
quence identity,  respectively. 

The  electrophysiological  characteristics  of 
hPCNl  have  been  examined  by  voltage-clamp  re- 
cordings of  oocytes  injected  with  synthetic 
hPCNl  RNA.  These  have  revealed  hPCNl  to  be  of 
the  delayed  rectifier  type.  The  channel  could 
function  to  restore  the  resting  membrane  poten- 
tial of  j8-cells  after  depolarization  and  thereby 
contribute  to  the  regulation  of  insulin  secretion. 
Efforts  are  also  under  way  to  identify  and  charac- 
terize the  ATP-dependent  channel  that  plays  a 
key  role  in  initiating  i8-cell  depolarization  in  re- 
sponse to  glucose. 

Studies  of  the  Insulin  Receptor 

We  have  studied  a  mutation  in  the  insulin  re- 
ceptor protein  that  gives  rise  to  severe  insulin- 
resistant  diabetes.  The  defect  in  this  patient  is"  a 
single  amino  acid  replacement  that  prevents  the 
receptor  precursor  from  being  processed  into 
mature  receptor  a-  and  /3-subunits  on  the  cell  sur- 


face. Studies  are  now  nearing  completion  on  mu- 
tagenesis of  the  cleavage  site  of  the  human  insu- 
lin proreceptor  to  determine  the  exact  structural 
requirements  for  its  processing  by  a  specialized 
cellular  protease  to  generate  functional  recep- 
tors. We  are  also  using  mutagenesis  to  evaluate 
more  fully  the  functional  consequences  of  disor- 
dered processing  of  the  proreceptor. 

We  have  developed  methods  for  expressing 
human  insulin  receptors  at  very  high  levels  in  in- 
sect or  mammalian  cells  in  culture.  Either  normal 
or  modified  receptor  proteins  (generated  by  mu- 
tating the  cDNA  sequence)  can  be  produced  in 
amounts  sufficient  for  detailed  study  of  their 
structures,  insulin-binding  domains,  and  biologi- 
cal signaling  properties.  In  other  studies,  we  are 
analyzing  the  complexity  of  protein  phosphoty- 
rosine  phosphatases — cellular  enzymes  that  re- 
verse insulin  receptor  activation. 

Amylin,  a  New  Product  of  the  /S-CeU 

We  recently  identified  a  new  j8-cell  precursor 
molecule  that,  when  appropriately  processed, 
gives  rise  to  a  peptide  called  islet  amyloid  poly- 
peptide (LAPP),  or  amylin.  This  peptide  is  found 
in  amyloid  deposits  in  the  islets  of  elderly  dia- 
betics and  may  play  a  role  in  impairing  /3-cell 
function.  We  have  characterized  the  gene  encod- 
ing LAPP  and  have  begun  studies  on  its  biosynthe- 
sis and  secretion  in  response  to  glucose  and  other 
factors.  It  is  produced  at  a  level  about  1  percent 
that  of  insulin  and  is  cosecreted  with  insulin. 
Studies  on  LAPP  precursors  from  nine  mammalian 
species  support  the  hypothesis  that  differences  in 
the  sequence  of  LAPP  in  its  central  region  (resi- 
dues 20-29)  in  different  species  influences  their 
relative  susceptibility  to  the  deposition  of  amy- 
loid in  the  islets  or  in  insulin-producing  tumors 
(insulinomas) . 

The  Evolution  of  Insulin  and  Insulin-like 
Growth  Factors 

We  have  succeeded  in  efforts  to  identify  primi- 
tive insulin  superfamily-related  genes  (encoding 
insulin  or  the  closely  related  insulin-like  growth 
factors  IGF-I  and  -II)  in  primitive  vertebrate  or- 
ganisms, including  cyclostomes  (the  Atlantic 
hagfish)  and  amphioxus.  The  presence  of  a  hy- 
brid insulin/IGF  molecule  in  amphioxus,  a  pro- 
tochordate,  suggests  that  the  insulin-like  growth 
factors  diverged  from  an  ancestral  preproinsulin- 
like  molecule  in  the  very  earliest  stages  of  verte- 
brate evolution.  We  are  also  studying  the  origin 
and  evolution  of  insulin  and  IGF  receptors  in 
these  lower  forms. 
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KNOWLEDGE  gained  from  basic  research  in 
the  biomedical  sciences  sometimes  pro- 
vides answers  to  key  questions  in  clinical  medi- 
cine. Occasionally,  however,  the  path  is  re- 
versed, and  clinical  studies  provide  information 
or  materials  that  help  to  unravel  basic  biological 
processes.  An  example  of  this  is  our  use  of  sera 
from  human  patients  to  determine  the  roles  of 
previously  mysterious  small  particles  in  normal 
cells. 

Particles  called  small  nuclear  ribonucleopro- 
teins  (snRNPs:  pronounced  "snurps")  are  found 
in  the  nucleus  of  the  cells  of  humans  and  other 
higher  organisms.  Each  snRNP  is  a  tight  cluster  of 
one  or  more  proteins  with  a  small  RNA  molecule. 
SnRNPs  are  abundant  in  virtually  all  human  cells 
and  are  remarkably  similar  among  various  mam- 
malian species,  suggesting  that  the  particles  must 
play  important  roles  in  cells. 

Systemic  lupus  erythematosus  (SEE)  is  one  of  a 
number  of  diseases  in  which  an  individual's  im- 
mune system  mistakenly  makes  antibodies  against 
the  body's  own  molecules.  Curiously,  molecules 
that  are  very  abundant  in  cells  and  highly  con- 
served, such  as  DNA,  are  the  most  common  tar- 
gets of  autoimmunity.  Thus  SEE  patients  often 
make  autoantibodies  against  snRNPs. 

Using  SEE  patients'  antibodies  to  probe  both 
the  structures  and  functions  of  snRNPs,  we  have 
investigated  the  roles  of  a  number  of  different 
kinds  of  snRNPs  in  gene  expression.  These  inves- 
tigations began  in  1979,  when  studies  by  Michael 
Lerner  (then  an  M.D./Ph.D.  student;  now  HHMI, 
Yale  University  School  of  Medicine)  led  to  the 
hypothesis  that  the  most  abundant  snRNP  in 
mammalian  cells  (called  the  Ul  snRNP)  might  be 
involved  in  RNA  splicing,  an  early  step  in  gene 
expression.  For  a  gene's  product  to  be  made,  the 
information  in  the  gene's  DNA  is  transcribed  into 
an  RNA  copy  (called  pre-mRNA)  that,  after  being 
"processed,"  directs  the  synthesis  of  a  protein 
product.  The  DNA  and  the  pre-mRNA  contain  seg- 
ments called  exons,  which  code  for  the  gene's 
product,  and  segments  called  introns,  which  are 
noncoding  regions  interspersed  between  the 


exons.  Before  leaving  the  cell  nucleus  as  mRNA, 
the  pre-mRNA  is  cut,  the  exons  are  spliced  back 
together,  and  the  introns  are  discarded.  The  indi- 
vidual exons  must  be  joined  precisely  and  in  the 
same  order  they  originally  occupied  in  the  gene. 
Sometimes  differences  in  the  way  exons  are 
spliced  can  lead  to  different  protein  products  in 
different  tissues. 

Evidence  that  snRNPs  play  central  roles  in  pre- 
mRNA  splicing  has  been  obtained  in  several  types 
of  experiments,  including  use  of  autoantibodies 
from  SEE  patients  to  inhibit  splicing  in  active  cell 
extracts.  We  now  know  that  participation  of  the 
most  abundant  snRNPs  in  mammalian  cells  (the 
Ul ,  U2,  U5,  and  U4/U6  particles)  is  essential  and 
that  splicing  requires  assistance  from  the  snRNP 
proteins  as  well  as  their  RNA  molecules.  SnRNPs 
act  to  recognize  the  splice  junctions  and  the  so- 
called  intron  branch  point  (where  an  unusual 
RNA  structure  is  formed  as  an  intermediate  in 
splicing)  and  then  assemble  together  to  align  the 
exon  ends  so  that  precise  splicing  can  occur.  In 
this  sense,  snRNPs  are  much  like  ribosomal  sub- 
units  (which  also  contain  both  RNA  and  protein) 
that  assemble  onto  an  mRNA  to  translate  it  into 
protein. 

Current  efforts  in  splicing  are  directed  at  un- 
derstanding exactly  how  RNA- RNA  interactions  in 
the  active  splicing  body  (called  the  spliceosome) 
contribute  to  catalysis,  particularly  during  the 
first  step  of  intron  removal.  Here,  we  have  inves- 
tigated a  bizarre  type  of  splicing  where  exons 
from  two  separate  gene  transcripts  are  joined 
(trans-splicing)  and  have  learned  that  trans-splic- 
ing does  not  require  the  Ul  snRNP,  because  one 
of  the  transcripts  contains  intrinsic  information 
that  substitutes  for  Ul .  Investigation  of  the  mini- 
mal sequences  and/or  structures  needed  will  fur- 
ther elucidate  the  role  of  snRNPs  in  the  spliceo- 
some. The  possibility  that  the  most  abundant 
nuclear  RNP  currently  of  unknown  function, 
which  contains  7SK  RNA,  may  serve  to  recycle 
snRNPs  after  splicing  is  completed  is  likewise  be- 
ing actively  pursued. 

Mammalian  cells  also  contain  many  minor 


425 


Autoantibody  Probes  for  Mammalian  Gene  Expression 


snRNPs  that  are  closely  related  to  the  splicing 
snRNPs.  One  is  the  U7  snRNP,  which  is  only 
1/1,000  as  abundant  as  the  splicing  snRNPs.  We 
have  recently  demonstrated  that  it  participates  in 
forming  the  3'  ends  of  histone  mRNAs  by  using 
base-pairing  to  recognize  a  specific  sequence  in 
the  pre-mRNA  just  downstream  of  the  cut  site. 
Other  related  low-abundance  snRNPs,  which  we 
anticipate  all  function  in  some  aspect  of  mRNA 
metabolism,  include  viral  snRNPs.  For  instance, 
in  marmoset  cells  infected  by  Herpesvirus  sai- 
miri,  which  causes  malignant  transformation  of 
the  T  cells  of  the  immune  system,  there  are  five 
small  RNAs  encoded  by  the  viral  genome.  They 
are  hypothesized  to  contribute  to  either  mRNA 
processing  or  stabilization,  thereby  enhancing 
the  process  of  cellular  transformation. 

Another  type  of  patient  autoantibody  is  di- 
rected against  a  different  class  of  small  RNPs  lo- 
calized in  the  nucleolus,  where  ribosomal  RNA 
processing  and  ribosome  assembly  occur.  Re- 
cently, in  collaboration  with  Barbara  Sollner- 
Webb's  laboratory  at  the  Johns  Hopkins  Univer- 
sity School  of  Medicine,  we  established  that  the 
most  abundant  nucleolar  snRNP  (containing  U3 
RNA)  is  essential  for  the  first  step  of  ribosomal 
RNA  processing.  The  possibility  that  other  nu- 
cleolar snRNPs  act  in  subsequent  steps  is  now  be- 
ing investigated.  A  separate  study  is  designed  to 
dissect  the  signals  that  dictate  the  delivery  of 
these  snRNPs  to  their  nucleolar  site  of  action. 


Yet  another  autoantibody  type  precipitates 
EBERs,  two  small  RNAs  specified  by  Epstein-Barr 
virus  (EBV),  the  causative  agent  of  infectious 
mononucleosis  and  also  implicated  in  several  hu- 
man cancers.  Since  EBERs  are  among  the  few  viral 
products  that  are  expressed  in  EBV-transformed 
cells,  they  must  be  important  to  the  induction  or 
maintenance  of  the  transformed  state.  We  have 
recently  discovered  a  highly  abundant,  highly 
conserved  cell  protein  that  binds  the  EBERs  and 
hope  that  its  characterization  will  lead  us  to  an 
elucidation  of  EBER  function. 

Finally,  we  are  studying  the  enzyme  telomer- 
ase,  which  synthesizes  the  ends  of  chromosomes 
of  all  higher  organisms.  In  the  unicellular  ciliate 
Tetrahymena,  telomerase  has  been  shown  to  be 
an  RNP.  We  have  demonstrated  telomerase  activ- 
ity in  mammalian  cell  extracts  and  are  now  seek- 
ing to  identify  the  mammalian  snRNP.  It  ap- 
pears to  be  responsible  for  the  "healing"  of  bro- 
ken chromosomes,  as  exemplified  by  certain 
thalassemias. 

Thus  autoantibodies  are  potent  probes  for  de- 
ciphering some  of  the  fundamental  reactions  oc- 
curring in  all  mammalian  cells,  those  involved  in 
gene  expression.  Characterization  of  new  cellu- 
lar particles  like  snRNPs  is  significant  for  future 
studies  of  basic  cellular  processes  and  how  they 
might  be  altered  by  disease.  Furthermore,  our  re- 
search has  provided  new  ways  of  diagnosing  pa- 
tient autoantibodies,  which  are  helpful  in  the 
diagnosis  and  treatment  of  diseases  like  SLE. 
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OUR  general  long-term  goal  is  to  determine 
the  detailed  molecular  mechanisms  by 
which  those  proteins  and  nucleic  acids  that  are 
involved  in  the  central  dogma  of  molecular  biol- 
ogy (DNA  replication,  transcription,  and  transla- 
tion) achieve  their  biological  functions.  Virtually 
all  aspects  of  the  maintenance  and  expression  of 
information  stored  in  the  genome  involve  inter- 
actions between  proteins  and  nucleic  acids.  We 
are  seeking  to  provide  a  structural  and  chemical 
basis  for  these  fundamental  processes. 

Synthetase-tRNA  Complex 

Enzymes  called  aminoacyl-tRNA  synthetases 
translate  the  genetic  code  by  attaching  the 
correct  amino  acid  to  a  tRNA  containing  the  ap- 
propriate anticodon.  Of  significant  current  inter- 
est is  how  these  synthetases  can  accurately  distin- 
guish among  the  60  or  so  similar  tRNA  molecules. 
Furthermore,  how  does  RNA  recognition  by  a 
protein  differ  from  DNA  recognition?  Finally,  can 
the  structure  of  the  present-day  synthetase-tRNA 
complex  provide  any  insights  into  the  evolution 
of  this  central  process  and  the  evolution  of  the 
genetic  code  itself? 

We  have  determined  the  crystal  structure  of 
glutaminyl-tRNA  synthetase  (GlnRS),  a  64,000- 
molecular- weight  monomeric  protein,  com- 
plexed  with  tRNA^'"  and  ATP.  The  GlnRS  consists 
of  four  domains  arranged  to  give  an  elongated 
molecule  that  interacts  with  the  inside  of  the  L- 
shaped  tRNA  from  its  anticodon  to  its  acceptor 
end.  GlnRS  specifically  recognizes  the  correct 
tRNA  by  interactions  with  the  three  bases  of  the 
anticodon  and  with  base  pairs  of  the  amino  acid 
acceptor  stem  of  the  tRNA.  The  three  bases  of  the 
anticodon  are  unstacked  and  splayed  out;  each 
base  binds  into  a  separate  recognition  pocket  on 
the  enzyme.  The  extensive  hydrogen-bonding  in- 
teractions between  the  protein  and  the  anticodon 
bases  make  the  enzyme  specific  for  the  two  gluta- 
mine  anticodons  (UUG  and  CUG)  but  none  of  the 
62  other  possible  anticodons. 

The  protein  domain  that  contains  the  active 


site  has  a  structure  similar  to  that  of  the  analogous 
domains  of  the  tyrosyl-  and  methionyl-tRNA  syn- 
thetases and  similar  to  another  7  of  the  20  synthe- 
tases. Synthetases  for  10  amino  acids  belong  to  a 
second  unrelated  class  of  synthetases. 

Regulation  of  Gene  Expression 

In  Escherichia  coli  a  reduction  in  glucose  con- 
centration results  in  a  rise  in  the  levels  of  a  sec- 
ond-messenger molecule,  cAMP,  and  subse- 
quently to  an  increase  in  the  proteins  that 
metabolize  other  sugars.  This  is  achieved  because 
cAMP  binds  to  the  catabolite  gene  activator  pro- 
tein (CAP),  which  in  turn  binds  to  specific  se- 
quences at  transcription  start  sites,  activating  the 
transcription  of  the  catabolite  genes.  We  wish  to 
know  how  the  binding  of  cAMP  promotes  the  se- 
quence-specific DNA  binding  of  CAP  and  how 
this  binding  then  activates  the  transcribing  en- 
zyme, RNA  polymerase. 

We  have  now  determined  the  structure  of  CAP 
cocrystallized  with  a  30-bp  DNA  fragment  and 
cAMP.  The  earlier  CAP  •  cAMP  structure  had 
shown  that  each  subunit  of  this  dimer  consists  of 
two  domains,  the  larger  of  which  binds  cAMP. 
The  two  small  domains  are  seen  to  bind  DNA  with 
the  helix-turn-helix  interacting  in  the  major 
groove  as  anticipated.  Strikingly,  this  complex 
shows  a  severely  bent  DNA,  with  an  overall  bend 
of  about  90°.  Most  of  the  bend  is  achieved  by  two 
large  kinks  of  about  40°  each.  The  relationships 
between  this  CAP-induced  DNA  bend  and  tran- 
scription activation  are  presently  being  pursued. 

Replication  of  DNA 

E.  coli  DNA  polymerase  I  functions  primarily  in 
the  repair  of  DNA  but  is  homologous  to  polymer- 
ases involved  in  replication.  We  have  determined 
the  structure  of  the  Klenow  fragment,  a  portion 
of  Pol  I  that  retains  the  polymerase  and  a  3  -  to 
5'-editing  exonuclease  activity.  We  have  shown 
that  another  structural  domain,  with  a  cleft  large 
enough  to  bind  duplex  DNA,  contains  the  active 
site  for  the  polymerase  reaction,  whereas  a 


427 


Structural  Studies  of  Protein-Nucleic  Acid  Interaction 


smaller  domain  has  the  active  site  for  the  exonu- 
clease  activity.  Using  site-directed  mutagenesis, 
we  have  made  an  enzyme  devoid  of  the  editing 
exonuclease  activity  and  determined  its  struc- 
ture. We  have  grown  two  crystal  forms  of  this 
protein  complexed  with  a  small  DNA  substrate.  A 
high-resolution  structure  of  one  crystal  form 
shows  a  single-stranded  tetranucleotide  bound  to 
the  exonuclease  active  site.  The  second  crystal 
form,  grown  under  conditions  when  the  enzyme 
is  active,  is  presumed  to  have  DNA  at  the  polymer- 
ase active  site  and  has  recently  been  solved.  Al- 
though there  are  clear  changes  in  the  structure  of 
the  polymerase  domain,  no  bound  duplex  DNA 
has  yet  been  visualized.  These  structures  begin  to 
address  the  issues  of  how  these  two  active  sites 
work  together  on  the  same  DNA  substrate  and 
how  they  both  function  to  enhance  the  DNA- 
copying  fidelity  of  this  and  other  polymerases. 

Genetic  Recombination 

We  have  recently  determined  the  crystal  struc- 
tures of  two  proteins  that  are  involved  in  genetic 
recombination.  One  protein,  called  resolvase, 
catalyzes  a  site-specific  recombination  between 
two  duplex  DNAs  of  identical  sequence.  Resol- 
vase is  the  product  of  a  transposable  element  (a 
jumping  gene)  that  can  move  throughout  the  E. 
colt  population  spreading  drug  resistance  genes. 
This  protein  can  bind  to  a  specific  duplex  DNA 
sequence,  align  two  DNA  segments  having  the 
same  sequence,  cleave  the  two  DNA  duplexes, 
rearrange  the  duplexes,  and  re-ligate  them,  re- 
sulting in  a  recombinational  event.  We  have  de- 
termined the  structure  of  the  catalytic  domain  of 
this  enzyme  at  2.5  A  resolution.  This  structure 
helps  to  explain  the  phenotypes  of  many  resol- 
vase mutant  proteins  but  does  not  suggest  a  mech- 


anism for  this  recombinase.  We  have  recently  co- 
crystallized  this  protein  with  a  31 -bp  fragment 
that  contains  the  recombination  site,  whose 
structure  should  provide  clues  to  the  mechanism 
of  this  reaction. 

E.  colt  recA  protein  is  essential  for  general  re- 
combination in  E.  colt.  Using  the  energy  of  ATP 
hydrolysis,  recA  protein  promotes  the  pairing  of 
homologous  duplex  DNAs  in  preparation  for  re- 
combination. The  structure  of  recA  protein  has 
been  refined  at  2.3  A  resolution.  The  subunit 
forms  a  helical  filament  in  the  crystal  very  similar 
to  that  formed  on  DNA  and  thus  enables  us  to 
understand  the  many  mutant  recA  proteins  made 
during  the  past  decade  and  relate  its  structure  to 
its  functions  in  nucleotide  binding,  DNA  binding, 
and  the  SOS  response.  Our  goal  is  to  understand 
how  ATP  hydrolysis  and  the  homologous  pairing 
of  DNA  are  coupled. 

mv  Proteins 

We  have  made  large  quantities  of  two  proteins, 
reverse  transcriptase  and  fragments  of  an  activa- 
tor protein  called  Tat,  that  are  specified  by  HIV 
(human  immunodeficiency  virus).  Toward  our 
goal  of  crystallizing  and  establishing  the  struc- 
tures of  biologically  relevant  complexes,  we  have 
shown  that  fragments  of  Tat  bind  specifically  to  a 
portion  of  the  HIV  RNA  called  TAR.  Initiation 
complexes  formed  using  human  tRNA'''^,  an  HIV 
RNA  fragment,  and  reverse  transcriptase  are  being 
explored.  We  have  recently  calculated  an  excel- 
lent 5  A  resolution  electron  density  map  of  re- 
verse transcriptase  and  should  soon  extend  this 
study  to  3-5  A  resolution.  Our  long-term  goal  is 
to  provide  a  structural  basis  for  designing  inhibi- 
tors of  these  proteins  that  might  function  as  use- 
ful anti-AIDS  drugs. 
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Representation  of  the  crystal  structure  o/Escherichia  coli  CAP'DNA  complex.  The  transcription 
factor,  catabolite  activator  protein  (CAP),  is  represented  as  an  a-carbon  backbone  in  blue. 
Bound  cAMP  is  shown  in  red.  The  30- bp  DNA  bound  to  CAP  in  the  crystal  is  shown  in  partial 
space  filling  representation  with  the  bases  in  light  blue  and  the  sugar  phosphate  backbone  in 
yellow.  The  DNA  is  bent  90°  by  CAP  binding,  initiated  primarily  at  two  40°  kinks. 
Research  of  Thomas  A.  Steitz. 
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Visualization  of  axonal  projections  from  photoreceptor  neurons  of  the  com- 
pound eye  to  the  optic  ganglia  in  the  brain  o/Drosophila.  This  section  has  been 
stained  with  two  different  neuron- specific  antibodies  that  recognize  photore- 
ceptor cell  membranes  (red)  and  neuronal  nuclei  (green),  respectively.  Pho- 
toreceptor axons  can  be  seen  to  terminate  in  two  distinct  regions  of  the  brain. 
From  Selleck,  S.B.,  and  Steller,  H.  1991.  Neuron  6:83-99. 
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THE  overall  goal  of  our  research  is  to  under- 
stand how  the  enormous  diversity  and  speci- 
ficity of  individual  neurons  arise  during  develop- 
ment and  how  these  neurons  establish  highly 
specific  patterns  of  interconnections.  We  address 
these  questions  primarily  by  exploring  the  Dro- 
sophila visual  system,  where  it  is  possible  to  ap- 
ply powerful  genetic  and  molecular  techniques 
to  study  the  development  of  individual  cell  types. 
We  have  identified  mutations  that  affect  the  pro- 
jection pattern  of  specific  axons,  the  production 
of  neurons  from  neuronal  precursor  cells,  and  the 
pattern  of  cell  death,  and  have  begun  to  character- 
ize several  of  these  at  the  cellular  and  molecular 
level.  We  hope  that  our  results  will  eventually 
contribute  to  a  better  understanding  of  human 
neurological  disorders. 

Axon  Guidance  and  Neuronal  Cell 
Recognition 

We  are  studying  two  diff'erent  stages  of  visual 
system  development  to  investigate  the  cellular 
and  molecular  mechanisms  by  which  axons  find 
and  recognize  their  proper  synaptic  partners.  The 
optic  nerve  of  the  Drosophila  larva  is  a  simple, 
well-described  model  system.  Connectivity  de- 
fects of  the  larval  optic  nerve  can  be  rapidly  and 
reliably  detected  in  mutant  embryos  by  staining 
with  neuron-specific  antibodies.  In  addition,  a 
simple  behavioral  test,  larval  phototaxis,  pro- 
vides an  assay  for  functional  connections  of  the 
larval  optic  nerve.  This  has  allowed  us  to  screen 
systematically  for  mutants  with  abnormal  axonal 
projections,  which  can  be  subsequently  analyzed 
in  detail  with  respect  to  defects  in  axonal  guid- 
ance, target  recognition,  and  synapse  formation. 

We  have  identified  a  small  number  of  loci  that 
are  required  for  the  establishment  of  stable  con- 
nections between  the  larval  optic  nerve  and  its 
target  cells  in  the  developing  embryonic  brain. 
One  of  the  corresponding  genes,  disconnected 
{disco),  has  been  analyzed  at  the  molecular  level 
and  appears  to  encode  a  transcription  factor.  We 
are  currently  testing  the  idea  that  disco  regulates 
the  expression  of  cell  adhesion  and/or  cell  recog- 


nition molecules  that  are  required  for  the  estab- 
lishment of  stable  connections  between  the  larval 
optic  nerve  and  its  target  cells  in  the  brain. 

More  recently  we  have  begun  to  study  axon 
guidance  and  neuronal  cell  recognition  in  the 
adult  visual  system.  The  compound  eye  of  Dro- 
sophila consists  of  approximately  800  repeating 
units  called  ommatidia.  Each  ommatidium  con- 
tains eight  photoreceptor  neurons,  which  repre- 
sent three  major  cell  types  that  project  to  differ- 
ent target  cells  in  the  optic  ganglia.  The  major 
class  of  photoreceptors,  Rl-6,  establishes  synap- 
tic connections  with  neurons  in  the  first  optic 
ganglion,  the  lamina.  Photoreceptor  axons  from 
R7  and  R8  project  deeper  into  the  brain  to  differ- 
ent regions  of  the  second  ganglion,  the  medulla. 
Early  during  visual  system  development,  all  eight 
photoreceptor  axons  from  each  ommatidium 
grow  as  a  bundle  into  the  developing  brain.  The 
growth  cones  of  these  axons  have  to  navigate  over 
a  long  distance  and  make  a  number  of  highly  spe- 
cific choices. 

We  would  like  to  understand  what  signals 
guide  axons  to  their  proper  destinations  and  how 
these  signals  are  generated,  received,  and  inter- 
preted. To  address  these  questions,  we  have 
screened  for  mutations  that  perturb  the  projec- 
tion pattern  of  photoreceptor  cells  at  early  devel- 
opmental stages,  when  axons  enter  the  brain.  We 
have  found  a  number  of  mutants  with  severely 
abnormal  patterns  of  axon  ingrowth.  The  devel- 
opmental and  genetic  characterization  of  this  ma- 
terial is  in  progress. 

Innervation,  Neurogenesis,  and  Survival 
of  Target  Cells 

It  has  been  noticed  for  many  years  that  synaptic 
input  can  have  a  profound  influence  on  the  fate 
and  differentiation  of  target  cells.  Cell  death  in 
the  absence  of  incoming  projections  is  a  dramatic 
example  of  how  innervation  can  affect  develop- 
mental decisions,  and  many  neurological  dis- 
orders are  thought  to  arise  from  defective  interac- 
tions between  neurons  and  their  targets.  In 
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Drosophila  the  proper  development  of  the  adult 
optic  ganglia,  the  central  nervous  system  portion 
of  the  visual  system,  depends  on  innervation  from 
the  eye.  In  the  absence  of  retinal  innervation, 
adult  flies  entirely  lack  the  first  optic  ganglion, 
the  lamina,  which  receives  direct  synaptic  input 
from  the  outer  photoreceptor  cells  Rl-6. 

We  have  recently  discovered  that  the  birth  of 
lamina  neurons  is  controlled  by  innervation  from 
the  developing  eye.  The  arrival  of  photoreceptor 
axons  in  the  brain  induces  a  wave  of  mitotic  activ- 
ity that  produces  the  lamina  neurons.  These  re- 
sults suggest  a  novel  mechanism  for  matching  the 
number  of  target  neurons  in  the  first  optic  gan- 
glion to  the  number  of  incoming  photoreceptor 
axons,  and  they  explain  how  developmental 
synchrony  between  the  Drosophila  retina  and 


first  optic  ganglion  is  achieved.  We  are  now  using 
several  different  approaches  to  elucidate  the  de- 
tailed cellular  and  molecular  mechanisms  under- 
lying this  process. 

While  the  importance  of  retinal  innervation  on 
the  development  of  the  adult  optic  ganglia  of 
Drosophila  is  well  documented,  little  is  known 
about  retrograde  effects  of  the  brain  on  photore- 
ceptor cells  in  the  compound  eye.  We  have 
recently  discovered  the  first  evidence  for  the  ex- 
istence of  such  retrograde  effects  in  the  Drosoph- 
ila visual  system.  Although  photoreceptor  cells 
develop  normally  in  the  absence  of  connections 
to  the  optic  ganglia,  their  continued  survival  re- 
quires these  connections.  This  situation  is  remi- 
niscent of  trophic  interactions  widely  described 
in  vertebrates. 
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USING  the  nematode  Caenorhabditis  ele- 
gans,  our  laboratory  takes  a  molecular  genet- 
ics approach  to  basic  questions  in  developmental 
biology  and  neurogenetics:  What  are  the  molecu- 
lar mechanisms  by  which  cells  interact  to  estab- 
lish a  spatial  pattern  of  cell  types?  What  is  the 
genetic  and  cellular  basis  for  morphogenesis? 
What  establishes  the  asymmetry  of  individual 
cells?  How  are  the  instructions  for  innate  behav- 
ior encoded  in  the  genome?  Our  major  strategy  is 
to  identify  mutations  that  make  cells  or  animals 
misbehave  and  then  to  study  the  functions  of  the 
genes  defined  by  these  mutations,  using  a  combi- 
nation of  molecular  cloning  and  genetic  analysis. 
A  second  strategy  is  to  clone  nematode  homo- 
logues  of  genes  identified  in  mammals  and  then 
to  elucidate  the  functions  of  those  genes  in 
nematodes. 

In  this  past  year  we  focused  on  the  develop- 
ment and  function  of  the  C.  elegans  male  tail. 
The  concerted  morphogenesis  of  male-specific 
neural  cells,  along  with  specialized  muscle  and 
epidermal  cells,  forms  the  copulatory  male  tail, 
which  allows  males  to  mate  with  hermaphro- 
dites. Because  C.  elegans  hermaphrodites  are  in- 
ternally self-fertilizing — each  animal  producing 
both  sperm  and  ova — male  mating  is  a  dispens- 
able behavior.  Thus  mutant  strains  defective  in 
male  mating  can  be  easily  propagated  and  the 
mating  process  studied.  We  have  used  a  simple 
behavioral  test — the  ability  of  males  to  sire  prog- 
eny— to  isolate  more  than  100  mutants  that  are 
unable  to  mate.  Some  mutant  males  have  obvious 
defects  in  the  development  of  male-specific 
structures.  Others,  called  Cod  (for  copulation 
defective) ,  are  anatomically  normal  yet  defective 
in  mating  behavior. 

Of  the  males  with  obvious  abnormal  tails,  we 
have  focused  on  those  with  abnormal  spicules — 
innervated  structures  crucial  to  successful  mat- 
ing. Each  of  the  two  spicules  comprises  nine 
cells:  two  sensory  neurons,  one  motoneuron,  and 
six  supporting  cells.  So  far  we  have  identified  sev- 
eral genes  that  act  at  various  stages  in  develop- 
ment of  the  spicules.  For  example,  two  genes  are 
necessary  for  production  of  the  hardened  cuticle 
that  surrounds  each  spicule. 


We  have  identified  several  genes  that  are  re- 
quired for  the  decision  between  alternative  neu- 
roblast fates.  Two  cells,  Ba  and  B/3,  each  normally 
generate  distinct  sets  of  spicule  cells.  Two 
genes  promote  differentiation  as  B/?,  while  four 
others  promote  differentiation  as  Ba.  Of  the 
former  class,  the  let-60  gene  (lethal  gene  60)  is 
of  special  interest  because  we  have  discovered 
that  the  level  of  its  activity  determines  whether 
the  neuroblasts  in  question  become  Ba  or  B^:  low 
let-60  activity  makes  a  cell  BjS,  high  let-60  activ- 
ity makes  a  cell  Ba.  This  set  of  genes  also  controls 
the  fates  of  two  other  sets  of  cells  whose  fates 
during  development  depend  on  signals  from 
neighboring  cells,  suggesting  that  these  genes  en- 
code a  general  mechanism  that  controls  the  fates 
of  cells  during  development.  The  products  of  let- 
23  and  let-60  are  homologous  to  mammalian 
proto-oncogenes,  epidermal  growth  factor  (EGF) 
receptor  and  ras,  respectively,  suggesting  that 
common  mechanisms  might  control  the  fates  of 
cells  in  mammals  and  nematodes. 

Male  mating  is  the  most  complex  nematode  be- 
havior. By  studying  the  Cod  mutants,  we  hope  to 
elucidate  how  genes  control  each  step  in  this  be- 
havior. We  have  isolated  a  set  of  mutants,  have 
characterized  the  mating  defect  of  each  strain, 
and  have  begun  placing  these  mutations  on  the 
genetic  map.  Most  of  the  24  mutants  analyzed  are 
defective  at  only  a  single  step  in  the  mating  pro- 
cess. These  steps  include  1)  attraction  to  her- 
maphrodites, 2)  maintaining  contact  with  her- 
maphrodites, 3)  location  of  the  vulva,  4) 
insertion  of  spicules,  and  5)  transfer  of  sperm. 
For  example,  a  mutant  male  defective  in  step  3 
will  endlessly  circle  the  hermaphrodite  search- 
ing for  the  vulva.  A  minority  of  the  mutants  are 
ineffective  at  more  than  one  step.  Having  mutants 
blocked  at  defined  steps  will  allow  us  to  identify 
genes  and  cells  necessary  to  specify  this  innate 
behavior. 

We  are  also  identifying  the  cells  responsible  for 
each  step  in  mating  behavior  by  killing  individual 
cells  or  structures  with  laser  microbeam  irradia- 
tion and  observing  the  consequences  for  behav- 
ior. For  example,  the  spicule  motoneuron  and  at 
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least  one  of  the  two  spicule  sensory  neurons  are 
necessary  for  spicule  insertion. 

The  establishment  of  cellular  asymmetry  is  a  fun- 
damental yet  relatively  unexplored  aspect  of  cell  reg- 
ulation. We  have  begun  to  study  this  problem  in  the 
context  of  the  male  B  lineage,  M^hich  generates  the 
spicules  as  described  above,  and  in  a  cell  called 
the  2°  vulval  precursor  cell.  The  lin- 18  gene  (cell 
lineage  gene  18)  is  necessary  for  the  asymmetry  of 
the  2°  vulval  precursor  cell  and  is  the  starting  point 
for  molecular  genetic  studies  of  this  problem.  We 
have  found  that  the  2°  cells  will  orient  posteriorly  in 
the  animal,  unless  they  receive  a  signal  from  the 
developing  gonad  to  orient  anteriorly.  This  signal  is 
distinct  from  the  well-characterized  signal  that  in- 
structs these  cells  to  be  2°  cells,  and  not  other  types 
of  cells. 


In  a  project  initiated  this  year,  we  are  analyzing 
the  function  of  the  nematode  homologues  of  sig- 
nal transduction  molecules  identified  in  mam- 
mals. We  have  cloned  the  nematode  gene  similar 
to  the  raf- 1  protein  kinase  and  placed  it  on  the  C. 
elegans  genomic  map.  To  identify  the  cells  ex- 
pressing G  protein  genes,  we  have  constructed 
chimeric  genes  consisting  of  the  transcriptional 
control  regions  of  two  G  protein  a-subunit  genes 
linked  to  a  "reporter"  gene.  We  then  create  trans- 
genic nematodes  by  introducing  the  engineered 
genes  back  into  the  nematode  through  microin- 
jection of  DNA.  Both  genes  are  expressed  primar- 
ily in  neurons,  including  neurons  in  the  male  tail. 
We  plan  to  use  these  chimeric  genes  as  markers 
for  neuronal  types  to  examine  the  molecular  anat- 
omy of  the  Cod  mutants. 
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A remarkable  feature  of  the  brain  is  its  ability 
to  modify  itself.  Neuronal  activity  can 
change  the  properties  of  nerve  cells  so  that  the 
neural  circuits  responsible  for  the  brain's  com- 
putations become  physically  different.  This  abil- 
ity is  termed  neuromodulation. 

To  understand  neuromodulation  one  must 
have  an  idea  of  how  neuronal  circuits  are  con- 
structed and  how  signals  are  passed  from  one 
brain  cell  (neuron)  to  another  in  these  circuits. 
Each  neuron  in  the  brain  receives  information 
from  about  10,000  neurons  and,  in  turn,  sends 
information  to  about  10,000  others.  Information 
is  transmitted  from  one  nerve  cell  to  the  next  at 
special  points  of  contact  known  as  synapses. 
Neural  circuits,  then,  are  formed  by  chains  of  neu- 
rons that  are  interconnected  at  synaptic  contacts 
in  incredibly  complicated  ways. 

Each  time  a  nerve  impulse  arrives  at  a  synapse, 
the  sending  cell  releases  a  small  quantity  of  a 
chemical  known  as  a  neurotransmitter.  The  neu- 
rotransmitter diffuses  rapidly — in  less  than  a 
thousandth  of  a  second — to  the  receiving  neu- 
ron, where  it  binds  onto  special  proteins,  called 
receptors,  embedded  in  the  receiving  cell's  sur- 
face membrane.  When  the  neurotransmitter 
binds  onto  the  receptor,  it  causes  it  to  change 
shape  and  to  reveal  a  submicroscopic  pore 
through  which  an  electric  current  can  flow.  This 
current,  carried  by  ions,  constitutes  the  signal 
within  the  receiving  neuron  and  is  used  to  deter- 
mine when  the  receiving  cell  itself  will  produce 
impulses. 

Two  large  classes  of  receptors  are  known  -,  excit- 
atory and  inhibitory.  In  the  receiving  neuron,  ex- 
citatory receptors  cause  changes  that  tend  to  pro- 
duce nerve  impulses,  and  the  inhibitory 
receptors  do  the  opposite:  they  prevent  the  gener- 
ation of  nerve  impulses.  Thus  the  two  classes  of 
synapses  act  as  an  accelerator  and  a  brake  on 
nerve  cell  activity.  About  two-thirds  of  the  syn- 
apses in  the  brain  are  excitatory,  and  the  rest  are 
inhibitory. 

An  important  determinant  of  how  the  circuits 
function  is  the  strength  of  the  various  synaptic 
contacts.  If  a  synapse  is  strong,  it  will  be  effective 


in  helping  to  produce  (or  prevent,  if  inhibitory) 
nerve  impulses  in  the  receiving  cell,  whereas 
weak  synapses  will  be  much  less  effective. 
Clearly,  what  a  neuronal  circuit  computes  will 
depend  on  the  strengths  of  its  various  synapses. 

Our  laboratory  has  discovered  that  the  strength 
of  excitatory  synapses  is  under  control  of  the 
brain  activity  itself.  Thus  the  efficiency  with 
which  one  neuron  transmits  information  to  an- 
other is  subject  to  neuromodulatory  control  of 
still  other  neurons.  This  control  is  quite  specific 
and  can  be  targeted  to  particular  neurons  in  a 
circuit. 

The  mechanism  of  this  neuromodulatory  con- 
trol is  a  version  of  one  very  common  in  biochem- 
istry. A  special  enzyme  known  as  protein  kinase  A 
(PKA)  adds  a  phosphate  group  to  the  receptor, 
and  this  modified  receptor  then  has  a  greater  ef- 
fect on  the  receiving  neuron.  To  understand  this 
mechanism  more  completely  requires  some  addi- 
tional information  about  how  receptors  work. 

When  the  receptor  binds  its  neurotransmitter 
— glutamate,  in  the  case  of  the  excitatory  recep- 
tors— and  opens  its  pore  for  ions  to  flow  through, 
the  receptor  usually  remains  in  its  open  state  for 
about  a  thousandth  of  a  second.  Techniques  are 
available  that  permit  the  neurobiologist  to  record 
the  currents  that  flow  through  single  receptors  so 
that  we  can  observe  them  as  they  open  and  close. 

When  PKA  is  applied  to  small  patches  of  mem- 
brane containing  excitatory  receptors,  these  re- 
ceptors are  modified:  the  altered  (phosphory- 
lated)  receptors  dwell  in  the  open  state  5  to  10 
times  longer  than  usual,  and  the  signal  received  at 
the  synapse  is  5  to  10  times  larger  for  each  im- 
pulse arrival.  The  increased  strength  thus  occurs 
through  increases  in  the  open  time  of  the 
receptors. 

But  what  controls  the  PKA?  Our  laboratory  has 
found  some,  but  not  all,  of  the  answers  to  this 
question.  PKA  is  activated  by  a  rather  compli- 
cated cascade  of  events  that  involve  the  familiar 
messenger  molecule  cAMP.  This  cascade  is  ulti- 
mately engaged  by  the  action  of  chemicals  known 
as  neuromodulators  that  are  released  by  other 
neurons  when  they  are  active.  Many  neuromodu- 
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lators  operate  through  the  cAMP  cascade,  but  the 
specific  neuromodulator  used  in  this  instance, 
and  the  particular  neuronal  pathway  employed 
for  the  neuromodulatory  signal,  have  not  yet 
been  identified. 

The,  experiments  described  above  used  neu- 
rons from  a  particular  region  of  the  brain — the 
hippocampus — that  were  grown  in  tissue  cul- 
ture. In  order  to  be  sure  that  the  neuromodula- 
tory mechanisms  investigated  were  not  some  pe- 


culiarity of  cultured  neurons,  we  have  also 
demonstrated  that  the  same  effects  can  be  found 
in  brain  slices  of  the  hippocampal  region. 

Thus  our  laboratory  has  discovered  a  powerful 
mechanism  through  which  neuronal  activity  can 
regulate  synaptic  strength  and  thereby  modify  the 
computations  carried  out  by  neural  circuits.  The 
next  challenge  is  to  discover  how  the  brain  actu- 
ally makes  use  of  this  mechanism  to  switch  from 
one  compensational  state  to  another. 
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MOST  animals  large  enough  for  us  to  see  with 
the  naked  eye  consist  of  billions  or  trillions 
of  individual  cells  arranged  in  extremely  com- 
plex and  highly  conserved  patterns.  To  generate 
such  patterns,  cells  in  a  developing  organism 
must  know  where  they  are  and  choose  what  to  do 
based  on  this  information.  Research  in  our  labora- 
tory is  directed  toward  understanding  how  such 
spatial  information  is  generated  and  interpreted 
during  development. 

For  many  years  considerable  attention  has  been 
given  to  the  concept  that  gradients  of  diffusible 
"form-generating"  substances,  or  morphogens, 
might  provide  this  spatial  information.  For  exam- 
ple, a  localized  source  of  a  diffusible,  but  unsta- 
ble, morphogen  would  generate  a  stable  gra- 
dient, which  could,  in  turn,  provide  a  series  of 
concentration  thresholds  that  would  determine 
distinct  cellular  behaviors  (e.g.,  the  develop- 
ment of  particular  pattern  elements)  as  a  function 
of  distance  from  the  source. 

During  the  past  10-15  years,  the  pioneering 
embryologic  and  genetic  experiments  of  Chris- 
tiane  Niisslein-Volhard  and  her  colleagues  have 
established  that  several  distinct  morphogens 
specify  the  basic  body  plan  of  the  fruit  fly  Dro- 
sophila early  in  development.  One  of  these,  the 
protein  product  of  the  gene  bicoid  {bed),  con- 
trols most  aspects  of  the  anterior  pattern  of  the 
body,  while  the  products  of  two  other  genes, 
nanos  (nos)  and  torso  (tor),  seem  to  specify  the 
pattern  of  the  remaining  posterior  and  terminal 
portions.  Our  principle  aim  is  to  determine  the 
roles  and  modes  of  action  of  each  of  these  differ- 
ent morphogen  systems. 

Control  of  Anterior  Body  Pattern 
by  the  bed  Morphogen 

Dr.  Niisslein-Volhard  and  her  colleagues  estab- 
lished that  transcripts  of  the  bed  gene  are  synthe- 
sized and  then  transported  to  the  anterior  pole  of 
the  egg  during  oogenesis.  Following  fertilization, 
these  tightly  localized  transcripts  are  translated, 
and  the  resulting  protein  diffuses  from  its  site  of 
synthesis,  generating  a  concentration  gradient. 
To  determine  how  the  bed  protein  gradient  dic- 


tates anterior  body  pattern,  we  have  sought  to 
identify  direct  targets  of  bed  action  and  to  deter- 
mine the  mechanisms  that  allow  different  con- 
centrations of  protein  to  control  how  these  tar- 
gets respond. 

In  an  initial  series  of  experiments,  we  estab- 
lished that  the  bed  gradient  is  responsible  for 
generating  an  opposing  gradient  of  another  regu- 
latory protein,  caudal  {cad).  Although  we  have 
yet  to  determine  the  exact  mechanism  by  which 
this  response  occurs,  our  data  suggest  that  bed 
protein  may  repress  translation  of  maternally  de- 
rived cad  transcripts  that  are  ubiquitously  distrib- 
uted in  the  early  embryo.  More  recently,  we  have 
identified  a  second  target  of  bed  action,  and  in 
this  case  determined  the  mechanism  involved. 

The  target  is  a  zygotic  regulatory  gene,  hunch- 
back {hb),  which  is  normally  activated  in  a  well- 
defined  anterior  portion  of  the  early  embryo.  We 
have  shown  that  bed  protein  directly  binds  and 
transcriptionally  activates  the  hb  gene  in  a  fash- 
ion that  depends  critically  on  bed  protein  con- 
centration. Hence,  the  anteroposterior  gradient 
of  bed  protein  provides  a  distinct  threshold  dic- 
tating where  the  hb  gene  is  expressed. 

These  and  other  experiments  have  also  pro- 
vided evidence  that  the  bed  gradient  triggers  a 
series  of  other  distinct  responses  by  the  same 
mechanism — each  response  governed  by  the  par- 
ticular binding  affinity  of  a  different  target  gene 
for  the  bed  protein.  The  ability  of  the  bed  protein 
to  control  the  patterns  of  subordinate  signaling 
molecules  by  at  least  two  distinct  mechanisms,  as 
well  as  the  demonstration  that  it  acts  directly  on 
several  distinct  targets,  clearly  indicates  how  the 
graded  distribution  of  a  single  morphogen  can 
provide  a  complex  set  of  instructions  for  organiz- 
ing body  pattern. 

Control  of  Posterior  Body  Pattern  by  the 
nos  and  hb  Morphogens 

The  development  of  posterior  pattern  is  largely 
dependent  on  the  nos  morphogen,  which  ap- 
pears to  spread  from  a  tightly  localized  source  at 
the  posterior  pole  during  early  embryogenesis.  It 
was  initially  thought  that  this  factor  operates  like 
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the  bed  morphogen  in  directly  activating  a  num- 
ber of  subordinate  signaling  molecules  in  spe- 
cific domains  of  the  early  embryo.  However,  we 
have  recently  conducted  experiments  showing 
that  this  view  is  incorrect  and  have  provided  evi- 
denceu  that  nos  functions  only  to  ensure  the 
graded  expression  of  another  morphogen — sur- 
prisingly, the  hb  protein. 

As  discussed  above,  graded  bed  protein  di- 
rectly activates  transcription  of  the  hb  gene  in  a 
broad  anterior  domain  of  the  embryo.  The  hb 
gene,  however,  is  also  transcribed  maternally: 
upon  fertilization,  maternal  hb  transcripts  situ- 
ated in  the  posterior  half  of  the  embryo  appear  to 
be  blocked  translationally  and  subsequently  de- 
stabilized by  localized  nos  activity.  Thus,  like 
bed,  nos  also  generates  differential  hb  protein 
expression  along  the  anteroposterior  axis,  albeit 
by  a  different  molecular  mechanism. 

Differential  hb  activity,  whether  generated 
under  the  control  of  the  bed  or  nos  determinants, 
appears  to  be  involved  in  specifying  most  of  the 
posterior  body  pattern.  To  test  whether  hb  pro- 
tein functions  as  the  primary  gradient  morphogen 
organizing  the  posterior  body  pattern,  a  series  of 
mutant  embryos  have  been  generated  that  lack 
both  the  anterior  (bed}  and  terminal  (tor)  mor- 
phogen systems,  but  retain  the  posterior  system 
(nos)  and  maternal  hb  transcripts  derived  from  0 
to  4  copies  of  the  hb  gene.  Under  these  circum- 
stances, the  formation  of  posterior  body  pattern  is 
solely  dependent  on  the  graded  distribution  of  hb 
protein  derived  from  maternal  transcripts.  More- 
over, different  threshold  concentrations  of  hb 
protein  can  be  shown  to  determine  distinct 
boundaries  of  expression  of  subordinate  regula- 
tory molecules,  most  notably  the  gap  gene  pro- 
teins Kriippel  (Kr) ,  knirps  (knf) ,  and  giant  (gt) , 
which  specify  the  position  and  polarity  of  most  of 
the  abdominal  segments. 

These  experiments  indicate  that  posterior  body 
pattern  is  established  under  the  control  of  an  hb 
morphogen  gradient,  much  as  anterior  body  pat- 
tern is  controlled  by  the  bed  gradient.  Since  the 


bed  gradient  can  itself  create  an  hb  protein  gra- 
dient by  its  direct  and  localized  effect  on  hb  tran- 
scription, it  has  the  capacity  in  principle  to  spec- 
ify virtually  the  entire  body  pattern.  These 
findings  reveal  a  basic  asymmetry  in  the  control 
of  the  anteroposterior  body  plan  and  provide  an 
indication  of  how  a  relatively  simple  morphogen 
gradient  like  bed  can  generate  a  more  elaborate 
and  long-ranging  system  of  spatial  cues. 

Control  of  Terminal  Body  Pattern 
by  the  tor  Receptor 

Recent  studies  in  other  laboratories  have  estab- 
lished the  existence  of  a  third  primary  determi- 
nant system  that  distinguishes  end  portions  of  the 
body  pattern  from  the  middle.  A  critical  determi- 
nant in  this  system  appears  to  be  a  receptor  tyro- 
sine kinase  encoded  by  the  tor  gene.  In  studies 
performed  in  this  laboratory,  the  tor  protein  has 
been  shown  to  be  expressed  uniformly  along  the 
surface  of  early  embryos,  even  though  genetic  ex- 
periments indicate  that  it  must  normally  be  active 
at  both  poles  but  inactive  in  the  middle  portion 
of  the  body.  Hence  the  tor  kinase,  as  anticipated 
from  its  deduced  molecular  structure,  appears  to 
function  as  a  ubiquitous  surface  receptor  that  is 
activated  only  in  the  vicinity  of  the  poles. 

Additional  genetic  and  immunohistochemical 
experiments  have  confirmed  this  conclusion  and 
suggested  that  tor  is  activated  by  a  localized  11- 
gand  initially  tethered  to  the  extracellular  matrix 
surrounding  the  embryo.  Finally,  using  a  muta- 
tion of  the  tor  gene  that  causes  temperature- 
sensitive  constitutive  activity  of  the  receptor,  dif- 
ferent levels  of  tor  activity  have  been  shown  to 
specify  distinct  portions  of  the  terminal  pattern. 
Thus  polarized  activity  of  the  tor  kinase  at  the 
surface  of  the  embryo  may  specify  terminal  pat- 
tern by  generating  one  or  more  gradients  of  intra- 
cellular signaling  molecules — perhaps  phos- 
phorylated  substrates  of  the  protein.  The 
specification  of  terminal  pattern  may  therefore 
differ  from  that  of  anterior  and  posterior  pattern 
only  in  the  mechanism  by  which  graded  expres- 
sion of  the  active  morphogen  is  generated. 
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MANY  cells  send  out  signals  by  secreting  mes- 
senger molecules.  These  molecules  are  rec- 
ognized by  specific  receptors  on  target  cells.  The 
most  striking  example  of  this  communication  pro- 
cess is  found  in  neurotransmission — the  relay  of 
information  from  one  neuron  to  another  in  the 
nervous  system.  Neurotransmission  occurs  at  syn- 
apses, highly  specialized  contact  sites  between 
two  nerve  cells.  At  a  synapse,  a  presynaptic  neu- 
ron secretes  a  signal  that  is  recognized  by  the  ad- 
jacent postsynaptic  cell.  A  similar  communica- 
tion process  is  also  responsible  for  the  regulation 
of  body  metabolism  by  hormones,  except  that  in 
this  case  several  messenger-secreting  endocrine 
cells  are  clustered  together  and  send  their  signals 
to  widely  distributed  cells  in  the  body. 

My  laboratory  explores  one  aspect  of  this  im- 
portant communication  process — the  mecha- 
nism by  which  signal  molecules  like  neurotrans- 
mitters or  hormones  are  released  from  cells. 
These  molecules  are  synthesized  in  the  secreting 
cells  and  packaged  into  intracellular  membrane- 
bound  vesicles.  Upon  stimulation,  the  cell's  sig- 
nal molecules  are  released  by  exocytosis,  which 
consists  of  the  fusion  of  the  vesicle  membranes 
with  the  plasma  membrane  and  the  concurrent 
secretion  of  the  vesicle  contents.  Our  goal  is  to 
characterize  the  components  and  mechanisms  of 
the  cell  that  are  involved  in  the  assembly,  exocy- 
tosis, and  recycling  of  synaptic  vesicles. 

The  first  goal  in  exploring  synaptic  vesicle 
functions  consists  of  the  characterization  of  their 
protein  constituents.  The  cellular  processes  lead- 
ing to  the  accumulation  of  neurotransmitters  in 
synaptic  vesicles  and  their  release  by  exocytosis 
must  act  on  the  synaptic  vesicle  membrane. 
Therefore  we  are  focusing  on  the  proteins  of  syn- 
aptic vesicles,  with  the  aim  of  understanding 
their  structure  and  function  in  the  context  of 
membrane  traffic  in  neurons. 

Since  synaptic  vesicles  have  rather  specialized 
functions,  they  are  comparatively  simple  organ- 
elles. Preliminary  estimates  suggest  that  synaptic 
vesicles  have  fewer  than  50  major  protein  compo- 
nents. Although  the  composition  of  these  organ- 
elles has  not  yet  been  determined  in  molecular 


detail,  they  may  be  amenable  to  such  an  analysis. 
In  addition  to  providing  insight  into  synaptic 
functions,  the  study  of  the  molecular  structure  of 
these  vesicles  will  provide  insight  into  the  struc- 
tures of  organelles  in  general. 

The  membrane  proteins  of  synaptic  vesicles 
fall  into  two  basic  functional  categories:  trans- 
port proteins  instrumental  in  the  active  uptake  of 
neurotransmitters  by  synaptic  vesicles,  and  struc- 
tural proteins  that  mediate  the  vesicles'  interac- 
tions with  other  cellular  membranes  and  the  cyto- 
skeleton  during  exocytosis  and  endocytosis  of  the 
vesicles.  The  number  and  abundance  of  proteins 
in  the  second  class  is  much  greater  than  in  the 
first  class.  This  correlates  with  the  greater  com- 
plexity of  the  transport,  fusion,  and  fission  reac- 
tions of  synaptic  vesicles  as  compared  with  their 
neurotransmitter  uptake  processes. 

In  the  past  several  years,  more  than  10  proteins 
that  are  abundant  in  the  synaptic  vesicle  mem- 
brane have  been  characterized  in  my  laboratory. 
The  transport  functions  of  synaptic  vesicles  de- 
pend on  a  proton  pump  in  their  membrane;  the 
activity  of  this  pump  establishes  an  energy  gra- 
dient across  the  vesicle  membrane.  This  energy 
gradient  is  utilized  for  the  active  accumulation  of 
neurotransmitters.  Two  subunits  of  the  synaptic 
vesicle  proton  pump  have  been  cloned  in  my  lab- 
oratory, and  we  are  currently  studying  its  assem- 
bly and  regulation.  Some  synaptic  vesicles  also 
contain  an  electron  transport  protein  in  their 
membrane  to  convey  reducing  equivalents  for  in- 
travesicular  enzymes.  This  protein,  cytochrome 
b56l,  has  also  been  purified  and  cloned  in  this 
laboratory.  We  are  now  in  the  process  of  trying  to 
identify  and  characterize  the  actual  neurotrans- 
mitter transport  proteins  in  the  synaptic  vesicle 
membrane. 

The  second  class  of  proteins  are  those  presum- 
ably involved  in  the  exocytosis  and  re-endocyto- 
sis  of  synaptic  vesicles.  These  proteins  should  be 
present  on  all  vesicles  independent  of  their  neu- 
rotransmitter content.  The  most  abundant  among 
these  are  the  three  integral  membrane  proteins — 
synaptotagmin  {M^  65,000),  synaptophysin  {M^ 
38,000),  and  synaptobrevin  {M,  18,000) — and 
the  synapsins,  a  group  of  four  related  peripheral 
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membrane  proteins.  The  primary  structures  of 
these  proteins  were  determined  in  this  labora- 
tory, and  specific  antibodies  and  different  ex- 
pression systems  are  being  used  to  probe  their 
functional  roles. 

Synaptophysin  has  a  complex  structure  consist- 
ing of  a  homotrimer  that  spontaneously  re-forms 
into  a  homohexamer  upon  reduction  of  its  disul- 
fide bonds.  Synaptotagmin  and  synaptobrevin  are 
highly  conserved  synaptic  vesicle  proteins  that 
we  have  also  characterized  in  the  fruit  fly  Dro- 
sophila  melanogaster.  Synaptotagmin  contains 
two  copies  of  a  Ca^"^-binding  domain  found  in 
protein  kinase  C.  Recombinant  synaptotagmin 
also  avidly  binds  phospholipids,  suggesting  that 
it  may  have  a  role  in  the  Ca^'^-regulated  steps  of 
synaptic  vesicle  exocytosis. 

The  synapsins  are  the  major  phosphoproteins 
of  the  nerve  terminal  and  interact  with  the  cyto- 
skeleton  in  a  phosphorylation-dependent  man- 


ner. The  four  synapsins  are  the  differentially 
spliced  products  of  two  separate  genes.  They  are 
composed  of  mosaics  of  shared  and  unique  do- 
mains and  are  differentially  distributed  among 
synapses.  The  synapsins  presumably  have  a  role 
in  the  vesicle  traffic  to  and  from  the  plasma  mem- 
brane and  may  regulate  the  number  of  vesicles 
available  for  exocytosis.  They  are  stoichiometri- 
cally  phosphorylated  at  multiple  sites  by  multi- 
ple protein  kinases,  as  a  function  of  the  activation 
state  of  the  nerve  terminal. 

Together,  our  studies  have  characterized  a  siz- 
able portion  of  the  synaptic  vesicle  proteins  and 
are  approaching  the  dissection  of  the  synaptic  ves- 
icle pathway  in  molecular  terms.  Our  current  in- 
vestigations of  the  interactions  of  synaptic  vesicle 
proteins  with  other  proteins  of  the  synaptic  nerve 
terminal  are  an  attempt  to  understand  the  role  of 
these  proteins  in  regulating  neurotransmitter 
release. 
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THE  primary  focus  of  our  research  is  to  under- 
stand the  molecular  events  involved  in  the 
regulation  of  mammalian  cell  growth.  Since  cells 
grow  in  response  to  the  presence  of  extracellular 
mitogens  (growth  factors) ,  we  are  attempting  to 
define  the  mechanisms  by  which  growth  factors 
function.  A  complex  series  of  events  follows  mi- 
togen-receptor  binding,  including  the  rapid  gen- 
eration of  second  messenger  signals  in  the  cell's 
cytosol  and  plasma  membrane.  In  turn,  these 
events  lead  to  the  activation  of  a  set  of  "immedi- 
ate-early" genes. 

During  the  past  few  years,  work  from  our  own 
and  other  laboratories  has  shown  that  an  impor- 
tant subset  of  these  genes  encodes  transcriptional 
factors — proteins  that  activate  or  repress  another 
set  of  genes.  Furthermore,  upon  mitogenic  stimu- 
lation these  genes  are  inducible  in  a  wide  variety 
of  mammalian  cells,  suggesting  that  they  are  part 
of  a  general  program  involved  in  cell  growth. 
More  importantly,  by  virtue  of  their  structure  and 
induction  kinetics,  they  are  likely  to  play  broad 
roles  as  "third  messengers,"  by  coupling  early 
biochemical  processes  to  long-term  changes  in 
gene  expression  required  to  modulate  not  only 
cell  growth  but  other  cellular  processes  such  as 
differentiation.  Therefore,  our  long-term  goal  is 
to  understand  how  these  proteins  function  as  nu- 
clear signal  transducers. 

To  date,  our  laboratory  has  identified  four  early 
growth  response  (EGR)  genes  (EGRl-4).  Each 
encodes  a  protein  with  three  Cys2-His2  zinc  finger 
motifs,  structures  previously  described  in  other 
transcriptional  regulatory  proteins.  The  pre- 
dicted finger  domains  are  nearly  identical  to  each 
other  and  are  closely  related  to  that  of  another 
protein  that  may  play  a  critical  role  in  the  genesis 
of  a  childhood  kidney  tumor  (Wilms'  tumor). 
These  proteins  function  as  transcription  factors 
and  bind  to  similar  target  sequences.  Thus  they 
can  interfere  with  each  other's  ability  to  activate 
a  test  target  gene.  However,  the  physiologically 
important  target  gene(s)  for  the  EGR  proteins  in 


the  context  of  cell  growth  remains  to  be 
identified. 

We  are  currently  attempting  to  define  a  func- 
tional role  for  the  EGRl  transcription  factor  in 
myeloid  differentiation  and  in  tumorigenesis. 
The  human  EGRl  gene  located  on  chromosome  5 
band  q31  is  deleted  in  the  blast  cells  of  many 
patients  with  therapy-related  acute  myeloid  leu- 
kemia. These  patients  succumb  to  this  disease 
after  successful  treatment  of  their  primary  malig- 
nancy years  earlier.  Recent  fine-mapping  data 
continue  to  support  the  hypothesis  that  EGRl  is 
located  in  the  critical  region  of  chromosome 
5q31,  the  smallest  common  deleted  region  in 
these  patients.  If  these  data  continue  to  hold  up 
as  the  critical  region  is  further  refined,  we  will 
use  pulse-field  electrophoresis  and  blotting  as 
well  as  polymerase  chain  reaction  methodology 
and  DNA  sequencing  to  investigate  the  structure 
of  the  cytogenetically  normal  EGRl  allele.  To- 
ward this  end,  we  have  recently  cloned  and  se- 
quenced the  human  EGRl  cDNA.  The  overall  hy- 
pothesis under  consideration  is  that,  like  the 
retinoblastoma  gene,  the  p53  gene,  and  the 
Wilms'  tumor  gene,  EGRl  might  be  a  negative 
regulator  of  cell  growth,  at  least  in  the  context  of 
myelopoiesis. 

Another  emerging  focus  of  our  laboratory  is  the 
isolation  of  genes  for  transcription  factors  in- 
volved in  the  development  of  a  solid  organ,  the 
kidney.  Using  oligonucleotide  probes  derived 
from  conserved  regions  of  zinc  finger  proteins, 
we  have  identified  several  clones  expressed  at 
high  levels  in  the  kidney.  Molecular  characteriza- 
tion of  these  clones  and  their  expression  patterns 
during  development  are  in  progress.  As  an  alter- 
native approach,  we  are  also  attempting  to  charac- 
terize the  genetic  elements  responsible  for  ex- 
pression of  the  Tamm-Horsfall  protein.  This 
glycoprotein  is  produced  in  all  mammalian  spe- 
cies, in  a  developmentally  regulated  and 
nephron-specific  manner,  and  should  serve  as  a 
useful  starting  point  for  defining  aspects  of  kid- 
ney transcription  and  development. 
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THE  liver  has  unusual  properties  of  regenera- 
tion. When  the  two  largest  lobes  are  re- 
moved, leaving  the  smaller  lobes  completely  in- 
tact, the  remaining  cells  can  grow  and  divide 
until  the  liver  regains  its  former  size,  whereupon 
growth  ceases.  Many  growth  factors  have  been 
implicated  in  regulating  this  process,  but  the 
mechanisms  remain  poorly  understood.  After 
many  years  of  study  the  same  interesting  ques- 
tions about  liver  regeneration  remain:  What 
makes  the  liver  start  regenerating?  What  regulates 
liver  growth  during  regeneration,  allowing  the 
liver  to  maintain  its  normal  architecture?  What 
makes  the  liver  stop  regenerating  when  it  has  re- 
gained its  initial  size? 

Understanding  liver  regeneration  will  help  ex- 
plain how  the  organ  responds  to  toxic  damage  or 
infections  like  hepatitis.  Additionally,  because 
increasing  numbers  of  liver  transplants  are  being 
performed  and  successful  transplants  require 
liver  regeneration,  it  is  essential  to  understand 
the  biological  and  molecular  bases  for  liver  cell 
growth. 

My  colleagues  and  I  are  interested  in  determin- 
ing what  genes  are  expressed  during  liver  regen- 
eration and  how  their  protein  products  are  in- 
volved in  regulating  the  process.  It  is  important 
to  determine  if  these  genes  are  identical  to  those 
that  regulate  the  grov^h  of  all  cells.  We  are  study- 
ing liver  cell  growth  in  two  systems:  a  continu- 
ously growing  liver  cell  line  that  responds  to  in- 
sulin as  a  growth  factor,  and  regenerating  liver 
tissue  from  rats. 

Early  studies  identified  several  genes  that  are 
rapidly  expressed  in  response  to  growth  factors  in 
liver  cells  and  in  commonly  studied  cells  like 
lymphocytes  and  fibroblasts,  respectively  impor- 
tant in  the  immune  system  and  scar  formation. 
Many  of  these  genes  encode  proteins  that  func- 
tion in  the  cell  nucleus,  possibly  regulating  the 
cascade  of  gene  expression  occurring  when  cells 
grow.  Some  of  these  genes  have  altered  expres- 
sion in  cancer  cells,  contributing  to  the  aberrant 
proliferation. 

To  isolate  novel  genes  involved  in  liver  cell 
growth,  we  have  used  a  technique  that  allows 


subtraction  of  genes  expressed  in  nongrowing 
cells  from  those  expressed  in  growth-stimulated 
cells.  Thus  genes  that  are  uniquely  expressed  in 
growing  liver  cells  have  been  isolated  and  can  be 
studied.  We  have  identified  approximately  50 
such  genes  through  analysis  of  their  sequence 
and  through  predictions  concerning  their  protein 
structure.  Many  of  the  genes  are  expressed  in 
other  growing  cells  and  seem  to  be  pan  of  the 
general  growth  response,  while  some  are  specific 
to  liver. 

In  the  past  year,  our  studies  have  focused  on 
understanding  the  actions  of  the  proteins  en- 
coded by  some  of  these  genes  and  their  potential 
roles  in  regulation  of  liver  regeneration.  Because 
we  isolated  so  many  novel  genes,  it  was  important 
to  establish  criteria  for  determining  which  of  the 
encoded  proteins  are  likely  to  have  important  reg- 
ulatory roles  in  liver  regeneration.  We  decided  to 
explore  further  the  exact  roles  of  novel  proteins 
falling  into  four  functional  categories. 

The  first  category  includes  proteins  that  regu- 
late the  expression  of  genes.  Because  so  many 
genes  need  to  be  turned  on  for  liver  regeneration 
to  proceed,  proteins  that  regulate  gene  expres- 
sion are  likely  to  be  important. 

The  second  category  are  proteins  that  are  se- 
creted from  cells  and  may  function  to  regulate 
the  growth  of  surroimding  cells.  The  liver  must 
maintain  its  cellular  architecture  during  regener- 
ation, and  because  it  is  made  up  of  many  different 
cell  types,  intercellular  communications  must 
exist  during  regeneration.  Secreted  proteins 
could  be  involved  in  mediating  such  intercellular 
signals. 

The  third  category  includes  proteins  whose  ex- 
pression is  specific  to  regenerating  liver  alone 
among  growing  cells,  and  we  found  several  in 
this  category,  including  a  few  that  are  highly  ex- 
pressed. These  proteins  could  be  important  in 
liver-specific  growth. 

The  fourth  category  includes  several  genes 
whose  expression  is  induced  in  regenerating 
liver  but  abnormally  high  in  the  liver  tumor  cell 
line  that  grows  in  response  to  insulin,  even  prior 
to  insulin  treatment.  These  could  function  as  on- 
cogenes in  the  liver  tumor  line. 
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LRF- 1  (liver  regeneration  factor)  is  one  of  the 
proteins  that  we  have  further  characterized.  It  is 
highly  expressed  in  regenerating  liver  and  func- 
tions as  a  DNA-binding  protein,  controlling  the 
expression  of  target  genes.  It  falls  into  the  cate- 
gory , of  so-called  leucine  zipper  proteins,  in 
which  a  stretch  of  amino  acids  containing  evenly 
spaced  leucine  residues  allows  one  molecule  of 
LRF-1  to  interact  with  another,  or  with  related 
proteins.  LRF-1  has  a  characteristic  temporal  ex- 
pression pattern  during  liver  regeneration  and  ac- 
tivates or  inhibits  the  expression  of  target  genes 
in  a  singular  manner,  implying  that  LRF-1  has  a 
unique  role  in  regulating  events  in  the  regenera- 
tive process. 


We  have  begun  studying  the  function  of  a  pro- 
tein secreted  during  liver  regeneration  that  could 
have  a  role  as  a  growth  factor.  We  found  that  a 
gene  encoding  a  protein  that  binds  to  so-called 
insulin-like  growth  factor,  termed  insulin-like 
growth  factor-binding  protein- 1  (IGFBP-1),  is 
highly  expressed  during  the  early  phases  of  liver 
regeneration.  IGFBP-1  is  not  expressed  in  other 
growing  cells  we  have  examined.  This  protein 
has  been  implicated  in  enhancing  the  growth- 
promoting  effects  of  insulin-like  growth  factor. 
Because  IGFBP-1  could  potentially  act  as  a 
growth  factor  during  liver  regeneration,  we  have 
further  studied  its  expression  and  function  in 
liver  regeneration. 
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THE  lymphoid  immune  system  comprises  two 
major  types  of  cells:  1)  the  B  cell,  which  se- 
cretes antibodies  that  bind  to  and  target  foreign 
substances  for  elimination  by  the  immune  sys- 
tem, and  2)  the  T  cell,  which  is  capable  of  pro- 
ducing a  variety  of  molecules,  known  as  lympho- 
kines,  that  regulate  the  functions  of  other  cells 
involved  in  immune  responses.  T  cells  are  also 
capable  of  identifying  cells  expressing  foreign 
proteins  and  destroying  them  by  cell-mediated  cy- 
totoxicity. The  goals  of  our  laboratory  are  to  un- 
derstand the  molecular  events  associated  with 
the  development  of  the  lymphoid  immune  sys- 
tem and  to  define  the  mechanisms  by  which  the 
functions  of  these  cells  are  controlled. 

The  central  role  of  the  lymphoid  immune  sys- 
tem in  the  natural  resistance  to  infectious  dis- 
eases has  been  demonstrated  by  the  infectious 
problems  encountered  by  patients  with  both  con- 
genital and  acquired  immunodeficiency  syn- 
dromes. Many  of  the  serious  infectious  and  neo- 
plastic complications  associated  with  the 
acquired  immune  deficiency  syndrome  (AIDS) 
are  the  result  of  depletion  of  the  helper  cells,  a 
subset  of  T  cells.  A  better  understanding  of  the 
molecular  mechanisms  associated  with  genera- 
tion of  both  B  and  T  cells  during  development 
would  aid  in  our  ability  to  understand  and  treat 
various  immunodeficiencies. 

B  cells  derive  their  name  from  the  bursa  of  Fab- 
ricius,  a  developmental  organ  in  birds  that  is  re- 
quired for  the  generation  of  mature  B  cells.  Mam- 
mals lack  a  bursa  of  Fabricius  but  still  appear  to 
be  able  to  generate  a  B  cell  immune  system.  Over 
the  past  several  years,  our  laboratory  has  investi- 
gated the  role  of  this  organ  in  B  cell  development 
in  the  chicken.  We  are  attempting  to  characterize 
differences  in  the  generation  of  B  cells  in  mam- 
mals and  avian  species  to  account  for  this  lack  in 
mammals.  The  major  role  of  the  B  cell  immune 
system  is  to  generate  the  approximately  10  mil- 
lion different  antibody  molecules  needed  to  pro- 
tect the  body  from  foreign  substances. 

Our  studies  have  helped  demonstrate  that  the 
primary  molecular  mechanism  by  which  chick- 


ens generate  this  large  number  of  different  anti- 
body molecules  is  different  from  that  used  by 
mammals.  Antibody  diversity  in  the  chicken  is 
achieved  by  gene  conversion,  a  molecular  mecha- 
nism that  occurs  during  B  cell  development  in 
the  bursa  of  Fabricius  and  that  requires  the  induc- 
tion of  cell  proliferation  within  the  bursa  of  Fab- 
ricius. We  have  begun  to  characterize  the  require- 
ments for  this  proliferation.  These  studies  have 
led  to  insights  into  how  genes  encoding  immuno- 
globulin molecules  are  initially  assembled  and 
modified  during  development. 

It  has  recently  been  shown  that  immunoglobu- 
lin gene  conversion  also  occurs  in  mammals.  The 
molecular  events  that  are  used  to  generate  gene 
conversions  in  chickens  may  also  be  used  to 
create  the  somatic  mutations  that  arise  in  mam- 
malian immunoglobulin  genes  during  B  cell  de- 
velopment. Somatic  mutations  have  been  impli- 
cated in  enhancing  the  ability  of  a  cell  to  respond 
to  foreign  substances  but  also  in  the  development 
of  leukemia  and  lymphomas.  Our  studies  in  the 
chicken  should  teach  us  more  about  this  impor- 
tant but  potentially  dangerous  process. 

T  cells  have  been  divided  in  the  past  into  two 
major  subsets — helper  T  cells,  which  produce 
the  lymphokines  that  regulate  immune  re- 
sponses, and  cytotoxic  T  cells,  which  can  kill 
cells  expressing  foreign  proteins.  A  number  of 
years  ago  the  molecular  requirements  for  the 
grovvT:h  and  expansion  of  cytotoxic  T  cells  were 
defined.  Those  studies  led  to  the  discovery  of  the 
lymphokine  interleukin-2  (IL-2)  and,  subse- 
quently, to  a  variety  of  therapeutic  trials  using 
IL-2  to  increase  cell-mediated  immunity  in  pa- 
tients with  cancer.  Unfortunately,  conditions  for 
the  growth  of  helper  T  cells  have  yet  to  be  de- 
fined. An  understanding  of  the  molecular  mecha- 
nisms that  regulate  the  function  and  proliferation 
of  helper  T  cells  is  now  of  particular  interest, 
since  it  is  helper  T  cells  that  are  deficient  in  pa- 
tients with  AIDS. 

Over  the  past  several  years,  we  have  been  in- 
vestigating molecular  mechanisms  that  regulate 
helper  T  cells.  We  have  defined  a  helper  T  cell 
activation  pathway  that  appears  to  regulate  the 
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coordinate  production  of  a  variety  of  lympho- 
kines.  Activation  of  this  pathway  involves  CD28, 
a  specific  receptor  on  a  T  cell's  surface.  The  mole- 
cule that  normally  stimulates  this  activation  path- 
way is  expressed  on  B  cells,  macrophages,  and 
thymic  epithelial  cells.  Thus  this  receptor-ligand 
interaction  serves  as  a  means  for  different  cell 
types  involved  in  the  immune  response  to  com- 
municate with  each  other. 

Within  the  T  cell,  the  CD28  activation  pathway 
appears  to  control  the  rate  at  which  the  mRNAs 
for  these  genes  are  degraded,  rather  than  the  rate 
at  which  the  mRNAs  are  produced.  Stimulation  of 
this  pathway  after  mitogenic  activation  of  cells 
produces  exponential  growth  of  helper  T  cells. 
As  a  result,  we  have  been  able  to  keep  helper  T 


cells  in  exponential  growth  for  over  six  months. 
This  is  exciting,  because  for  the  first  time  we  are 
able  to  grow  normal  human  helper  T  cells  in  cul- 
ture for  long  periods  of  time.  In  theory,  we  can 
now  generate  an  infinite  number  of  helper  cells. 
Cells  grown  in  this  way  are  being  tested  for  their 
ability  to  enhance  an  individual's  immune  re- 
sponse to  cancer.  In  addition,  we  may  be  able  to 
develop  therapies  to  enhance  the  production  of 
helper  T  cells  within  the  body.  Presently,  how- 
ever, we  do  not  know  whether  the  activation 
pathway  directly  leads  to  the  proliferation  of 
helper  T  cells  or  whether  a  particular  lympho- 
kine  or  combination  of  lymphokines  produced  as 
a  result  of  this  stimulation  is  responsible.  Studies 
to  clarify  this  issue  are  under  way. 
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THE  fruit  fly  Drosophila  melanogaster  pro- 
vides an  ideal  model  system  for  studying  the 
development  of  higher  eukaryotes.  Three- 
quarters  of  a  century  of  biological,  physiological, 
and  genetic  experiments,  combined  with  recent 
intensive  molecular  studies,  have  led  to  a  greater 
understanding  of  its  development  than  that  of  any 
other  higher  organism. 

Halfway  through  the  fly's  life  cycle,  a  pulse  of 
the  steroid  hormone  ecdysone  triggers  a  dramatic 
morphological  transformation,  from  a  relatively 
immobile  feeding  larva  to  a  highly  motile,  repro- 
ductively  active  adult  fly.  We  are  studying  the 
molecular  basis  of  the  ecdysone-induced  regula- 
tory mechanisms  that  allow  metamorphosis  to 
proceed. 

When  the  larva  begins  to  undergo  metamorpho- 
sis, its  salivary  glands  contain  giant  polytene 
chromosomes,  which  can  be  visualized  by  light 
microscopy.  These  500-fold  overreplicated,  in- 
terphase chromosomes  lie  in  register  beside  one 
another.  A  characteristic  banding  pattern  along 
the  length  of  the  polytene  chromosomes  allows 
any  gene  of  interest  to  be  located  precisely.  Re- 
gions of  the  genome  that  are  being  actively  tran- 
scribed are  often  represented  by  large  regions  of 
decondensed  chromatin  that  can  be  seen  as  puff's 
in  these  chromosomes.  Thus  the  transcriptional 
activity  of  specific  genes  at  specific  times  can  be 
followed  by  observing  the  appearance  and  disap- 
pearance of  puff^s  during  development. 

Approximately  10  puffs  can  be  distinguished 
when  the  salivary  gland  chromosomes  first  be- 
come large  enough  to  see.  These  puffs  remain 
until  the  end  of  larval  development,  when  the 
burst  of  ecdysone  triggers  a  dramatic  change  in 
the  puffing  pattern  coincident  with  the  onset  of 
metamorphosis.  Approximately  six  puffs  are  in- 
duced directly  by  the  steroid  hormone.  These 
early  ecdysone-inducible  puffs  appear  to  encode 
regulatory  proteins  that  repress  their  own  expres- 
sion and  induce  the  formation  of  more  than  100 
late  puffs.  This  second  wave  of  puffs  is  believed 
to  encode  the  proteins  responsible  for  initiating 
metamorphosis. 

By  isolating  and  characterizing  the  ecdysone- 


inducible  genes  that  lie  within  the  early  puffs,  we 
hope  to  learn  how  these  genes  are  induced  by  the 
hormone  and  how  their  encoded  proteins  might 
function  in  a  regulatory  capacity.  In  a  broader 
sense,  this  project  provides  a  model  system  for 
characterizing  the  role  of  steroid  hormones  in  reg- 
ulating gene  expression,  as  well  as  addressing  the 
question  of  how  gene  hierarchies  are  controlled 
during  development. 

Our  current  studies  center  around  E74,  an  ec- 
dysone-inducible gene  that  is  located  within  the 
large  early  puff  at  position  74EF  in  the  polytene 
chromosomes.  This  unusually  complex  gene  en- 
codes three  nested  mRNAs  that  derive  from 
unique  start  sites  but  share  a  common  3'  end.  The 
distal  promoter  directs  the  synthesis  of  a  60-kb 
primary  transcript  that  is  spliced  to  form  the  6-kb 
E74A  mRNA.  Two  other  promoters,  located  40  kb 
downstream  from  the  E74A  promoter,  direct  the 
synthesis  of  4.8-  and  5.1-kb  E74B  mRNAs.  Al- 
though the  E74A  and  E74B  mRNAs  are  distinct 
from  one  another  by  virtue  of  their  unique  5' 
exons,  the  majority  of  these  mRNAs  are  identical, 
derived  from  a  common  set  of  three  3'  exons. 

This  nested  arrangement  of  the  E74  transcripts 
leads  to  the  synthesis  of  two  related  E74  proteins 
that  have  unique  amino-terminal  domains  joined 
to  a  common  carboxyl-terminal  domain.  The 
amino-terminal  domains  are  rich  in  acidic  amino 
acids,  whereas  the  common  carboxyl-terminal 
domain  is  rich  in  basic  amino  acids.  This  charge 
distribution  is  reminiscent  of  yeast  transcrip- 
tional activators  and  thus  consistent  with  the  no- 
tion that  E74  may  encode  regulatory  proteins.  In 
addition,  the  sequence  of  the  carboxyl  terminus 
of  the  E74  proteins  is  very  similar  to  a  portion  of 
the  protein  encoded  by  the  els  oncogene.  This 
85-amino  acid  ETS  domain  defines  a  family  of 
proteins  and  has  been  shown  to  function  as  a  site- 
specific  DNA-binding  domain  that  recognizes  a 
purine-rich  DNA  sequence.  Studies  of  oncogene- 
related  proteins,  such  as  E74,  may  help  us  learn 
more  about  how  the  normal  counterparts  of  these 
disease  genes  function  during  development. 

Biochemical  analysis  of  the  E74A  protein  has 
revealed  that  it  binds  DNA  in  a  site-specific  man- 
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ner  and  that  its  target  sequence  resembles  that 
recognized  by  other  ETS-domain  proteins.  By  us- 
ing antibody  detection  techniques  to  localize  the 
E74A  protein  bound  to  the  giant  polytene  chro- 
mosomes, we  are  now  moving  to  the  next  step  in 
the  regulatory  hierarchy.  A  variety  of  binding 
sites  have  been  identified  in  this  manner,  includ- 
ing both  early  and  late  ecdysone-induced  puffs. 
These  target  genes  will  be  used  for  biochemical 
studies  to  determine  the  function  of  E74A  DNA 
binding.  In  addition,  we  plan  to  characterize 
some  target  genes  to  learn  what  role  they  might 
play  in  metamorphosis. 

The  precise  timing  of  the  genetic  response  to 
ecdysone  can  be  clearly  seen  in  the  pattern  of 
pufifs  that  arise  at  the  onset  of  metamorphosis. 
Our  characterization  of  E74  transcription  has 
provided  insights  into  how  timing  can  be  built 
into  a  genetic  regulatory  hierarchy.  Ecdysone  di- 
rectly activates  the  E74A  promoter,  resulting  in  a 
dramatic  induction  of  the  6-kb  mRNA.  This  tran- 
script does  not  appear  in  the  cytoplasm,  how- 
ever, until  1  hour  after  promoter  activation.  This 
delay  corresponds  closely  to  the  time  it  takes  for 
RNA  polymerase  to  traverse  the  60-kb  transcrip- 
tion unit,  indicating  that  the  length  of  the  E74A 
unit  functions  as  a  timer  to  delay  significantly  the 
appearance  of  the  encoded  mRNA.  The  unusual 
length  of  the  E74A  primary  transcript  sets  it  apart 
from  most  transcription  units  in  Drosophila, 
which  are  only  slightly  longer  than  the  final  pro- 
cessed mRNA.  Ecdysone  also  directly  activates 
the  20-kb  E74B  transcription  unit.  In  agreement 


with  its  primary  transcript  length,  mature  E74B 
mRNA  appears  between  15  and  30  minutes  after 
ecdysone  addition.  Thus  the  structure  of  the  E74 
gene  dictates  an  invariant  order  of  appearance  of 
its  transcripts  in  response  to  ecdysone.  The  ear- 
lier appearance  of  E74B  mRNA  is  enhanced  by  its 
activation  at  an  approximately  25-fold  lower  ec- 
dysone concentration  than  E74A.  E74B  is  further 
distinguished  from  E74A  by  its  repression  at  a 
significantly  higher  ecdysone  concentration  than 
that  required  for  its  induction,  close  to  the  con- 
centration required  for  E74A  activation. 

These  regulatory  properties  lead  to  an  ecdy- 
sone-induced switch  in  E74  expression,  with  an 
initial  burst  of  E74B  transcription  followed  by  a 
burst  of  E74A  transcription.  E74A  and  E74B  also 
show  different  patterns  of  transcription  in  four 
ecdysone  target  tissues.  These  studies  provide  a 
means  to  translate  the  profile  of  a  steroid  hor- 
mone pulse  into  different  amounts  and  times  of 
regulatory  gene  expression  that,  in  turn,  could 
direct  different  developmental  responses  in  a 
temporally  and  spatially  regulated  manner. 

E74  provides  a  paradigm  for  our  studies  on  the 
molecular  basis  of  metamorphosis.  We  hope  to 
use  this  gene  to  begin  to  understand  how  the  hor- 
monal signal  is  transduced  to  result  in  specific 
morphogenetic  changes.  Our  next  step  is  to  de- 
termine which  genes  are  regulated  by  E74  and  to 
characterize  the  products  they  encode,  as  well  as 
their  mode  of  regulation  by  E74.  Our  long-term 
goal  is  to  isolate  more  early  ecdysone-inducible 
genes  to  extend  our  understanding  of  this  com- 
plex developmental  process. 
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THE  orderly  development  of  the  mammalian 
embryo  requires  the  appropriate  activation 
and  subsequent  modulation  of  genes  in  a  tissue- 
specific  manner.  The  mechanisms  by  which  this 
is  achieved  are  the  subject  of  our  investigations. 
One  classical  approach  to  this  problem  is  to  study 
in  depth  the  elements  responsible  for  a  specific 
gene's  expression  in  a  limited  subset  of  tissues. 
We  have  taken  this  approach,  using  the  mouse 
a-fetoprotein  (AFP)  gene  as  the  model. 

Over  several  years  we  have  identified  the  cis- 
acting  regulatory  elements  that  activate  tran- 
scription of  this  gene  in  only  three  cell  types  in 
the  mouse — the  fetal  liver,  fetal  gut,  and  visceral 
endoderm  of  the  yolk  sac.  These  include  three 
200-base-long  segments  of  DNA  that  exhibit  prop- 
erties of  tissue-specific  cellular  enhancers  and  lie 
substantially  upstream  of  the  gene.  The  fourth 
positive  element  lies  within  the  first  125  bp  up- 
stream of  the  gene  and  participates  in  the  tissue- 
specific  activity  of  the  gene's  promoter  by  inter- 
acting with  a  liver-specific  transcription  factor, 
HNF-1. 

The  AFP  gene  undergoes  a  dramatic  decline  in 
transcription  after  birth;  this  decline  is  mediated 
by  a  fifth  cis-acting  element  that  lies  between  the 
most  proximal  enhancer  and  the  promoter  of  the 
gene.  We  have  used  germline  transformation  to 
show  that  this  negatively  acting  element  is  acti- 
vated at  birth  and  interferes  with  the  continued 
presence  of  positive  transcription  factors,  not 
only  in  the  context  of  the  AFP  gene,  but  when  it  is 
placed  in  the  same  position  within  the  albumin 
gene,  which  is  not  normally  repressed  after  birth. 
The  demonstration  that  the  negative  repressor 
must  lie  between  these  strong  positive  elements 
to  function  explains  why  only  the  AFP,  and  not 
the  nearby  albumin  gene,  is  negatively  regulated 
in  neonatal  liver. 

The  decline  in  hepatic  AFP  mRNA  after  birth  is 
influenced  by  raf,  an  unlinked  gene.  Mice  that 
carry  a  mutant  allele  of  m/exhibit  10-  to  20-fold 
higher  levels  of  AFP  mRNA  in  the  adult.  We  used 
transgenic  mice  to  establish  that  the  raf  gene 


product  does  not  act  through  the  negative  tran- 
scriptional apparatus  but  rather  affects  the  con- 
centration of  AFP  mRNA  post-transcriptionally, 
by  changing  the  stability  and/or  the  processing  of 
its  transcript.  Thus  the  10,000-fold  decline  in 
AFP  RNA  after  birth  is  achieved  by  changes  in 
both  transcription  and  stability  of  the  AFP 
transcripts. 

Our  long-standing  interest  in  liver  transcrip- 
tional repression  led  us  to  examine  another  fetal- 
specific  gene,  the  H19  gene.  Initially  we  were 
surprised  to  see  that  this  gene  contains  no  long 
open  reading  frame,  normally  found  in  genes  that 
encode  protein  products.  We  have  subsequently 
shown  that  the  RNA  is  not  associated  with  polyri- 
bosomes in  the  cell  but  is  sequestered  in  a  parti- 
cle in  the  cytoplasm.  This  leaves  us  with  the  chal- 
lenge of  understanding  the  role  of  a  gene 
transcribed  and  processed  as  a  classical  RNA  poly- 
merase II  gene,  whose  product  is  developmen- 
tally  regulated  in  both  endoderm  and  mesoderm, 
but  which  does  not  encode  a  protein. 

One  means  of  understanding  the  function  of 
any  gene  is  to  generate  both  dominant  and  reces- 
sive mutations  in  it.  To  obtain  dominant  mutants 
in  the  mouse,  we  have  introduced  excess  copies 
of  the  H19  gene  into  the  mouse  germline,  using 
microinjection  into  zygotes.  No  transgenic  mice 
that  express  the  gene  have  been  recovered  at 
birth.  The  lethality  occurs  abruptly,  after  14  days 
into  the  normal  20-day  gestation  period  of  the 
mouse. 

Insight  into  the  basis  of  the  lethality  caused  by 
the  extra  copies  of  the  H19  gene  came  from  its 
location  on  the  distal  tip  of  chromosome  7.  This 
region  has  been  shown  to  undergo  genomic  im- 
printing— when  both  copies  of  this  region  are 
inherited  from  the  mother,  the  animals  die  late  in 
gestation.  By  taking  advantage  of  differences  in 
the  H19  gene  between  different  species  of 
mouse,  we  showed  that  only  the  mother's  copy  of 
the  H19  gene  is  active  in  a  normal  mouse.  By 
introducing  extra  copies  of  the  gene  into  the 
germline  of  mice,  we  had  altered  the  carefully 
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controlled  gene  dosage,  and  the  result  was 
lethality. 

Our  laboratory  is  also  developing  techniques 
to  study  the  large  number  of  mutations  affecting 
development  that  have  been  described  in  the 
mouse.  Few  of  these  have  been  approached  at  the 
molecular  level,  primarily  because  there  were  no 
experimental  tools  available  to  gain  access  to  the 
mutated  genes.  The  advent  of  techniques  that  al- 
low the  investigator  to  map  and  clone  megabase 
amounts  of  DNA  at  a  time — specifically  pulsed- 
fiield  gel  electrophoresis  and  yeast  artificial  chro- 
mosomes— has  eliminated  this  constraint.  We 
have  begun  a  long-term  project  to  develop  tech- 
niques that  will  allow  us  to  clone  such  genes  and 


to  establish  their  identities  by  complementation 
analysis  via  introduction  of  yeast  artificial  chro- 
mosomes into  embryonic  stem  cells. 

A  complete  library  of  mouse  DNA  in  yeast, 
currently  under  construction,  is  being  used  to 
isolate  the  mouse  Fused  gene,  which  affects 
neural  ectoderm  development  in  the  early  mouse 
embryo.  A  650-kb  segment  of  DNA  has  been 
cloned  near  the  FusedXocus,  and  a  large  interspe- 
cies backcross  is  being  used  to  localize  its  rela- 
tionship to  the  Fused  gene  itself.  To  prove  that 
the  Fused  gene  has  been  identified,  large  seg- 
ments of  DNA  containing  it  will  need  to  be  intro- 
duced back  into  the  mouse  germline.  Strategies 
for  accomplishing  this  through  embryonic  stem 
cells  are  in  progress. 
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THE  main  research  interest  of  our  laboratory  is 
the  mechanism  by  which  the  genetic  infor- 
mation stored  in  DNA  molecules  is  retrieved  in  a 
controlled  and  orderly  fashion  during  the  bio- 
chemical process  called  transcription,  which 
subsequently  leads  to  the  production  of  specific 
proteins  in  animal  cells.  We  have  devised  various 
means  of  isolating  the  individual  components  of 
the  cell  responsible  for  transcription  and  have  re- 
constructed this  complex  reaction  in  the  test 
tube.  In  this  way  we  can  study  how  specific  genes 
are  turned  on  and  off  during  cell  growth  and  de- 
velopment. The  mechanisms  that  govern  the 
switching  on  and  off  of  genes  are  of  fundamental 
importance  in  understanding  the  normal  meta- 
bolic processes  that  maintain  and  perpetuate  liv- 
ing cells,  as  well  as  in  deciphering  the  basis  of 
disease  and  other  cellular  or  genetic  disorders. 

Biochemical  Analysis  of  Cancer  Genes 

A  living  cell  contains  hundreds  of  thousands  of 
protein  molecules,  each  carrying  out  its  proper 
function.  If  either  the  production  or  the  action  of 
such  proteins  is  altered,  severe  malfunction  can 
result,  such  as  uncontrolled  growth  leading  to 
cancer.  Among  a  normal  cell's  constituents,  cer- 
tain key  molecules  have  the  potential  to  be  onco- 
genic, or  tumor  causing.  Such  molecules  are 
called  oncoproteins.  Our  group  previously  iso- 
lated a  family  of  rare  proteins  from  human  cells 
that  have  been  shown  to  be  oncoproteins,  en- 
coded by  the  genes  jun  and  fos.  These  regulatory 
proteins  are  normally  involved  in  controlling  the 
action  of  many  other  genes  in  the  cell,  but  when 
their  activities  are  perverted — for  example,  by 
viruses — they  can  lead  to  the  production  of 
cancer-causing  cells. 

Recent  advances  in  the  study  of  the  nuclear  on- 
cogene reveal  that  the  ability  of  the  jun  pro- 
tein to  activate  transcription  is  regulated  by  a 
cell-type-specific  inhibitor  that  interacts  with  a 
unique  portion  of  the  molecule,  rendering  it  less 
potent.  The  isolation  and  characterization  of  this 
specific  negative  regulator  should  help  unravel 
the  molecular  signaling  pathways  responsible  for 


transducing  information  from  the  outside  of  the 
cell  into  the  nucleus,  where  gene  expression  is 
controlled.  Such  inhibitors  of  jun  activity  may 
represent  new  members  of  the  anti-oncogenic  or 
tumor  suppressor  family  of  biological  regulatory 
molecules. 

Studies  of  Trans-activating  Proteins  That 
Regulate  Gene  Expression 

A  major  hurdle  has  been  the  development  of 
biochemical  techniques  that  allow  the  purifica- 
tion of  these  rare  and  fragile  proteins.  Through 
the  use  of  specific  DNA-afi&nity  chromatography 
procedures  pioneered  in  our  laboratory,  we  can 
now  isolate  transcription  proteins  that  in  turn  en- 
able us  to  clone  the  genes  that  encode  these  im- 
portant regulatory  proteins.  The  ability  to  isolate 
this  biologically  important  class  of  genes  pro- 
vides a  powerful  approach  for  studying  their 
structure  and  function. 

In  the  past  several  years  our  laboratory  has  iso- 
lated and  characterized  some  10  difi'erent  genes 
that  are  directly  responsible  for  the  tissue-selec- 
tive, temporally  programmed,  and  basal-level 
control  of  gene  expression  in  animal  cells.  These 
studies  are  beginning  to  reveal  new  concepts  re- 
garding the  surprisingly  modular  construction  of 
these  proteins,  as  well  as  their  unusual  plasticity 
and  functional  flexibility.  Most  importantly,  spe- 
cific structural  motifs  that  lie  within  these 
proteins  have  been  recognized  as  carrying  out 
distinct  functions.  Our  findings  provide  the  theo- 
retical basis  for  analysis  of  other  as  yet  undiscov- 
ered transcription  factors  and  will  greatly  aid  our 
ability  to  decipher  their  mechanisms  of  action. 

How  Promoter-Specific  Regulators  Trigger 
Transcription 

One  fundamental  mystery  is  the  mode  of  action 
by  which  sequence-specific  DNA-binding  pro- 
teins such  as  the  prototype  human  factor  Spl  di- 
rect transcriptional  interactions.  To  address  this 
critical  issue,  we  recently  fractionated  and  iso- 
lated the  multiple  components  necessary  to  re- 
constitute transcription.  In  the  process  of  dissect- 
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ing  the  general  transcriptional  apparatus,  we  dis- 
covered two  components  that  serve  as  the  func- 
tional bridge  between  upstream  trans-activators 
and  the  initiation  complex.  These  novel  factors 
appear  to  be  part  of  the  missing  link  that  directs 
promoter-selective  transcription  in  animal  cells, 
and  we  anticipate  that  they  will  be  members  of  a 
diverse  and  essential  class  of  regulatory  proteins. 

Transcription  of  Developmentally 
Regulated  Genes 

One  of  our  long-term  interests  is  to  understand 
mechanisms  underlying  the  regulation  and  ex- 
pression during  development  of  higher  organ- 
isms. We  have  begun  to  address  this  issue  in  two 
ways:  first,  by  initiating  a  series  of  in  vitro  exper- 
iments aimed  at  dissecting  the  transcriptional 
regulation  of  Drosophila  melanogaster  genes, 
including  alcohol  dehydrogenase  (Adh),  Ultra- 
bithorax  (Ubx) ,  Antennapedia  (Antp) ,  dopa  de- 
carboxylase (Ddc),  and  hunchback.  A  major 
advance  was  the  development  of  in  vitro  tran- 
scription reactions  from  staged  Drosophila  em- 
bryos that  accurately  initiate  RNA  synthesis  and 
recapitulate  the  temporal  program  of  transcrip- 
tion displayed  by  these  tissue-specific  and  devel- 
opmentally regulated  genes.  A  second  approach 
has  been  to  investigate  the  regulatory  mechanism 
of  RNA  polymerase  initiation  factors  in  vivo. 
Various  systems  have  been  adapted  to  introduce 


altered  genes  back  into  cells  or  whole  organisms 
to  study  their  patterns  of  expression. 

These  in  vitro  and  in  vivo  studies  have  re- 
cently led  to  two  exciting  results.  First,  a  negative 
regulator  of  a  developmentally  important  gene 
has  been  identified  by  direct  biochemical  means, 
and  its  mode  of  operation  can  now  be  dissected. 
The  finding  of  specific  transcriptional  repressors 
is  of  particular  importance,  because  it  is  thought 
that  an  interplay  of  positive  activators  and  nega- 
tive regulators  is  seminal  to  the  spatially  re- 
stricted patterns  of  expression  observed  during 
embryogenesis. 

A  second  interesting  finding  arising  from  our 
studies  of  Drosophila  is  the  identification  and 
subsequent  biochemical  characterization  of  a 
transcription  factor  that  appears  to  govern  the  ex- 
pression of  genes  in  cells  of  the  central  nervous 
system.  The  gene  encoding  this  neurogenic- 
specific  activator  has  recently  been  isolated,  and 
its  structure  is  expected  to  reveal  interesting  in- 
formation. In  addition,  the  use  of  fruit  flies  al- 
lows us  to  probe  the  developmental  and  tissue- 
specific  function  of  this  neurogenic  regulator  in  a 
rapid  and  highly  informative  manner  not  readily 
accessible  in  mammalian  cells.  These  studies  are 
expected  to  yield  new  insights  concerning  the 
tissue-specific  distribution  and  temporal  timing 
of  expression  during  development. 
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THE  most  critical  step  in  the  vertebrate  im- 
mune response  is  the  recognition  of  antigens 
by  lymphocytes.  This  task  is  accomplished  by  two 
sets  of  glycoproteins:  immunoglobulins  (Igs)  and 
T  cell  antigen  receptors  (TCRs).  The  most  ex- 
traordinary feature  of  these  proteins  is  their  struc- 
tural variability,  much  of  which  originates  from 
the  ability  of  the  encoding  gene  segments  to  un- 
dergo somatic  rearrangement. 

All  TCRs  were  initially  thought  to  be  composed 
of  a  heterodimeric  protein  composed  of  a-  and 
j8-subunits.  However,  the  search  for  the  genes  en- 
coding these  polypeptides  led  to  the  identifica- 
tion of  a  third  rearranging  gene,  7,  that  was  later 
shown  to  encode  one  of  the  two  subunits  of  an- 
other heterodimeric,  TCR  75.  The  characteristics 
of  this  new  type  of  TCR  and  the  development  and 
function  of  the  T  lymphocytes  that  express  it,  76 
T  cells,  have  been  the  major  focus  of  our  research 
during  the  past  several  years. 

The  two  types  of  T  cell — those  expressing  TCR 
76  and  TCR  afi — most  certainly  arise  through  two 
separate  cell  lineages  that  originate  from  a  com- 
mon stem  cell.  The  molecular  mechanism  where- 
with the  two  cell  lineages  segregate  from  the 
stem  cell  is  of  considerable  interest,  not  only  for 
immunologists  but  also  for  cellular  biologists  in 
general.  Our  previous  studies  suggested  that 
there  is  a  DNA  element,  called  a  silencer,  asso- 
ciated with  a  7  gene  through  which  its  expression 
is  repressed  in  afi  T  lineage  cells.  Our  data  also 
suggested  that  the  fate  of  a  given  precursor  T  cell 
is  determined  by  whether  or  not  it  contains  fac- 
tors that  can  specifically  interact  with  the  si- 
lencer element  and  thereby  repress  the  asso- 
ciated 7  gene. 

We  have  now  identified  and  characterized  the 
silencer  element — in  fact,  two  elements,  one  lo- 
cated within  and  the  other  3'  to  a  7  gene.  Each  of 
these  elements  alone  can  repress  7  gene  tran- 
scription in  an  a/3  lineage  cell  but  not  in  a  yb 
lineage  cell,  and  the  two  elements,  where  present 
in  the  same  lineage  cell,  work  synergistically. 
We  have  also  identified  a  third  DNA  element  be- 
tween the  two  silencer  elements  that  enhances  7 


gene  transcription,  called  an  enhancer.  We  sus- 
pect that  each  of  these  transcriptional  elements 
(silencers  and  enhancer)  interacts  with  an  array 
of  factors,  some  of  which  are  differentially  pres- 
ent in  ajS  and  76  T  cell  lineages,  and  that  the  dif- 
ferential synthesis  or  activation  of  these  factors  is 
the  key  for  the  segregation  of  the  two  T  cell 
lineages. 

One  of  the  most  intriguing  features  of  76  T  cells 
is  the  compartmentalization  of  their  subsets  with 
different  TCR  repertoires  to  various  epithelia.  Us- 
ing transgenic  mice  that  were  produced  by  the 
microinjection  of  a  pair  of  rearranged  TCR  7  and  6 
genes  into  the  pronuclei  of  fertilized  eggs,  we 
have  shown  that  a  given  76  T  cell  "homes"  to  a 
particular  epithelium,  being  guided  by  a  mecha- 
nism independent  of  the  specificity  of  its  76  TCR. 
Furthermore,  using  mice  whose  thymuses  had 
been  surgically  removed,  we  confirmed  our  ear- 
lier conclusion  obtained  with  thymus-less 
"nude"  mice:  Development  of  76  T  cells  that 
home  to  spleen  and  lymph  nodes  is  thymus  de- 
pendent, while  that  of  76  T  cells  associated  with 
gut  epithelium  is  not. 

Another  intriguing  feature  of  the  compartmen- 
talized 75  T  cells  is  that  virtually  no  TCR  diversity 
is  present  among  the  members  of  some  of  these 
subsets,  namely  the  subset  associated  with  epi- 
dermis (called  skin-associated  intraepithelial 
lymphocytes,  or  s-IEL)  or  the  subset  associated 
with  the  epithelial  layer  of  vagina,  uterus,  and 
tongue  (called  vut-IEL).  The  structural  homoge- 
neity of  the  75  TCR  in  these  tissues  is  all  the  more 
striking  because  their  genes,  like  Ig  and  TCR  a 
and  (8  genes,  have  the  capacity  for  diversity  and 
because  this  capacity  is  indeed  utilized  in  some 
other  parts  of  the  body,  such  as  spleen  and  lymph 
nodes. 

By  culturing  fetal  thymus  outside  the  body  in 
the  presence  of  a  monoclonal  antibody  that  will 
specifically  bind  to  76  TCR,  we  have  now  demon- 
strated that  the  remarkable  homogeneity  of  76 
TCR  of  s-IEL  and  vut-IEL  is  a  result  of  the  strong 
selection  of  T  cell  clones  expressing  the  particu- 
lar 76  TCRs  through  their  interaction  with  un- 
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known  determinants  expressed  in  fetal  thymuses 
(called  positive  selection).  While  neither  the 
role  of  s-IEL  or  vut-IEL  or  the  nature  of  the  select- 
ing determinants  (called  ligands)  is  known,  such 
strong  cellular  selection  of  T  cells  with  a  particu- 
lar TCR  structure  (and  hence  specificity)  sug- 
gests its  pivotal  role  in  the  functioning  of  these  T 
cells  in  the  respective  peripheral  site. 

We  previously  showed  that  at  least  part  of  the 
determinant  recognized  by  a  spleen-type  yd  TCR 
(KN6  76  TCR)  is  encoded  by  a  novel  gene  (722*") 
that  maps  in  the  major  histocompatibility  com- 
plex (MHC)  and  has  structural  similarity  with  the 
MHC  class  I  genes  (the  products  of  which  are 
known  to  present  antigen-derived  peptides  to  a/? 
TCR).  This  and  related  findings  by  others  sug- 
gested that  the  products  of  a  few  dozen  class  I 
genes  clustered  in  the  so-called  Tla  subregion  of 
MHC,  whose  function  has  been  unknown,  may 
generally  serve  as  peptide-presenting  molecules 
for  76  TCR.  Our  recent  finding  that  the  product  of 
one  of  these  class  I  genes,  73*,  is  expressed,  ap- 
parently exclusively,  on  the  surface  of  gut  epithe- 
lia  cells  supports  this  hypothesis.  (Note  that  a  par- 
ticular 75  T  cell  subset  referred  to  as  i-IEL  is 
localized  in  gut  epithelium.) 

While  we  have  accumulated  a  substantial 
amount  of  information  with  respect  to  develop- 
ment, diversity,  and  tissue  localization  of  7^  T 
cells,  their  role  in  the  immune  system  remains 
elusive.  To  help  shed  light  on  this  most  funda- 


mental issue,  we  have  been  producing  mutant 
mice  deficient  in  a/3  or  76  T  cells.  We  accomplish 
this  by  manipulating  a  TCR  gene  in  a  test  tube, 
injecting  it  into  the  nuclei  of  a  cultured  embry- 
onic stem  (ES)  cell,  returning  the  DNA-injected 
stem  cell  into  an  early  embryo  (a  blastocyst),  and 
finally  injecting  the  embryo  into  a  pseudopreg- 
nant  mother.  The  chimeric  newborn  are  tested  by 
breeding  them  further  for  germline  transmission 
of  the  engineered  TCR  gene,  and  the  germline- 
transmitted  mice  are  bred  by  sister-brother  mat- 
ing to  generate  mice  with  homozygous  mutation. 
(The  entire  procedure  is  called  the  ES  cell  gene- 
targeting  technique.) 

To  date  we  have  succeeded  in  obtaining  mice 
in  which  a  mutated  TCR  13,  a,  or  8  gene  is  germ- 
line  transmitted.  We  intend  to  analyze  the  im- 
mune system  of  these  mutant  mice  before  and 
after  various  immunizations,  after  grafting  with 
normal  and  tumor  cells,  and  after  infecting  them 
with  a  variety  of  pathogens.  We  expect  these  stud- 
ies to  help  not  only  in  identifying  the  roles  of  76  T 
cells  but  also  in  redefining  the  role  of  some 
T  cells. 

Finally,  we  have  also  been  applying  the  ES  cell 
gene-targeting  technique  to  a  calcium/calmodu- 
lin-dependent  protein  kinase.  This  kinase  is 
highly  enriched  in  the  postsynaptic  density  of 
hippocampus  and  is  suspected  to  be  involved  in 
establishing  and  maintaining  long-term  memory 
in  mammals.  Our  goal  is  to  determine  whether 
and  how  this  kinase  is  involved  in  memory. 
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THE  overall  goal  of  my  laboratory  is  to  gain  a 
better  understanding  of  information  process- 
ing both  inside  individual  living  cells  and  in  net- 
works of  neurons.  Our  preferred  approach  is 
through  the  rational  design,  synthesis,  and  use  of 
new  molecules  to  detect  and  manipulate  intra- 
cellular biochemical  signals,  usually  by  optical 
means  such  as  fluorescence  readout  or  photo- 
chemical release  of  messenger  substances.  For  ex- 
ample, we  have  created  fluorescent  dye  mole- 
cules that  detect  calcium  ions  (Ca^"^)  with  great 
specificity  and  sensitivity,  so  that  while  the  cells 
are  living  and  performing  their  normal  functions, 
we  can  image  Ca^^  levels  inside  cells  with  a  spa- 
tial resolution  of  a  micron  or  so  and  a  temporal 
resolution  of  a  fraction  of  a  second.  These  dyes 
have  found  wide  application  in  cell  biology, 
since  a  rise  in  intracellular  Ca'^^  levels  is  one  of 
the  commoner  mechanisms  by  which  cell  mem- 
branes control  biochemical  events,  such  as  mus- 
cle contraction,  synaptic  transmission,  glandular 
secretion,  enzyme  activation,  embryonic  fertiliza- 
tion, and  growth  stimulation. 

The  detection  of  intracellular  signals  such  as 
Ca^*  is  doubly  important.  It  should  help  in  trac- 
ing the  complex  biochemistries  involved  in  such 
signaling,  and  it  affords  a  nondestructive  way  to 
watch  the  activity  of  many  individual  cells  simul- 
taneously. The  latter  ability  is  particularly  rele- 
vant to  understanding  how  neural  networks  pro- 
cess information  by  harnessing  many  individual 
but  interconnected  neurons  in  parallel.  The  domi- 
nant established  techniques  for  monitoring 
neural  activity  either  listen  intensively  to  a  single 
neuron  at  a  time  or  record  some  smeared-out 
average  of  what  thousands,  millions,  or  billions 
of  cells  are  doing.  If  we  can  continue  to  improve 
the  spatial  and  temporal  resolution  of  present 
Ca^^  imaging,  we  may  succeed  in  eavesdropping 
on  conversations  within  small  groups  of  individu- 
ally identified  neurons  or  in  taking  snapshots  of 
the  instantaneous  state  of  activity  of  yet  larger  en- 
sembles. Because  optical  monitoring  is  inher- 
ently good  for  following  multiple  events  in  paral- 
lel, it  would  be  a  major  help  in  analyzing  the 


workings  of  the  brain,  the  most  awesome  and 
complex  molecular  assembly  known. 

A  recent  example  of  molecular  engineering  is 
our  development  of  a  fluorescent  sensor  for 
cAMP.  This  important  intracellular  messenger 
plays  a  crucial  role  in  the  actions  of  a  great  many 
hormones,  in  the  sensing  mechanisms  for  odors 
and  tastes,  and  in  the  mechanisms  of  learning  and 
memory.  In  this  case  we  did  not  design  the  sens- 
ing molecules  from  scratch  but  rather  modified 
the  natural  protein  that  cells  normally  use  to  re- 
spond to  cAMP.  In  collaboration  with  Susan  Tay- 
lor and  her  laboratory,  we  have  attached  fluores- 
cent labels  to  cAMP-dependent  protein  kinase  in 
such  a  way  that  cAMP  not  only  activates  the  nor- 
mal activity  of  this  enzyme  but  produces  an  imme- 
diate optical  signal  that  we  can  image  microscopi- 
cally. This  labeled  protein  enables  us  to  visualize 
cAMP  levels,  to  show  that  neighboring  cells  can 
have  diff^ering  responses  to  neurotransmitter  and 
drug  stimulation,  and  to  see  that  a  subunit  of  the 
enzyme  can  move  in  and  out  of  the  nucleus  as  the 
cAMP  rises  and  falls.  While  it  is  in  the  nucleus,  it 
is  ideally  placed  to  modify  gene  expression. 

A  complementary  area  of  interest  is  the  use  of 
light  both  to  visualize  intracellular  biochemistry 
and  to  perturb  it  in  a  controlled  manner  to  see 
how  the  cell  or  tissue  responds.  Light  is  a  stimu- 
lus that  is  wonderfully  controllable  in  space  and 
time.  Of  course  most  cells,  other  than  specialized 
tissues  like  the  retina,  are  not  particularly  sensi- 
tive to  light,  so  the  trick  is  to  design  and  synthe- 
size light-sensitive  organic  molecules  that,  intro- 
duced into  cells,  release  important  messenger 
substances  upon  illumination.  Through  the  work 
of  many  researchers,  ourselves  included,  such 
photochemical  releaser  molecules  are  available, 
not  only  for  Ca^^  but  also  other  important  bio- 
chemical signals  such  as  cAMP,  inositol  phos- 
phates, and  diacylglycerol.  We  have  also  recently 
developed  the  first  molecule  that  can  do  the  op- 
posite function — that  is,  gobble  up  the  messen- 
ger upon  illumination.  All  these  photochemical 
perturbations  are  valuable  to  show  whether  the 
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biochemical  signal  is  merely  coincidental  or  is 
necessary  or  sufficient  for  various  cell  responses. 

One  problem  is  that  the  photochemistry 
currently  used  is  irreversible,  so  that  once  each 
molecule  has  been  photolyzed  and  has  released 
or  absorbed  its  cargo,  it  cannot  be  recycled  back 
to  its  initial  state.  Therefore  the  experiment  is 
soon  over,  because  the  molecules  have  been  used 
up.  The  next  challenge  is  to  find  reversible  photo- 
chemistry in  which  light  of  one  wavelength 
causes  messenger  release,  and  light  of  a  different 
wavelength  triggers  reuptake.  By  alternating 
these  two  colors  of  illumination,  one  would  then 
be  able  to  generate  artificial  oscillations  in  mes- 
senger level  whose  effects  could  be  compared 
with  those  of  the  many  known  natural  oscilla- 
tions. Furthermore,  steady  illumination  with  ad- 
jacent spots  of  the  two  wavelengths  should  create 
a  standing  spatial  gradient  of  messenger  level, 
which  would  help  test  whether  analogous  natural 
gradients  are  important  in  controlling  the  direc- 
tion of  cell  movement  and  outgrowth,  as  often 
hypothesized. 

Eventually  we  hope  to  extend  optical  methods 
to  detect  macromolecular  biochemical  signals 
such  as  protein  phosphorylation  or  gene  tran- 


scription. These  events  currently  are  assayed  by 
grinding  millions  of  cells,  so  that  time  resolution 
is  limited  and  differences  between  individual 
cells  or  subregions  are  impossible  to  discern.  Our 
experience  with  imaging  ionic  messengers  and 
cAMP  suggests  that  cells  have  considerable  indi- 
viduality and  complex  behavior  patterns.  These 
were  missed  with  destructive  population  assays, 
which  might  be  somewhat  analogous  to  studying 
human  psychology  on  the  basis  only  of  anony- 
mous nationwide  averages  in  which  the  respon- 
dents are  executed  after  each  poll.  We  therefore 
seek  continuous,  nondestructive  readout  from 
single  cells.  Possible  approaches  include  mi- 
croinjection of  peptides  whose  fluorescence  is 
altered  by  phosphorylation,  or  development  of 
membrane-permeant  fluorogenic  substrates  for 
reporter  enzymes  whose  nucleotide  sequences 
can  be  fused  to  genes  or  promoter  sequences  of 
interest. 

Our  projects  encompass  a  wide  range  of  disci- 
plines, including  organic  synthesis,  theoretical 
and  experimental  optical  spectroscopy  and  pho- 
tochemistry, protein  chemistry,  computerized 
microscopy  and  image  processing,  cell  biology, 
and  neurobiology. 
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TTTTUMAN  biochemical  genetics  has  been  a 
JLXfruitful  area  of  study  since  its  beginning 
with  the  work  of  Sir  Archibald  Garrod  early  in  this 
century.  Inherited  defects  in  our  body's  chemis- 
try or,  as  Garrod  called  them,  inborn  errors  of 
metabolism,  are  intrinsically  interesting  and 
serve  as  important  models  for  all  genetic  diseases. 
My  colleagues  and  I  have  been  involved  in  the 
study  of  several  aspects  of  these  disorders,  in- 
cluding clinical  diagnosis,  biochemical  charac- 
terization, delineation  of  pathophysiologic  mech- 
anisms, development  of  new  therapeutic 
approaches,  and  molecular  studies  of  the  caus- 
ative mutations. 

We  have  focused  on  disorders  of  amino  acid 
metabolism,  particularly  those  involving  two 
fundamentally  important  areas  of  metabolism: 
the  urea  cycle,  which  is  involved  in  the  conver- 
sion of  excess  nitrogen  from  a  toxic  to  a  nontoxic, 
readily  excreted  form,  and  the  tricarboxylic  acid 
cycle,  an  essential  component  of  energy  metabo- 
lism. Recently  we  have  extended  these  interests 
to  include  inborn  errors  in  the  biogenesis  of  the 
peroxisome,  a  ubiquitous,  subcellular  organelle 
that  contains  about  40  enzymes  important  in  a 
variety  of  anabolic  and  catabolic  processes. 

One  of  the  amino  acid  disorders  that  we  are 
studying  is  an  inborn  error  of  ornithine  metabo- 
lism known  as  gyrate  atrophy  of  the  choroid  and 
retina  (GA) .  This  progressive,  blinding  chorioret- 
inal degeneration  with  associated  cataract  forma- 
tion is  inherited  as  an  autosomal  recessive  trait. 
The  primary  biochemical  defect  is  deficiency  of 
the  enzyme  ornithine-5-aminotransferase  (OAT), 
which  results  in  an  approximate  1 0-fold  accumu- 
lation of  ornithine  in  all  bodily  fluids. 

Despite  the  systemic  nature  of  the  metabolic 
abnormality  in  GA,  the  clinical  phenotype  is  lim- 
ited to  the  eye.  Thus  GA  is  one  of  a  very  few  iso- 
lated, inherited  retinal  degenerations  for  which  a 
primary  biochemical  defect  is  known.  As  such, 
GA  serves  as  a  model  for  more  common  inherited 
retinal  degenerations  whose  biochemical  basis 
remains  enigmatic.  We  have  been  involved  in  a 
variety  of  GA  studies,  including  the  development 
and  testing  of  potential  therapies,  investigation  of 


possible  explanations  for  the  sensitivity  of  the  ret- 
ina to  the  systemic  biochemical  abnormalities, 
elucidation  of  regulatory  mechanisms  control- 
ling the  expression  of  the  OAT  gene,  and  determi- 
nation of  the  molecular  defects  causing  GA.  The 
last  depends  on  the  results  of  earlier  studies  in 
our  laboratory,  including  isolation  and  sequenc- 
ing of  a  full-length  cDNA  clone  of  human  liver 
OAT  and  determination  of  the  structure  and  orga- 
nization of  the  human  OAT  gene. 

The  OAT-catalyzed  reaction  is  an  essential  step 
in  the  pathway  that  interconnects  the  urea  cycle 
with  the  tricarboxylic  acid  cycle.  Thus  it  is  not 
surprising  that  the  regulation  of  OAT  is  complex 
and,  in  liver,  coordinated  with  other  urea  cycle- 
related  enzymes.  We  recognized  a  sequence  mo- 
tif in  the  5'-flanking  region  of  OAT  that  is  similar 
to  sequences  in  the  5'-flanking  region  of  several 
other  urea  cycle-related  genes,  including  orni- 
thine transcarbamylase,  argininosuccinate  synthe- 
tase, arginase,  and  glutamine  synthetase.  We  des- 
ignated this  candidate  cis-acting  regulatory 
element  a  urea  cycle  element  (UCE) .  Expression 
studies  with  hybrid  constructs  of  various  portions 
of  the  5'-flanking  region  of  the  OAT  gene  linked 
to  a  reporter  gene  indicate  that  the  UCE  motifs 
have  functional  roles.  DNase  I  footprint  assays 
and  gel  retardation  experiments  with  a  series  of 
synthetic  oligonucleotides  corresponding  to  the 
normal  and  mutant  variations  of  the  UCE  se- 
quence indicate  that  at  least  three  proteins,  each 
with  distinct  but  overlapping  recognition  sites, 
bind  to  the  UCE. 

Using  a  concatamer  of  the  UCE  sequence  as  a 
radiolabeled  probe,  we  have  cloned  cDNAs  for 
two  specific  DNA-binding  proteins  that  are  candi- 
dates for  trans-acting  regulators  of  the  homeo- 
static  mechanisms  that  adjust  ureagenic  capacity 
to  nitrogen  intake.  Transient  transfection  experi- 
ments in  which  we  co-transfect  one  of  these 
cDNAs  with  a  second  construct  containing  two  or 
four  UCE  in  the  promoter  of  a  reporter  gene  indi- 
cate that  the  protein  product  of  our  cDNA  clone 
does  activate  the  expression  of  genes  with  UCE  in 
their  promoters. 

To  better  understand  the  coordinated  regula- 
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tion  of  the  urea  cycle-related  enzymes,  we  re- 
cently have  embarked  on  a  project  to  clone  the 
genes  for  additional  enzymes  that  are  metaboli- 
cally  related  to  OAT.  For  several  of  these,  purified 
protein  or  sequence  data  are  not  available;  there- 
fore we  are  cloning  these  genes  by  functional 
complementation  of  mutant  variants  of  Saccha- 
romyces  cerevisiae.  To  date  we  have  cloned  a 
full-length  cDNAfrom  humanpyrroline-5-carbox- 
ylate  reductase  by  this  method.  Preliminary 
studies  support  the  feasibility  of  using  the  same 
strategy  to  clone  pyrroline-5-carboxylate  dehy- 
drogenase. These  two  enzymes  catalyze  reactions 
involving  the  product  of  the  OAT  reaction,  and 
the  latter  is  the  site  of  the  primary  defect  in  the 
human  disorder,  type  II  hyperprolinemia.  We  an- 
ticipate that  the  genes  for  these  enzymes  will  be 
subject  to  transcriptional  regulation  that  interacts 
with  that  of  OAT. 

We  are  continuing  an  extensive  molecular  anal- 
ysis of  the  OAT  mutations  that  cause  GA  in  the 
probands  of  79  families  from  around  the  world. 
The  functional  consequences  of  the  detected  mu- 
tations are  tested  by  expressing  the  mutant  allele 
in  a  Chinese  hamster  cell  line  that  lacks  endoge- 
nous OAT  protein  or  mRNA.  Of  the  27  alleles  we 
have  detected,  1 5  have  been  confirmed  as  delete- 
rious, 10  are  presumed  to  be  deleterious,  1  is 
neutral,  and  1  is  synonymous.  In  addition  to  pro- 
viding a  means  for  accurate  molecular  diagnosis 
of  GA,  we  anticipate  that  our  examination  of 
these  defects,  particularly  the  18  missense  muta- 
tions, will  enhance  our  understanding  of  OAT 
structure  and  function.  We  are  exploring  strate- 
gies to  express  OAT  in  quantities  sufficient  for 
crystallographic  analysis  of  OAT  structure. 

Our  studies  of  the  mutations  causing  GA  have 
highlighted  the  remarkable  allelic  heterogeneity 


at  the  OAT  locus.  With  one  exception  (L402P, 
the  most  prevalent  allele  in  Finnish  GA  patients), 
each  mutant  allele  accounts  for  less  than  10  per- 
cent of  the  total.  Survey  of  the  genotype  of  pro- 
bands from  our  79  GA  pedigrees  indicates  that 
the  35  mutant  alleles  so  far  discovered  by  our- 
selves and  others  account  for  only  70  percent  of 
the  total  mutant  OAT  genes  in  this  population. 

A  final  OAT-related  project  depends  on  the  uti- 
lization of  a  cluster  of  OAT-related  sequences  (all 
located  on  the  proximal  portion  of  the  short  arm 
of  the  X  chromosome)  to  develop  a  large-scale 
genetic  map  of  this  region  of  the  human  genome. 
The  genes  for  several  diseases  involving  the  retina 
map  to  this  region.  At  this  point  we  have  cloned 
and  organized  two  regions,  each  about  a  mega- 
base,  in  Xpl  1 .2-Xpl  1.3,  which  are  separated  by 
a  1-3  megabase  interval.  We  are  beginning  to 
identify  the  active  genes  in  these  areas  and  will 
examine  their  possible  role  in  inherited  retinal 
degenerations. 

We  have  also  begun  an  investigation  of  inborn 
errors  of  peroxisome  biogenesis  and  function. 
Zellweger  syndrome,  a  neurodevelopmental  dis- 
order fatal  in  infancy,  is  the  disease  paradigm. 
Cells  and  tissue  from  these  patients  exhibit  defi- 
ciency of  virtually  all  peroxisomal  enzymes.  In 
collaboration  with  Hugo  Moser,  we  are  cloning 
the  genes  for  integral  membrane  proteins  unique 
to  peroxisomes.  The  first  of  these,  the  70-kDa 
peroxisome  membrane  protein  (PMP),  is  a 
member  of  the  multiple-drug  resistance  (MDR) 
superfamily  of  proteins,  which  also  includes  the 
CFTR  protein  involved  in  cystic  fibrosis.  We  have 
cloned  and  sequenced  a  full-length  cDNA  for  the 
human  PMP  and  are  in  the  process  of  determining 
if  mutations  in  this  gene  are  responsible  for  any  of 
the  inborn  errors  of  peroxisomal  biogenesis. 
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THE  maintenance  of  a  viable  basal  state  in  eu- 
karyotic  cells  (homeostasis),  and  the  re- 
sponses of  cells  to  environmental  stimuli,  involve 
a  fine-tuned  regulatory  network  that  is  modulated 
by  transient  changes  in  the  intracellular  levels  of 
ionized  calcium.  The  molecular  mechanisms  by 
which  quantitative  changes  in  calcium  concen- 
tration are  transduced  into  qualitative  changes  in 
cell  behavior  (altered  cell  motility,  proliferation, 
receptor  capping,  cell  morphology,  etc.)  involve 
the  reversible  interaction  of  calcium  with  a  class 
of  protein  receptors  that  bind  it  selectively  in  the 
presence  of  higher  concentrations  of  other 
ions,  such  as  magnesium.  A  prototypical  member 
of  this  class  of  calcium-binding  proteins  is 
calmodulin. 

Calmodulin  is  ubiquitous  among  the  plant  and 
animal  kingdoms  and  has  multiple  biological 
roles.  It  fulfills  these  roles  through  its  presence  as 
an  integral  subunit  of  a  diverse  array  of  enzymes, 
cytoskeletal  structures,  and  membrane-asso- 
ciated transport  systems.  Insight  into  the  funda- 
mentals of  this  regulatory  network  would  provide 
a  framework  from  which  to  interpret  biological 
phenomena  and  guide  clinical  treatments.  It  is 
the  goal  of  our  research  to  elucidate  the  calmod- 
ulin-mediated  signal  transduction  system  and 
thereby  gain  a  more  complete  understanding  of 
how  calcium  regulates  cell  processes  and  how 
this  homeostatic  network  might  be  altered  in  cer- 
tain disease  states  or  susceptibilities. 

Calmodulin-regulated  Protein  Kinases 

One  class  of  enzymes  that  have  calmodulin  as 
an  integral  calcium-binding  regulatory  subunit 
are  the  protein  kinases.  Protein  kinases  catalyze 
the  transfer  of  phosphate  from  an  ATP  molecule 
to  a  specific  site  on  certain  proteins  {phosphory- 
lation), resulting  in  a  rapid  alteration  of  the 
phosphorylated  protein's  biological  functioning. 

In  the  basal  or  "resting"  state  of  the  cell,  the 
active  site  of  the  protein  kinase  is  inhibited  by 
another  region  of  the  kinase  molecule  {autoinhi- 
bition).  When  calcium  binds  to  calmodulin, 
which  is  loosely  bound  to  the  kinase,  the  cal- 
modulin's three-dimensional  structure  is  subtly 


changed.  This  in  turn  results  in  a  change  in  the 
three-dimensional  arrangement  of  the  protein  ki- 
nase molecule  that  relieves  the  inhibition  of  the 
kinase  active  site.  The  kinase  can  now  phosphor- 
ylate  a  specific  set  of  proteins  in  the  cell  and 
thereby  set  off  a  cascade  of  events. 

The  basal  state  is  reestablished  by  at  least  two 
closely  coupled  reactions:  the  removal  of  phos- 
phate from  the  phosphorylated  protein  by  an  en- 
zyme called  a  phosphatase,  and  the  dissociation 
of  calcium  from  calmodulin  with  a  resultant  res- 
toration of  the  protein  kinase  to  its  autoinhibited 
state.  During  this  past  year,  we  elucidated  the  pri- 
mary structure  of  one  of  these  calmodulin- 
regulated  protein  kinases — called  nonmuscle 
myosin  light-chain  kinase  (nmMLCK) — that  is 
found  in  all  nonmuscle  tissues  examined. 

We  have  also  determined,  through  use  of  mo- 
lecular genetic  and  protein  chemistry  technolo- 
gies, some  fundamental  features  of  both  the  pro- 
tein kinase  and  calmodulin  that  are  required  for 
these  two  proteins  to  recognize  each  other  in  a 
milieu  of  macromolecules,  with  the  resultant  for- 
mation of  a  fully  functional  calcium  signal-trans- 
duction  complex. 

By  serendipity,  we  also  discovered  a  novel  ge- 
netic relationship  among  this  protein  kinase  and 
some  other  proteins  produced  from  the  same 
gene.  As  a  result,  we  have  extended  our  investiga- 
tions to  include  an  analysis  of  how  the  various 
proteins  with  different  biological  functions  are 
made  from  this  single  gene  and  a  more  detailed 
localization  of  the  gene  in  the  human  genome. 

Calmodulin-regulated  Ion  Channels 

Ion  channels  are  cell  membrane  proteins  that 
regulate  ion  transport  across  the  membrane.  They 
do  not  appear  to  be  enzymes  like  the  protein  ki- 
nases, but  represent  another  class  of  cellular  pro- 
teins whose  functions  can  be  regulated  by  cal- 
modulin. Our  previous  studies  of  mutant 
organisms  with  ion  channel  defects  demonstrated 
that  it  is  possible  to  have  inherited  mutations  of 
calmodulin  that  can  selectively  alter  one  cal- 
modulin-regulated pathway.  (Previously  it  was 
thought  that  any  inherited  mutation  of  calmodu- 
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lin  would  be  lethal.)  During  this  past  year,  we 
continued  our  studies  of  the  structural  features  of 
calmodulin  required  for  proper  targeting  to  cell 
membranes  and  for  regulation  of  ion  channel- 
mediated  cell  behavior.  These  studies  showed 
that  a-single  amino  acid,  unchanged  in  calmodu- 
lins from  plant  and  animal  species,  is  required  for 
full  ion  channel  regulatory  activity. 

A  spin-off  from  this  study  was  the  production  of 
new  proteins,  by  using  designer  (modified) 
genes,  that  are  useful  diagnostic  reagents  for  fu- 
ture explorations  of  the  molecular  mechanisms 
involved  in  ion  channel  regulation.  Knowledge 
about  how  calmodulin  affects  the  activity  of  ion 


channels  will  provide  insight  into  the  features  of 
another  class  of  proteins  and  regulatory  pathways 
modulated  by  calcium  through  calmodulin,  thus 
complementing  the  knowledge  gained  from  stud- 
ies of  the  protein  kinase  family  of  calmodulin- 
regulated  proteins. 

The  fact  that  protein  kinases  are  modulated  by 
ions  whose  flux  is  regulated  by  ion  channels,  and 
that  ion  channels  can  be  regulated  by  protein  ki- 
nase-catalyzed  phosphorylations,  indicates  the 
potential  of  studies  focused  on  selected  calmod- 
ulin-regulated  proteins  for  developing  a  cohesive 
model  of  the  feedback  and  cross  talk  that  occurs 
in  vivo. 
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THE  nucleus  of  a  blood  or  skin  cell  has  the 
capacity,  when  put  back  into  an  enucleated 
egg,  to  give  rise  to  an  entire  animal,  containing 
perhaps  a  thousand  cell  types.  This  demonstrates 
that  a  given  cell  type  contains  all  the  genetic  in- 
formation needed  to  generate  all  other  types. 
How,  then,  are  the  different  cells  in  the  body  gen- 
erated? The  answer  is  that  different  sets  of  genes 
are  used  to  define  different  cell  types — e.g., 
muscle-specific  genes  are  expressed  only  in  mus- 
cle cells,  and  nerve-specific  genes,  only  in  nerve 
cells.  This  solution  has  been  confirmed  in 
hundreds  of  experiments  by  many  investigators 
over  the  past  10-20  years. 

But  how  are  these  different  batteries  of  cell 
type-specific  genes  activated  in  the  correct 
places  and  times  in  the  embryo?  The  goal  of  our 
laboratory  is  to  answer  this  question,  and  we  have 
chosen  to  ask  specifically  how  muscle  cells  are 
generated  during  development. 

MyoD  is  a  Master  Regulatory  Gene 
for  Myogenesis 

Our  initial  experiments  identified  a  gene, 
MyoD,  by  the  criteria  that  it  was  expressed  in 
myoblasts  but  not  in  fibroblasts,  nor  in  mutant 
muscle  cell  lines  that  had  lost  the  capacity  to  dif- 
ferentiate into  muscle.  Detailed  analysis  showed 
that  MyoD  is  expressed  only  in  skeletal  muscle 
cells  and  their  committed  myoblast  precursors. 
When  MyoD,  however,  is  introduced  into  fibro- 
blasts using  a  viral  long  terminal  repeat  (LTR)  to 
promote  constitutive  expression,  these  cells  acti- 
vate the  myogenic  program  and  become  muscle. 
A  large  number  of  other  differentiated  cell  types 
— fat,  pigment,  brain,  cartilage,  etc. — can  be 
converted  to  muscle  by  LTR-induced  expression 
of  MyoD.  In  the  process,  the  normal  program  of 
these  cells  is  usually  turned  off. 

We  consider  MyoD  a  "master  regulatory  gene" 
for  myogenesis  and  view  it  as  a  "nodal  point"  in 
the  flow  of  myogenic  information.  Many  tem- 
poral and  spatial  cues  from  the  early  embryo  im- 
pinge on  MyoD  to  turn  it  on  or  off,  but  once  acti- 
vated the  gene  can  keep  itself  on  (since  it 


promotes  its  own  transcription)  and  then  turn  on 
(directly  or  indirectly)  the  entire  battery,  per- 
haps hundreds,  of  muscle-specific  genes. 

MyoD  Protein  Is  a  Transcription  Factor  That 
Activates  Muscle-Specific  Genes 

MyoD  is  a  nuclear  protein,  318  amino  acids  in 
length,  that  binds  to  muscle-specific  enhancers. 
Many,  if  not  most,  muscle  enhancers  contain  mul- 
tiple MyoD-binding  sites.  MyoD  binds  to  a  con- 
sensus binding  sequence  of  which  the  apparent 
simplicity  belies  a  rather  sophisticated  interplay 
among  protein,  DNA,  and  subsequent  cell  type- 
specific  transcriptional  activation. 

Only  68  amino  acid  residues  of  MyoD  are  re- 
quired for  stable  muscle  cell  conversion.  This  re- 
gion contains  the  DNA-binding  domain,  which  is 
a  putative  basic-helix-loop-helix  (bHLH)  struc- 
ture. There  are  now  well  over  30  known  bHLH 
proteins,  sometimes  referred  to  as  the  Myc  homol- 
ogy family.  Extensive  mutagenesis  has  revealed 
that  the  HLH  region  is  required  for  dimerization, 
and  the  adjacent  basic  region,  for  DNA  binding. 

The  Activity  of  the  MyoD  Protein 
Is  Regulated 

Proliferating  myoblasts  in  tissue  culture  ex- 
press MyoD  RNA  and  protein,  yet  do  not  differen- 
tiate. As  assayed  by  in  vivo  footprinting,  the 
MyoD-binding  site  in  the  enhancer  of  the  MCK 
gene  is  occupied  in  differentiated  myotubes  but 
not  in  myoblasts.  Thus  there  are  physiological 
controls  that  regulate  MyoD  activity.  The  deci- 
sion either  to  proliferate  or  to  differentiate  into 
myotubes  is  governed  by  the  presence  or  absence 
of  serum,  but  the  mechanism  of  action  is  un- 
known. One  attractive  model  is  that  a  second  type 
of  HLH  protein  is  controlling  MyoD  activity.  We 
have  been  investigating  this  possibility. 

Information  Processing  by  HLH  Proteins 

Several  types  of  biologically  important  deci- 
sions use  information  provided  by  HLH  proteins. 
Two  of  these,  MyoD  and  the  achaete-scute  prod- 
uct (an  HLH  protein  important  for  neurogenesis). 
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seem  to  provide  information  about  cell  type.  Id 
(inhibitor  of  differentiation)  offers  a  pathway  for 
negative  control.  And  E2A  may  perform  an  effec- 
tor or  integration  function  yet  to  be  defined  fully. 
These  types  of  inputs  then  seem  to  be  integrated 
by  HLH  interactions  betvv^een  pairs  of  individual 
monomers,  and  the  result  seems  to  be  a  specific 
binding  capacity.  For  protein  complexes  contain- 
ing Id,  the  choice  is  probably  on-or-off;  for  com- 
plexes of  E2A  or  MyoD  homodimers  or  hetero- 
dimers,  the  result  can  be  altered  DNA-binding 
specificity.  Although  it  is  clear  that  interactions 
between  HLH  proteins  are  very  precise  and  spe- 
cific, the  rules  are  not  yet  known. 

MyoD  Is  Controlled  by  and  Can  Control  the 
State  of  Cell  Growth 

A  variety  of  agents  specifically  inhibit  myo- 
genic differentiation.  The  list  includes  genes 
such  as  ras,  fos,  jun,  fps,  erbA,  myc,  and  El  A; 
chemical  agents  such  as  butyrate  and  phorbol  es- 
ters; and  growth  factors  such  as  FGF  and  FGF|8. 
Most  of  these  reagents  can  inactivate  the  ex- 
pressed MyoD  protein,  and  several,  such  as  ras 
and  fos,  also  inhibit  MyoD  transcription.  The  spe- 
cific pathway  by  which  each  of  these  oncogenes, 
anti-oncogenes,  and  growth  factors  inhibits  myo- 
genesis provides  a  clue  to  how  MyoD  might  inte- 
grate information  coming  from  many  aspects  of 
cellular  function. 

Factors  that  influence  growth  regulate  MyoD, 
but  MyoD  itself  can  also  inhibit  cell  growth.  Ex- 
periments with  MyoD  mutants  show  that  the  HLH 
domain  is  required  for  growth  arrest. 

Activation  of  MyoD  During  Development 

Recently,  the  MyoD  gene  has  been  cloned  from 


Caenorhabditis  elegans,  and  an  antibody  to  the 
protein,  produced.  The  antibody  first  stains  cells 
at  the  100-cell  stage.  These  are  lineage  founder 
cells  fated  to  give  rise  to  body  wall  muscle  de- 
scendants some  three  to  four  generations  later. 
Less  than  3  kb  of  the  upstream  control  region  of 
MyoD  is  needed  for  body  wall  muscle-specific 
expression,  as  revealed  by  fusing  these  sequences 
to  a  /3-galactosidase  expression  reporter.  Results 
suggest  that  the  capacity  for  transcription  of 
MyoD  in  a  myogenic  lineage  may  precede  the 
actual  cell  type-specific  expression  of  the  pro- 
tein, and  this  capacity  segregates  into  specific 
blastomeres  as  early  as  the  28-cell  stage.  Presum- 
ably, subsequent  cell-cell  interactions  determine 
which  descendants  will  give  rise  to  nerve  and 
skin  and  turn  off  MyoD  transcription  and  which 
will  maintain  MyoD  expression  and  give  rise  to 
skeletal  muscle  cells. 

Conclusion 

The  flow  of  cell  type-specific  information  for 
myogenesis  from  the  egg  to  the  final  muscle  cell 
goes  through  a  nodal  point,  defined  by  functions 
supplied  by  the  MyoD  family  of  myogenic  regula- 
tory proteins.  It  is  possible  that  the  achaete-scute 
complex  of  genes  also  acts  as  a  nodal  point,  but 
for  neurogenesis.  An  alternative  organization 
could  have  also  been  imagined:  combinations  of 
less  tissue-restricted  genes  could  have  encoded 
the  MyoD  or  achaete-scute  functions.  The  nodal 
point  need  not  have  been  an  optimal  solution, 
but  simply  one  that  worked  at  some  crucial  time 
during  evolution.  Whether  other  cell  lineages 
employ  this  strategy,  and  whether  there  is  a  pro- 
found insight  provided  by  its  use  (such  as  the 
potential  to  make  cell  types  mutually  exclusive), 
await  further  analysis. 
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Model  of  the  structure  of  the  human  Tcell  antigen  receptor.  Basic  and  acidic  amino  acid  residues 
present  in  the  transmembrane  domains  of  each  of  the  chains  of  the  receptor  are  indicated  by  the 
one-letter  amino  acid  code  (D~,  aspartic  acid;  E',  glutamic  acid;  R* ,  arginine;  K"^,  lysine ).  Vand 
C  of  the  Ti  chains  correspond  to  variable  and  constant  domains  within  these  immunoglobulin- 
like  chains  involved  in  antigen  recognition.  The  ^-chain  dimer  and  the  y-,  8-,  and  e-chains  of  the 
CD3/T3  complex  are  believed  to  function  in  signal  transduction. 
Research  of  Arthur  Weiss. 
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THE  immune  system  has  evolved  to  provide  an 
organism  with  a  flexible  and  dynamic  mecha- 
nism to  respond  specifically  to  a  wide  variety  of 
antigens.  During  the  initiation  of  an  immune  re- 
sponse, antigen  must  not  only  be  recognized  by 
antigen-specific  lymphocytes,  but  this  recogni- 
tion event  must  lead  to  a  cellular  activation.  T  and 
B  lymphocytes  comprise  the  antigen-specific 
components  of  the  cellular  immune  system.  The 
activation  of  T  lymphocytes  is  critical  to  most  im- 
mune responses,  since  it  permits  these  cells  to 
exert  their  potent  regulatory  or  effector  activi- 
ties. During  activation,  relatively  quiescent  cells 
undergo  complex  changes  involving  cell  differ- 
entiation and  proliferation. 

Following  exposure  to  antigen,  activation  of  T 
lymphocytes  is  limited  to  only  those  cells  ex- 
pressing antigen-specific  receptors.  Activation  is 
a  consequence  of  ligand-receptor  interactions 
that  occur  at  the  interface  of  the  T  cell  and  an 
antigen-presenting  cell.  These  interactions  initi- 
ate intracellular  biochemical  events  within  the  T 
cell  that  culminate  in  cellular  responses.  The 
goal  is  to  understand  how  cell  surface  molecules 
on  the  T  cell,  and  in  particular  the  T  cell  antigen 
receptor  (TCR),  initiate  T  cell  activation. 

Although  it  is  clear  that  a  number  of  different 
cell  surface  molecules  on  the  T  lymphocyte  and 
the  antigen-presenting  cell  may  participate  in  the 
complex  cell-cell  interaction  that  occurs  during 
antigen  presentation,  the  TCR  must  play  a  promi- 
nent role.  Here  the  familiar  lock  and  key  analogy 
is  appropriate.  Antigen  is  the  ligand  (key)  for  a 
particular  set  of  clonally  distributed  receptors 
(locks)  on  T  lymphocytes.  Antigen  often  rep- 
resents a  protein  fragment  that  is  physically 
associated  with  a  molecule  of  the  major  histocom- 
patibility complex  (MHC) .  The  TCR  is  an  extraor- 
dinarily complex  structure.  It  consists  of  an  a//3- 
chain  disulfide-linked  heterodimer  (Ti)  derived 
from  immunoglobulin-like  genes  that  is  noncova- 
lently  associated  with  the  six  invariant  chains  of 
the  CD 3  complex.  Ti  may  be  viewed  as  the  li- 
gand-binding  subunit  of  the  TCR,  since  it  con- 
tains all  the  information  needed  to  recognize  an- 
tigen and  MHC  specificities.  CD3  has  been 


thought  to  play  some  role  in  transducing  the  li- 
gand occupancy  state  of  Ti  across  the  plasma 
membrane .  Hence  the  structural  basis  for  the  asso- 
ciation of  CD3  and  Ti  is  of  interest. 

Previous  studies  from  our  laboratory  have  dem- 
onstrated that  coexpression  of  CD3  and  Ti  on  the 
plasma  membrane  is  obligatory.  In  recent  stud- 
ies, we  have  found  that  the  structural  and  func- 
tional basis  for  the  interaction  between  Ti  and 
CD3  is  contained  within  the  transmembrane  do- 
mains of  these  proteins.  Further  mutational  stud- 
ies are  in  progress  to  understand  more  precisely 
how  Ti  and  CD3  interact  functionally  within 
these  domains. 

Since  the  transmembrane  regions  are  responsi- 
ble for  the  Ti-CD3  association,  we  have  taken  ad- 
vantage of  this  information  to  separate  regions  or 
domains  of  the  CD3  chains  from  Ti.  This  has  been 
accomplished  by  constructing  chimeric  mole- 
cules between  other  cell  surface  molecules 
linked  to  the  cytoplasmic  domain  of  the  CD3 
chains.  Chimeric  molecules  including  the  CD3 
f-chain  acquire  the  signal-transducing  capacity  of 
the  entire  TCR.  These  results  demonstrate  that 
the  cytoplasmic  domain  of  CD3  T  links  the  TCR  to 
intracellular  signaling  machinery.  The  ability  to 
create  such  functional  chimeric  receptors  may 
permit  the  creation  of  new  antiviral  or  antitumor 
TCRs,  which  may  be  valuable  in  gene  therapy. 

The  interaction  of  the  TCR  with  its  ligand,  anti- 
gen/MHC,  initiates  cellular  activation  by  induc- 
ing a  transmembrane  signal.  Such  signal  trans- 
duction is  manifested  as  the  formation  of 
intracellular  biochemical  mediators  called  sec- 
ond messengers,  which  can  initiate  or  influence 
cellular  response  pathways.  The  TCR  activates 
two  signal  transduction  pathways.  One  signal 
transduction  mechanism,  the  inositol  phospho- 
lipid pathway,  involves  receptor-stimulated  hy- 
drolysis of  a  rare  membrane  lipid,  PIP2  (phospha- 
tidylinositol  4,5-bisphosphate).  This  yields  two 
potent  intracellular  second  messengers  (inositol 
1 ,4,5-trisphosphate  and  diacylglycerol),  which 
regulate  the  mobilization  of  the  enzyme  protein 
kinase  C.  These  latter  two  events  are  physiologi- 
cally important  to  subsequent  cellular  responses. 


465 


structural  and  Functional  Studies  of  the  T  Cell  Antigen  Receptor 


The  TCR  activates  a  second  signal  transduction 
pathway  that  involves  an  enzyme  with  tyrosine 
kinase  activity.  The  importance  of  the  kinase 
pathway  and  its  relationship  to  the  inositol  phos- 
pholipid pathway  are  being  clarified  by  recent 
studies.  We  have  found  that  stimulation  of  the 
TCR  induces  the  tyrosine  phosphorylation  of  an 
isozyme  of  phospholipase  C,  an  enzyme  that  cata- 
lyzes the  hydrolysis  of  PIP2.  These  studies,  to- 
gether with  additional  studies  using  tyrosine  ki- 
nase inhibitors,  suggest  that  activation  of  a 
tyrosine  kinase  that  is  regulated  by  the  TCR  is 
required  for  activation  of  the  phosphatidyl  inosi- 
tol pathway. 

The  mechanism  by  which  the  TCR  couples  to 
intracellular  signaling  pathways  is  largely  unde- 
fined, as  are  many  of  the  components  of  the  sig- 
naling pathways  themselves.  To  define  and  char- 
acterize the  molecular  basis  by  which  the  TCR 
regulates  these  pathways,  we  are  using  a  somatic 
cell  genetic  approach.  We  have  isolated  a  num- 
ber of  mutants  derived  from  T  cell  leukemic  lines 
that  are  defective  in  TCR-mediated  activation  of 
the  inositol  phospholipid  pathway.  Unlike  the  pa- 
rental cells,  none  of  these  mutants  produce  lym- 
phokines  in  response  to  TCR  stimulation.  These 
mutants  define  four  distinct  gene  products  other 
than  the  Ti  chains  that  are  required  for  the  func- 
tional activation  of  the  inositol  phospholipid 
pathway. 

In  one  mutant,  the  activation  of  the  tyrosine 
kinase  pathway  is  still  intact.  The  defect  in  an- 
other of  these  mutants  can  be  attributed  to  the 
absence  of  CD45,  a  cell  surface  protein  with  tyro- 
sine phosphatase  activity.  The  absence  of  CD45 
prevents  the  TCR  from  activating  the  tyrosine  ki- 
nase or  phosphatidyl  inositol  pathway.  This  sug- 
gests a  complex  autoregulatory  system,  which  we 
are  intensively  studying.  In  the  remaining  two 
mutants,  biochemical  studies  have  comple- 
mented our  genetic  approach  and  have  suggested 


potential  defects.  We  previously  identified  two 
molecules  that  associate  with  the  TCR  in  a  ligand- 
induced  manner.  In  these  two  mutants,  these  mol- 
ecules fail  to  associate  with  the  TCR.  Further  stud- 
ies of  the  basis  for  the  defective  signal 
transduction  observed  in  these  mutants  are  in  pro- 
gress. Thus  these  mutants  are  proving  to  be  valu- 
able tools  with  which  to  dissect  the  complexities 
of  the  signal  transduction  pathways  and  their  re- 
lationships to  cellular  responses. 

An  alternative  approach  toward  understanding 
which  signal  transduction  pathways  can  lead  to 
cellular  responses  is  to  express  other  heterolo- 
gous receptors  with  well-defined  signal  transduc- 
tion functions  in  T  cells  and  examine  their  abili- 
ties to  activate  typical  T  cell  responses.  For 
example,  the  human  muscarinic  acetylcholine 
receptor  subtype  1  (HMl)  activates  the  inositol 
phospholipid  pathway  in  cardiac,  smooth  mus- 
cle, and  neuronal  cells.  We  have  expressed  HMl 
in  a  T  cell  line  and  find  that,  when  stimulated,  it 
can  activate  the  inositol  phospholipid  pathway 
but  not  a  tyrosine  kinase  pathway.  Moreover, 
HMl  stimulation  results  in  lymphokine  produc- 
tion and  other  typical  results  associated  with  TCR 
stimulation.  These  results  suggest  that  the  inosi- 
tol phospholipid  pathway,  a  signal  transduction 
pathway  common  to  both  the  TCR  and  HMl ,  does 
regulate  T  cell  responses.  How  this  pathway 
serves  to  regulate  later  cellular  responses  is  being 
addressed  in  ongoing  studies. 

T  cell  activation  is  a  complex  process  that  is 
regulated  by  cell  surface  molecules.  Investiga- 
tion of  the  molecules  and  events  involved  in  the 
activation  of  T  cells  should  lead  to  a  more  com- 
plete understanding  of  T  cell  biology  and  a  more 
rational  approach  to  the  manipulation  of  the  im- 
mune system.  Moreover,  through  the  study  of  the 
activation  of  T  cells,  it  is  likely  that  insight  into 
other  biological  systems  involving  cell  prolifera- 
tion and  differentiation  will  emerge. 
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LIKE  all  other  blood  cells,  lymphocytes — the 
principal  players  in  immune  recognition  of 
self  from  nonself — are  derived  ultimately  from 
stem  cells  in  the  bone  marrow.  It  is  both  biologi- 
cally and  clinically  important  to  delineate  the  de- 
cisions these  bone  marrow  precursors  make  as 
they  pass  through  microenvironments  that  define 
the  type  of  lymphocyte  (or  other  blood  cell)  they 
shall  become.  We  have  focused  on  identifying  the 
earliest  cell  in  the  bone  marrow  that  has  multipo- 
tent  capacity. 

Three  years  ago  we  were  able  to  isolate  the  he- 
matopoietic stem  cell  of  the  mouse.  This  past 
year  we  demonstrated  its  full  developmental  po- 
tential by  transferring  a  single  stem  cell  from  one 
strain  mixed  with  100  stem  cells  from  another 
strain  of  mouse  into  lethally  irradiated  mice  of 
the  second  strain.  Remarkably,  in  a  significant 
fraction  of  animals,  the  stem  cells  gave  rise  to 
blood  cells  and  tissue  lymphocytes.  In  every  case 
where  donor-strain  cells  were  found,  all  blood 
cell  types  had  come  from  the  single  marked  cell. 

In  one-third  of  these  cases,  the  stem  cell  prolif- 
erated; very  large  numbers  of  daughter  blood 
cells  were  generated  through  the  life  of  the  ani- 
mal. In  these  mice,  thousands  of  stem  cells,  de- 
rived from  the  initially  injected  single  cell,  could 
be  retrieved  and  transferred  to  a  second  genera- 
tion of  irradiated  animals,  all  of  whom  were  fully 
reconstituted.  Thus  this  stem  cell  has  a  remark- 
able profile  of  activities,  including  that  of  mas- 
sive self-renewal. 

In  the  past  year  we  have  traced  how  one  of  the 
first  daughter  cells  from  the  stem  cell — the  pre- 
B  lymphocyte  in  bone  marrow — becomes  leuke- 
mic. This  work  was  supported  jointly  by  the  Na- 
tional Cancer  Institute  of  the  National  Institutes 
of  Health  and  the  Howard  Hughes  Medical  Insti- 
tute. It  had  its  origin  several  years  ago  when  my 
laboratory — and  independently  that  of  another 
HHMI  investigator.  Max  Cooper  (University  of 
Alabama  at  Birmingham) — identified  a  molecule 
present  at  low  levels  on  normal  pre-B  lympho- 
cytes and  at  much  higher  levels  in  leukemias  and 
lymphomas  of  the  pre-B  series.  We  called  the  mol- 


ecule the  6C3/BP-1  antigen,  here  abbreviated  Ag 
(to  signify  antigen) . 

Like  hematopoietic  stem  cells,  pre-B  cells  are 
dependent  for  their  growth  on  a  class  of  stromal 
cells  found  embedded  inside  bone  cavities — the 
bone  marrow.  We  had  cloned  the  particular  bone 
marrow  stromal  cells  that  support  normal  pre-B 
lymphocytes  and  had  found  that  they  provide  a 
good  culture  microenvironment  in  which  to 
study  the  process  whereby  Abelson  leukemia 
virus  causes  pre-B  cells  to  become  leukemias 
and/or  lymphomas.  Among  the  products  of  these 
stromal  cells  is  the  protein  growth  factor  inter- 
leukin-7  (IL-7) .  Normal  pre-B  cells  don't  express 
IL-7,  while  the  stromal  cells  that  support  the 
growth  of  these  normal  and  preneoplastic  pre-B 
cells  do. 

The  Ag  is  only  expressed  at  high  levels  on  these 
leukemias  and  lymphomas  at  a  late  stage  of  the 
neoplastic  development  in  a  test  tube  containing 
stromal  cells,  pre-B  target  cells,  and  the  leuke- 
mia virus.  We  were  therefore  interested  in  deter- 
mining whether  the  conversion  from  stromal  cell 
dependence  to  stromal  cell-independent  malig- 
nant growth  resulted  from  tumor  cell  expression 
of  IL-7.  In  fact,  every  Abelson  leukemia  virus- 
induced  pre-B  leukemia  expressed  the  growth 
factor.  More  importantly,  by  specifically  block- 
ing the  expression  of  IL-7,  we  could  prevent  the 
growth  of  these  leukemias  in  tissue  culture. 

Thus  we  have  demonstrated  what  we  think  is  a 
critical  step  in  the  passage  from  normal  cell  to 
leukemic  cell — the  expression  of  a  growth  factor 
for  which  the  tumor  cell  had  a  receptor.  This 
result  confirms  in  part  a  hypothesis  we  pro- 
posed over  15  years  ago:  receptor-mediated 
leukemogenesis. 

We  next  investigated  whether  the  high-level 
expression  of  the  Ag  might  have  some  signifi- 
cance for  the  growth  of  tumor  cells  in  vitro  or  in 
vivo.  Our  collaboration  with  Max  Cooper 
brought  some  striking  insights  to  this  problem. 
His  group  had  cloned  the  cDNA  encoding  the  Ag 
and  had  found  it  to  contain  sequences  in  common 
with  aminopeptidase  A,  a  metalloproteinase. 
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(The  protein  is  an  enzyme  that  can  digest  certain 
other  proteins.)  At  the  same  time  we  had  devel- 
oped several  subclones  from  a  clonal  line  of  Abel- 
son  leukemias,  some  that  expressed  Ag  molecules 
(Ag*")  and  others  that  did  not  express  them 
(Ag"^S) .  The  Ag''*  and  Ag"^^  lines  were  stable  for 
those  properties  over  tens  of  cell  generations  in 
the  test  tube. 

We  wished  to  test  if  Ag*"  cells  might  use  their 
cell-surface  protease  to  digest  the  surrounding 
tissues,  invade  adjacent  blood  vessels,  and  spread 
via  the  bloodstream  to  distant  sites.  Our  experi- 
ments support  this  hypothesis.  Ag*"  subclones 
placed  in  the  thigh  muscle  grew  locally,  invaded 
the  bloodstream  at  our  earliest  observation  inter- 
val (6  days),  and  killed  50  percent  of  the  hosts  in 
17  days.  The  same  number  of  Ag"''^  subclones 
placed  in  the  thigh  muscle  grew  locally,  but  no 
cells  could  be  found  in  the  blood  until  17  days, 
and  the  50  percent  death  point  came  at  28  days. 
Thus,  although  the  tumors  grew  locally  at 


roughly  equal  rates,  only  the  Ag*"  cells  invaded 
the  bloodstream  early  and  caused  early  deaths. 

Perhaps  most  significantly,  the  bloodstream 
leukemia  derived  from  Ag"^^  cells  now  contained 
a  significant  fraction  of  Ag*''  cells — which  did  not 
occur  when  they  grew  in  the  test  tube.  That  the 
early  death  of  animals  was  due  to  the  high  inva- 
sive properties  of  Ag'"  cells  rather  than  some  in- 
trinsic growth  advantage  they  might  possess  was 
made  clear  when  we  showed  that  equal  numbers 
of  Ag'''  and  Ag"^^  cells  injected  directly  into  the 
bloodstream  led  to  death  of  leukemic  hosts  at  al- 
most the  same  time. 

We  now  need  to  know  how  reproducible  this 
condition  is  and  whether  manipulation  of  the  se- 
quence encoding  the  Ag  will  reveal  how  the  en- 
zyme acts  in  malignant  cells  to  promote  their 
invasiveness  and  in  normal  bone  marrow  differ- 
entiating from  hematopoietic  stem  cells  to  B  lym- 
phocytes. And  we  need  to  determine  whether 
there  is  a  human  homologue  active  in  normal  and 
neoplastic  hematopoiesis. 
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CELL  membrane  ion  channels  control  the  in- 
tracellular ionic  environment  and  play  an 
important  role  in  signal  transduction.  In  epithe- 
lial cells  they  are  also  responsible  for  secretion 
and  absorption  of  electrolytes  and  water. 

In  the  lungs,  ion  transport,  including  chloride 
secretion  by  the  airway  epithelium,  controls  the 
quantity  and  composition  of  the  respiratory  tract 
fluid.  Thus  it  helps  effect  mucociliary  clearance, 
a  normal  pulmonary  defense  mechanism. 

Net  vectorial  ion  transport  requires  the  asym- 
metric distribution  of  ion  channels  and  other 
transporters  in  polar  epithelial  cells.  For  active 
chloride  secretion,  chloride  enters  the  cell  at  the 
basolateral  membrane  (interstitial  side  of  the 
cell)  by  a  sodium-coupled  cotransport  process 
that  accumulates  intracellular  chloride  above 
electrochemical  equilibrium.  Chloride  then 
leaves  the  cell  passively,  exiting  across  the  apical 
membrane  (luminal  side  of  the  cell)  via  a  chlo- 
ride channel. 

Intracellular  sodium  is  maintained  at  a  low 
concentration  by  the  basolateral  sodium-potas- 
sium ATPase,  and  potassium  that  enters  on  the 
sodium  pump  is  recycled  by  basolateral  potas- 
sium channels.  Effective  secretion  requires  coor- 
dinated regulation  of  chloride  channels  at  the 
apical  membrane  and  potassium  channels  at  the 
basolateral  membrane.  The  regulation  of  these 
channels  controls  the  rate  of  transepithelial  chlo- 
ride secretion.  An  understanding  of  the  function 
and  regulation  of  epithelial  ion  channels  is  a  ma- 
jor goal  of  this  work. 

Understanding  the  regulation  of  apical  chlo- 
ride channels  is  also  important  in  understanding 
cystic  fibrosis  (CF) ,  a  common  lethal  genetic  dis- 
ease of  Caucasians.  In  CF  the  regulation  of  apical 
chloride  channels  is  defective.  Apical  membrane 
chloride  channels  are  activated  and  chloride  se- 
cretion is  stimulated  by  a  variety  of  hormones  and 
neurotransmitters  that  increase  intracellular  lev- 
els of  cAMP.  In  CF  the  apical  membrane  is  chlo- 
ride impermeable,  an  abnormality  that  may  ex- 
plain several  of  the  pathologic  manifestations.  In 
CF  airway  epithelial  cells,  hormonal  secreta- 
gogues  stimulate  cAMP  accumulation  appropri- 


ately but  fail  to  activate  (or  open)  chloride 
channels. 

In  many  cells  the  biologic  effects  of  cAMP  re- 
sult from  activation  of  cAMP-dependent  protein 
kinase  (PKA) ,  resulting  in  phosphorylation  of  tar- 
get proteins.  To  determine  if  the  enzyme  PKA  reg- 
ulates chloride  channels,  we  used  excised,  in- 
side-out patches  of  membrane  and  added  the 
purified  catalytic  subunit  of  PKA  plus  ATP  to  the 
internal  surface  of  the  channel.  We  found  that 
chloride  channels  were  activated  by  PKA  plus 
ATP  in  cell-free  patches  of  membrane  from  nor- 
mal cells  but  were  not  activated  in  CF  cells. 

Protein  kinase  C  (PKC) ,  another  important  en- 
zyme, regulates  cell  function  by  phosphoryla- 
tion. In  airway  epithelia,  several  secretagogues 
increase  the  cellular  mass  of  diacylglycerol,  an 
activator  of  PKC,  suggesting  that  the  latter  also 
regulates  the  chloride  channel.  In  experiments 
using  excised  cell-free  patches,  we  found  that 
PKC  could  have  a  dual  effect  on  chloride  chan- 
nels. At  a  high  calcium  concentration,  it  inacti- 
vated chloride  channels,  and  at  a  low  calcium 
concentration,  it  activated  them.  In  CF  cells, 
PKC-dependent  channel  inactivation  was  normal, 
but  activation  was  defective.  Thus  it  appears  that 
PKC  also  regulates  the  chloride  channel  by  phos- 
phorylation and  suggests  that  two  different  sites 
on  the  channel,  or  an  associated  membrane  pro- 
tein, may  be  involved  in  regulation,  with  one  of 
these  being  defective  in  CF. 

Although  phosphorylation-dependent  activa- 
tion of  chloride  channels  is  defective  in  CF,  an 
increase  in  the  cell  calcium  concentration  stimu- 
lates chloride  secretion  in  airway  epithelia.  Cal- 
cium-stimulated chloride  secretion  has  the  po- 
tential to  bypass  the  CF  secretory  defect.  We 
found  that  calcium  and  cAMP  activate  different 
chloride  channels  in  the  apical  membrane.  This 
conclusion  is  based  on  four  observations:  1)  cal- 
cium-activated chloride  channels  are  present  in 
the  apical  membrane  of  airway,  but  not  intestinal, 
epithelia;  2)  cAMP-activated  but  not  calcium- 
activated  chloride  channels  are  defective  in  CF 
airway  epithelia;  3)  calcium-  and  cAMP-activated 
channels  have  different  halide  permeabilities; 
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and  4)  an  increase  in  both  second  messengers 
produces  an  additive  increase  in  chloride 
current.  These  results  also  explain  the  previously 
puzzling  observation  that  calcium-stimulated 
chloride  secretion  is  defective  in  CF  intestine  but 
intact,  in  CF  airway:  the  calcium-activated  chlo- 
ride channels  that  could  circumvent  the  chloride 
secretory  defect  in  CF  airway  are  missing  from  the 
apical  membrane  of  intestinal  epithelia. 

The  gene  that  is  defective  in  CF  has  recently 
been  cloned  by  other  investigators.  This  gene 
— the  cystic  fibrosis  transmembrane  conduc- 
tance regulator  (CFTR) — was  cloned  based  on 
a  correct  chromosomal  location,  an  appropriate 
pattern  of  tissue  expression,  and  a  muta- 
tion (AF508)  that  was  present  on  70  percent 
of  CF  chromosomes  and  absent  on  non-CF 
chromosomes. 

We  tested  the  hypothesis  that  expression  of 
normal  CFTR  would  complement  the  CF  pheno- 
type,  by  expressing  nonmutated  CFTR  in  cul- 
tured CF  airway  epithelial  cells.  We  assessed  chlo- 
ride channel  activation  in  single  cells  using  a 
fluorescence  microscopic  assay  and  the  patch- 
clamp  technique.  Expression  of  CFTR,  but  not 
the  AF508  mutation,  corrected  the  chloride 
channel  defect.  These  results  demonstrate  a 
causal  relationship  between  mutations  in  the 
CFTR  gene  and  the  CF  phenotype.  Because  ex- 
pression of  mutated  CFTR  did  not  complement 
the  CF  phenotype,  other  interpretations  are  un- 
likely. Demonstration  that  the  CF  phenotype  can 
be  corrected  in  cultured  CF  cells  suggests  the  fea- 
sibility of  a  therapeutic  approach  based  on  cor- 
rection of  the  underlying  defect. 

Although  correction  of  defective  chloride  trans- 
port in  CF  epithelial  cells  strengthened  the  rela- 
tionship between  CF  and  epithelial  chloride 
channels,  it  did  not  demonstrate  the  function  of 
CFTR.  To  evaluate  its  function,  we  expressed 
CFTR  in  several  heterologous  cells:  HeLa  cells, 
Chinese  hamster  ovary  cells,  and  NIH  3T3  fibro- 
blasts. In  each  of  these  cell  types,  expression  of 
CFTR,  but  not  the  AF508  mutant,  causes  an  in- 
crease in  cAMP-stimulated  anion  permeability 


and  chloride  currents.  These  data  show  that  ex- 
pression of  CFTR  produces  cAMP-activated  chlo- 
ride currents  in  cells  that  do  not  normally  express 
CFTR  and  do  not  normally  have  cAMP-activated 
chloride  channels.  When  CFTR  is  overexpressed 
in  cells  that  already  express  high  levels  of  CFTR, 
the  chloride  permeability  increases  even  further. 
These  data  suggest  that  CFTR  is  itself  a  cAMP-acti- 
vated  chloride  channel.  Other  alternatives  have 
not  yet  been  excluded,  but  all  would  require  that 
this  diverse  group  of  cells  must  contain  quiescent 
chloride  channels  that  can  become  sensitive  to 
cAMP  only  in  the  presence  of  CFTR. 

In  addition  to  studies  of  the  chloride  channel, 
the  laboratory  also  focuses  on  understanding  the 
function  and  regulation  of  basolateral  membrane 
potassium  channels.  The  coordinated  regulation 
of  basolateral  potassium  channels  and  apical 
chloride  channels  is  critical  to  effective  chloride 
secretion.  Our  current  work  suggests  the  pres- 
ence of  two  potassium  channels  at  the  basolateral 
membrane — one  regulated  by  calcium  and  one 
regulated  by  some  other  agent.  The  changes  in 
the  activation  of  the  potassium  channels  are  criti- 
cal to  prevent  changes  in  cell  volume  and  to 
maintain  an  electrically  negative  intracellular 
voltage  that  drives  chloride  out  of  the  cell  across 
the  apical  membrane. 

A  third  aim  of  the  laboratory  focuses  on  the 
regulation  of  intracellular  calcium  concentra- 
tion, which  controls  many  cellular  processes. 
Stimulation  of  single  cells  with  agonists  often 
does  not  produce  a  steady  increase  in  cell  cal- 
cium; instead  it  causes  oscillating,  or  pulsatile, 
elevations  of  calcium.  Such  oscillations  have 
been  observed  in  many  cells,  and  it  has  been  pro- 
posed that  the  frequency  at  which  calcium  oscil- 
lates determines  the  biologic  response.  We  have 
turned  to  the  Xenopus  oocyte  as  a  model  system 
that  allows  us  to  manipulate  the  factors  that  con- 
trol intracellular  calcium.  Studies  of  the  genesis 
of  oscillations  in  cell  calcium  in  Xenopus  oo- 
cytes should  be  relevant  to  understanding  mecha- 
nisms that  underlie  this  important  regulatory  sys- 
tem in  many  cells. 
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Chloride  channel  function  in  lung  cells  from  a  cystic  fibrosis  patient  returns  to  normal  following 
insertion  of  a  normal  gene.  The  gene  for  cystic  fibrosis  transmembrane  conductance  regulator 
( CFTR)  is  mutated  in  patients  with  CP,  and  defective  function  of  chloride  channels  is  the  hall- 
mark of  the  disease.  The  top  row  shows  fluorescence  from  a  cystic  fibrosis  cell  that  is  expressing 
normal  CFTR;  the  bottom  row  shows  a  cell  that  is  expressing  mutated  CFTR.  Fluorescence  indi- 
cates chloride  channel  activity,  with  yellow  and  red  indicating  high  fluorescence  and  blue  and 
green  low  fluorescence.  Fluorescence  increases  faster  in  cells  expressing  the  normal  CFTR  gene 
( top ).  The  difference  between  cells  expressing  the  normal  gene  and  the  mutated  gene  is  most 
noticeable  at  the  4  min  point. 
Research  of  Michael  f.  Welsh. 
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DETAIL  MAP  OF  NF1  GENE 

AAGTTCGTACCGAGTAATCCGAGAC 
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Structure  of  part  of  the  NFl  gene  (above ),  in  relation  to  a  genomic  map  of  the  region  ( below ).  The 
map  is  oriented  with  the  centromere  of  chromosome  17  to  the  left.  Sequenced  exons,  or  coding 
regions,  are  shown  in  orange,  and  the  direction  of  transcription  is  indicated  by  the  large  orange 
arrow.  The  points  where  the  translocation  occurs  in  each  of  two  NFl  patients,  t(l;17)  and 
t(17;22)  (jagged  lines),  are  50-60  kilobases  apart.  These  points  provided  major  clues  to  the 
whereabouts  of  the  NFl  gene. 

The  three  genes  known  to  lie  within  NFl,  and  their  orientation  on  the  chromosome,  are  shown 
in  pink.  Sites  indicated  by  vertical  black  arrows  mark  where  the  enzyme  EcoRI  cleaved  the  geno- 
mic DNA  into  fragments  to  form  the  restriction  map. 

Only  nine  exons  of  NFl  are  shown  here.  Others  exist  in  either  direction  (indicated  by  large 
arrows),  but  within  this  region  the  sequence  of  individual  DNA  base  units  (represented  by  a 
sequence  of  letters  A,  C,  T,  and  G )  translates  into  a  peptide  capable  of  controlling  growth  signals 
within  a  cell. 

Adapted  from  White,  R.L.  1991.  Identification  of  the  neurofibromatosis  gene.  In  Origins  of 
Human  Cancer:  A  Comprehensive  Review.  Cold  Spring  Harbor,  NY:  Cold  Spring  Harbor. 
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THE  search  for  the  gene  that  is  altered  in  a 
dominantly  inherited  disorder,  neurofibro- 
matosis type  1  (NFl),  was  successful  when 
workers  in  our  laboratory  found  in  patients  an 
abnormal  sequence  of  DNA  in  the  relevant  region 
of  chromosome  17.  Knowledge  of  the  normal 
and  abnormal  functioning  of  this  gene  will  eluci- 
date the  biochemical  processes  that  lead  to  dis- 
ease in  patients  with  NFl,  and  will  open  new 
opportunities  to  develop  therapies  for  this 
relatively  common  and  sometimes  devastating 
disorder. 

Furthermore,  the  product  of  this  gene  falls  into 
the  category  of  proteins  that  help  to  control 
growth  and  differentiation,  and  for  that  reason 
the  discovery  has  broad  implications  for  those 
seeking  to  learn  more  about  the  intracellular 
chemical  signals  that  determine  the  basic  activi- 
ties of  life  and  that  may  lead,  when  defective,  to 
the  formation  of  tumors. 

The  region  of  chromosome  1 7  known  to  harbor 
the  NF 1  gene  had  been  narrowed  to  a  point  where 
only  a  few  genes  were  likely  to  exist.  However, 
the  first  candidate  to  be  examined,  EVI2A,  did 
not  differ  in  DNA  sequence  between  NFl  and 
non-NFl  individuals. 

Two  other  genes,  OMGP  and  EVI2B,  were  sub- 
sequently identified.  OMGP  was  especially  inter- 
esting because  it  is  the  coding  element  for  a  pro- 
tein expressed  in  cells  of  the  central  nervous 
system — cells  thought  to  be  counterparts  of  the 
peripheral  Schwann  cells  affected  in  NFl.  More- 
over, OMGP  lies  very  near  to  the  site  of  a  chromo- 
somal break  in  a  patient  with  NFl,  a  mutation 
assumed  to  be  the  cause  of  her  disease.  Neverthe- 
less, OMGP  and  EVI2B  were  eventually  elimi- 
nated as  candidates  on  the  same  grounds  as 
EVI2A. 

All  three  genes  lie  in  the  same  orientation  on 
the  chromosome.  That  is,  they  are  all  transcribed, 
or  "read,"  in  the  same  direction  when  the  ge- 
netic code  is  being  translated  to  form  a  protein 
within  the  cell. 

Finally,  examination  of  DNA  from  many  individ- 
uals detected  three  deletion  mutations  specific  to 


NFl,  within  the  region  defined  by  translocations 
in  two  unrelated  patients.  Two  of  these  deletions 
of  DNA  material  were  large  enough  to  disrupt  one 
or  another  of  the  known  genes.  The  third,  how- 
ever, did  not  affect  any  of  them.  This  observation 
suggested  that  the  mutation  in  that  patient  was 
disrupting  a  fourth  gene.  Furthermore,  the  se- 
quence affected  by  this  mutation  was  very  similar 
to  one  present  in  mouse  DNA.  (Conservation  of 
sequence  in  evolution  between  widely  disparate 
species  often  signals  the  presence  of  an  important 
coding  element.) 

The  conserved  sequence  was  used  to  seek  out 
complementary  DNA  (cDNA) — that  is,  recombi- 
nant DNA  copies  of  an  active  gene — from  a  "li- 
brary" of  clones  reflecting  gene  expression  in 
human  fetal  brain  tissue.  When  the  DNA  se- 
quence of  the  cDNA  corresponding  to  this  new 
gene  was  compared  among  normal  individuals 
and  several  NFl  patients,  a  number  of  mutations 
involving  single  base  pairs  of  DNA  were  detected, 
and  at  least  some  of  these  changes  were  of  a  kind 
expected  to  be  damaging  to  the  protein  product. 
These  results,  combined  with  other  information, 
indicated  that  the  NFl  gene  had  at  last  been 
found. 

The  remarkable  finding  of  three  other  genes 
within  the  NFl  gene,  all  oriented  in  the  opposite 
direction  to  it  on  the  chromosome,  appears 
unique  so  far  in  human  molecular  biology.  The 
significance  of  this  phenomenon,  and  the  possi- 
ble effect (s)  of  mutations  in  the  NFl  gene  upon 
expression  of  any  of  the  embedded  genes,  are 
subjects  for  further  investigation. 

The  NFl  gene  is  very  large,  with  coding  re- 
gions, or  exons,  scattered  over  a  wide  region  of 
genomic  DNA.  To  date,  several  thousand  base 
pairs  of  its  sequence  have  been  determined.  The 
first  discovery  to  emerge  from  the  sequence  data 
was  a  similarity  between  the  predicted  protein 
product  of  the  NFl  gene  and  the  catalytic  region 
of  certain  proteins,  called  GTPase-activating  pro- 
teins or  GAPs,  that  are  known  to  interact  with 
growth  regulators  within  the  cell.  Experiments 
showed  that,  in  fact,  the  NFl  protein  could  sub- 
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stitute  for  GAPs  in  switching  oflf  growth  signals 
that  might  otherwise  resuh  in  uncontrolled  pro- 
liferation of  cells  and  produce  a  tumor. 

Thus  the  NFl  product  may  have  a  role  in  hu- 
man biology  that  is  much  broader  than  its  rele- 
vance-to  neurofibromatosis,  as  important  as  that 


will  prove  to  be  for  NFl  sufferers  and  their  fami- 
lies. The  NFl  protein  is  likely  to  open  new  ave- 
nues of  inquiry  about  normal  cellular  processes 
and  about  events  that  lead  to  cancer.  Projects  to 
investigate  several  of  these  avenues  are  now 
under  way  in  our  laboratory. 
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Tcell  recognition  occurs  when  cell  surface  his- 
tocompatibility glycoproteins  present  anti- 
gens, processed  to  small  peptides,  to  an  antibody- 
like molecule  on  the  T  cell  receptor.  Each 
organism  has  only  a  small  number  of  different 
histocompatibility  molecules  (probably  less  than 
a  dozen),  so  that  each  membrane  glycoprotein 
must  be  able  to  "present"  many,  possibly  thou- 
sands, of  different  antigenic  peptides  to  thou- 
sands or  more  distinct  T  cell  receptors  through- 
out the  immunological  life  of  the  individual. 

In  the  past  few  years  our  laboratory  has  deter- 
mined the  three-dimensional  structure  of  two  hu- 
man-class histocompatibility  antigens  (HIA)  by 
high -resolution  x-ray  crystallography.  This  re- 
sulted in  the  visualization  of  antigen  presenta- 
tion, when  an  obvious  peptide-binding  cleft  was 
revealed  and  shown  to  be  filled  with  a  presumed 
mixture  of  peptide  antigens.  Comparison  of  the 
structures  of  two  polymorphic  alleles  of  the  hu- 
man histocompatibility  antigen  refined  to  2.6  A 
resolution  also  suggested  that  the  basis  for  allelic 
specificity,  or  "restriction,"  is  in  a  set  of  pockets 
in  the  peptide-binding  cleft. 

We  recently  crystallized  one  of  the  histocom- 
patibility antigens  complexed  with  a  peptide 
from  influenza  virus.  This  required  reconstitut- 
ing HIA  from  its  two  polypeptide  subunits  in  the 
presence  of  a  "restricted"  peptide.  It  appears 
that  the  HIA  molecule  required  peptide  to  be  re- 
constituted efficiently  in  vitro,  arguing  that  pep- 
tide binding  may  be  linked  to  subunit  assembly  or 
stability. 

In  the  past  year,  in  collaboration  with  Jack  Stro- 


minger  and  Joan  Gorga,  we  crystallized  a  number 
of  class  II  histocompatibility  antigens  and  the 
complexes  formed  with  superantigens. 

Our  laboratory  is  also  studying  how  influenza 
virus  infects  cells.  About  10  years  ago  we  deter- 
mined the  three-dimensional  structure  of  the  in- 
fluenza virus  hemagglutinin  (HA),  the  viral  gly- 
coprotein responsible  for  binding  the  virus  to 
cells  and  for  fusing  the  viral  membrane  to  a  cellu- 
lar membrane  to  effect  infectious  entry.  Recently 
we  determined  the  structure  of  a  series  of  com- 
plexes between  the  HA  and  derivatives  of  sialic 
acid,  the  cellular  receptor  for  influenza  virus. 
Our  laboratory  has  synthesized  a  number  of 
these  new  ligands  to  confirm  an  atomic  model 
for  virus-cell  binding  that  we  proposed  two 
years  ago. 

This  year,  in  collaboration  with  Jeremy 
Knowles,  Gary  Glick,  and  George  Whitesides,  we 
have  analyzed  the  binding  of  virus  to  multiden- 
tate  ligands  and  have  discovered  some  dimeric 
and  polymeric  ligands  with  increased  affinity.  In 
the  process  a  second  binding  site  has  been  lo- 
cated on  the  HA  at  an  interface  between  domains 
of  the  molecule,  which  although  probably  not 
physiological,  may  offer  opportunities  for  the  de- 
sign of  a  ligand  to  stabilize  the  interface  against 
the  conformational  change  required  for  the  HA's 
membrane  fusion  activity. 

A  number  of  other  crystal lographic  and  bio- 
chemical studies  are  under  way  on  influenza  C 
virus,  on  a  low-pH  fusion-active  conformation  of 
the  influenza  HA,  on  trypanosome  surface  anti- 
gens, and  on  the  glycoprotein  of  HIV-1  in  com- 
plex with  its  receptor,  CD4. 
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THE  general  goal  of  our  research  is  to  gain  a 
better  understanding  of  blood  cell  formation 
and  of  how  to  apply  the  knowledge  to  the  treat- 
ment of  relevant  diseases.  The  work  in  progress  is 
roughly  divided  into  research  on  the  role  of  the 
bone  marrow  environment  in  maintaining  blood- 
forming  cells  in  a  normal  way,  on  the  effect  of 
directing  the  expression  of  growth-regulating 
genes  in  specific  blood  cells  in  murine  models, 
and  on  the  development  of  gene  transfer  as  a  po- 
tential method  of  correcting  human  diseases  in 
which  blood  cells  are  affected. 

Investigating  the  Bone  Marrow 
Environment 

Blood  cells  are  produced  and  delivered  to  the 
circulation  by  precursor,  or  stem,  cells  in  the 
bone  marrow.  Balance  is  carefully  maintained  be- 
tween production  and  utilization.  Although  ad- 
vances have  recently  been  made  in  explaining  the 
control  of  production  of  daughter  cells,  the  pro- 
duction of  the  stem  cells  is  still  little  understood. 

Bone  marrow  stem  cells  reside  in  a  complex 
environment  comprising  many  specialized  cells, 
called  stromal  cells.  This  microenvironment  can 
be  recapitulated  in  vitro.  However,  investigation 
of  stem  cell  interactions  with  the  microenviron- 
ment has  been  hampered  in  the  past  by  the  com- 
plex nature  of  the  stromal  components.  Using  mo- 
lecular methods,  our  laboratory,  in  collaboration 
with  Vikrum  Patel  at  Northwestern  University, 
has  generated  permanent  cell  lines  derived  from 
this  stromal  milieu,  and  some  of  these  lines  have 
been  shown  to  maintain  blood  formation.  More- 
over, they  have  led  to  the  identification  of  impor- 
tant proteins  of  the  bone  marrow  that  help  to  an- 
chor primitive  blood-forming  cells  in  the 
appropriate  locations.  Interaction  of  blood-form- 
ing cells  with  such  proteins  may  be  important  in  a 
variety  of  normal  and  abnormal  functions,  in- 
cluding control  of  blood  formation,  regeneration 
of  stem  cells  after  marrow  transplantation,  and 
movement  of  leukemia  cells  out  of  the  bone 
marrow  cavity. 

In  another  project,  cell  lines  generated  from 


mice  exhibiting  a  severe  genetic  abnormality  in 
the  bone  marrow  microenvironment  have  helped 
to  identify  the  causative  molecular  lesion  as  well 
as  the  normal  protein  that  this  defective  gene 
should  produce.  Work  done  in  collaboration 
with  Kris  Zsebo  at  Amgen  has  shown  that  the  pro- 
tein is  an  important  growth  factor  for  very  primi- 
tive blood-forming  cells.  It  may  prove  to  be  a 
valuable  therapeutic  tool  for  bone  marrow 
diseases. 

In  addition,  Steven  Paul  of  our  laboratory,  in 
collaboration  with  Yu  Chung  Yang  (currently  at 
Indiana  University)  and  Genetics  Institute,  has 
utilized  a  stromal  cell  line  generated  from  pri- 
mate bone  marrow  to  identify  and  clone  another 
new  growth  factor  affecting  blood-forming  cells 
and  lymphocytes.  This  factor,  called  interleukin- 
1 1  (IL-1 1),  appears  to  stimulate  the  growth  of  B 
lymphocytes  and  the  bone  marrow  cells  that  give 
rise  to  blood  platelets.  Further  characterization 
of  the  effects  of  this  growth  factor  in  mice  is 
under  way. 

Regulated  Expression  of  Growth-regulating 
Genes  in  Hematopoietic  Cells 

In  order  to  study  the  effects  of  certain  growth- 
regulating  genes  on  the  behavior  of  hematopoi- 
etic stem  cells,  the  laboratory  is  utilizing  embry- 
onic stem  (ES)  cells  to  generate  transgenic  mice 
containing  foreign  genes.  Such  genes  are  easily 
introduced  into  cultured  cells.  Upon  transfer  of 
the  manipulated  cells  into  embryos,  ES  cells  con- 
tribute to  all  the  tissues  of  the  resulting  animals. 
This  new  and  powerful  technology  allows  the  ex- 
pression of  new  genes  in  specific  hematopoietic 
cells  (such  as  red  or  white  blood  cells)  in  adult 
mice. 

Deniz  Toksoz  of  our  laboratory  is  investigating 
the  effects  of  expression  of  specific  growth  factor 
genes  on  the  development  of  blood-forming  cells 
in  the  early  mouse  embryo.  The  earliest  stage  of 
blood  formation  occurs  in  the  fetal  yolk  sac. 
Growth-regulating  genes,  such  as  macrophage  or 
granulocyte  colony-stimulating  factors,  have 
well-defined  pharmacologic  activities  on  hemato- 
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poietic  cells  in  adult  animals.  However,  less  is 
known  about  the  role  of  these  factors  in  normal 
functioning  of  the  hematopoietic  system  during 
fetal  development.  Through  use  of  ES  cell  lines 
that  have  been  programmed  to  make  large 
amou«ts  of  these  growth  factors  during  the  fetal 
yolk  sac  phase  of  embryo  development,  new  in- 
sights into  the  control  of  blood  formation  in  the 
embryo  may  be  gained. 

Gene  Transfer  and  Somatic  Gene  Therapy 

For  several  years  we  have  been  studying  the  use 
of  gene  transfer  to  treat  human  genetic  diseases. 
Slow  but  steady  progress  has  been  made,  using 
viral  vehicles  (called  vectors)  to  transfer  a 
correcting  gene  into  hematopoietic  stem  cells. 
The  laboratory  has  studied  the  transfer  of  adeno- 
sine deaminase  (ADA) ,  a  lack  of  which  can  lead  to 
a  fatal  disease  called  severe  combined  immunode- 
ficiency (SCID) .  Jane  Apperley  and  Barry  Luskey, 
in  a  continuing  collaboration  with  Stuart  Orkin 


(HHMI,  at  the  Children's  Hospital,  Boston),  has 
demonstrated  that  ADA  can  be  transferred  into  he- 
matopoietic stem  cells.  Once  these  cells  are 
transplanted  into  irradiated  mice,  the  human  ADA 
gene  expresses  functional  protein  at  nearly  the 
levels  of  the  endogenous  mouse  ADA  gene.  Such 
levels  are  sustained  in  75  percent  of  the  animals 
for  longer  than  six  months,  demonstrating  stable 
expression  of  the  introduced  gene  after  complete 
reconstitution  of  the  hematopoietic  system  by  the 
transplanted  cells.  This  work  demonstrates  that 
the  introduction  and  expression  of  a  vector-borne 
gene  appears  to  be  a  viable  approach  to  the  cor- 
rection of  some  severe  diseases  of  the  hematopoi- 
etic system  in  humans. 

In  addition,  Barry  Luskey,  in  collaboration  with 
Arthur  Neinhuis  and  David  Bodine  at  the  National 
Institutes  of  Health,  has  begun  studies  aimed  at 
transferring  the  ADA  gene  into  monkey  bone 
marrow  cells.  These  studies  are  very  important 
prior  to  any  use  of  retroviral  technology  to  treat 
human  disease. 
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DURING  embryonic  development,  tissues 
grow  by  the  rapid  proliferation  of  their  con- 
stituent cells.  An  important  mechanism  by  which 
intricate  patterns  of  embryonic  cell  growth  are 
regulated  involves  the  local  production  of  pro- 
tein growth  factors  that  stimulate  cell  prolifera- 
tion. In  adults  the  cells  of  most  tissues  proliferate 
slowly,  dividing  at  a  minimal  rate  that  is  just  suffi- 
cient to  replace  senescent  cells.  However,  in  a 
few  specific  situations  (for  example,  after  tissue 
injury)  the  rate  of  cell  proliferation  can  increase 
dramatically.  This  exceptionally  rapid  cell  divi- 
sion is  similar  to  embryonic  cell  proliferation  and 
appears  to  be  stimulated  by  some  of  the  same 
growth  factors. 

Our  research  group  is  investigating  the  action 
of  platelet-derived  growth  factor  (PDGF).  This 
potent  growth  factor  for  fibroblast  and  smooth 
muscle  cells  is  found  in  platelets  and  is  released 
at  sites  of  tissue  injury.  PDGF  is  also  produced  by 
other  tissues,  including  endothelial  cells  that  line 
the  inner  surfaces  of  blood  vessels.  In  this  context 
PDGF  is  likely  to  play  a  major  role  in  stimulating 
the  proliferation  of  smooth  muscle  cells  that  con- 
stitute atherosclerotic  plaques.  Its  role  in  vascu- 
lar proliferation  appears  to  be  especially  promi- 
nent in  the  recurrent  blockage  of  coronary 
arteries  that  occurs  after  clinical  interventions 
such  as  angioplasty  or  atherectomy,  which  are  un- 
dertaken in  an  attempt  to  restore  blood  flow 
through  vessels  narrowed  by  atherosclerosis. 
PDGF  also  plays  a  role  in  the  growth  of  some  tu- 
mors. At  least  one  monkey  sarcoma  is  caused  by 
the  aberrant  production  of  PDGF,  which  stimu- 
lates the  tumor  cells  to  grow  in  an  uncontrolled 
fashion. 

Like  other  growth  factors,  PDGF  acts  on  cells 
by  first  binding  to  specific  receptor  sites  located 
on  the  cell  surface.  This  interaction  of  PDGF  with 
its  receptor  is  transmitted  as  a  signal  across  the 
cell  membrane  and  triggers  a  series  of  complex 
reactions  inside  the  cell  that  culminate  in  DNA 
synthesis  and  cell  division.  To  study  the  mecha- 
nism of  signal  transmission  by  the  PDGF  receptor, 
we  purified  the  receptor  from  mouse  cells, 
cloned  the  gene  that  encodes  the  mouse  receptor 


protein,  expressed  this  receptor  in  cells  that  nor- 
mally lack  PDGF  receptors,  and  demonstrated 
that  this  expressed  receptor  mimics  the  actions  of 
native  PDGF  receptors  and  mediates  all  of  the 
known  responses  to  PDGF. 

The  PDGF  receptor  is  anchored  at  the  surface 
of  the  cell  and  is  oriented  so  that  approximately 
half  of  the  receptor,  the  PDGF-binding  domain,  is 
located  outside  the  cell,  and  the  other  half  of  the 
receptor  is  located  inside  the  cell.  The  receptor 
appears  to  consist  of  several  distant  regions, 
termed  domains,  that  have  distinct  functions.  Us- 
ing the  cloned  gene  for  the  receptor,  we  can  pro- 
duce individual  domains  of  the  receptor  protein 
and  study  the  functions  of  these  domains.  For  ex- 
ample, we  have  produced  receptor  fragments 
that  contain  the  PDGF-binding  domain  but  lack 
the  other  portions  of  the  molecule.  To  localize 
more  precisely  the  portion  of  the  receptor  that  is 
essential  for  binding  PDGF,  we  are  now  prepar- 
ing even  smaller  versions  of  the  binding  domain 
by  deleting  portions  of  the  molecule.  When  a 
minimal  domain  for  binding  is  defined,  we  will 
study  the  three-dimensional  structure  of  this  sim- 
plified molecule  and  use  this  information  to  de- 
sign agents  that  can  block  the  binding  of  PDGF  to 
its  receptor.  These  agents  should  function  as 
blockers  of  the  actions  of  PDGF  and  will  facilitate 
the  study  of  the  role  of  PDGF  in  atherosclerosis 
and  cancer. 

One  of  the  major  problems  in  growth  factor 
research  has  been  to  determine  how  the  portion 
of  the  receptor  inside  the  cell  senses  that  the  bind- 
ing domain  on  the  outside  of  the  cell  has  inter- 
acted with  PDGF.  We  have  recently  found  that 
the  transmission  of  the  signal  from  the  outside 
domain  to  the  inside  of  the  cell  involves  two  ma- 
jor steps.  First,  when  a  receptor  molecule  binds 
PDGF,  the  receptor  pairs  with  another  identical 
receptor  molecule  to  form  a  receptor  dimer.  Each 
of  the  two  receptor  molecules  in  the  dimeric 
complex  phosphorylates  its  partner,  thereby 
modifying  the  partner.  The  phosphorylation  reac- 
tion results  in  the  addition  of  a  phosphate  group 
to  the  partner  and  is  accomplished  by  a  region  of 
the  receptor  termed  the  kinase  domain.  We  have 
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designed  and  produced  mutant  receptors  that 
have  normal  PDGF-binding  domains  but  defec- 
tive kinase  domains.  When  these  mutants  are  in- 
troduced into  cells  that  have  normal  receptors,  a 
dimer  (heterodimer)  is  formed  between  the  nor- 
mal and  kinase-defective  receptors.  The  normal 
and  mutant  receptors  in  the  heterodimer  com- 
plex cannot  phosphorylate  each  other  and  cannot 
transmit  the  signals  required  to  initiate  cell 
growth.  These  experiments  prove  that  formation 
of  a  dimer  consisting  of  two  normal  receptors  is 
required  for  proper  functioning  and  signal  trans- 
duction of  the  receptor.  By  introducing  mutant 
receptors  into  specific  tissues  of  animals  we  hope 
to  be  able  to  block  the  function  of  the  normal 
receptors  and  assess  the  role  of  PDGF  in  physio- 
logical processes  and  in  disease  states. 

When  the  dimerized  receptor  is  phosphory- 
lated,  the  second  major  step  in  signal  transduc- 
tion occurs.  The  phosphorylated  receptor  physi- 
cally binds  to  signaling  molecules  that  are  inside 
the  cell.  We  have  recently  found  that  the  interac- 
tion between  the  receptor  and  signaling  mole- 
cules occurs  at  the  phosphorylation  sites  on  the 
receptor.  Using  information  about  the  structures 
of  these  sites,  we  are  designing  ways  to  disrupt 
the  interaction  between  the  receptors  and  the  sig- 
naling molecules.  We  are  also  using  the  receptors 
as  a  tool  for  discovering  previously  unidentified 
signaling  molecules  that  mediate  that  prolifera- 
tive response  to  PDGF.  Recently  we  identified 
one  of  these  molecules,  phosphatidylinositol-3 
kinase  (PI3  kinase),  that  appears  to  play  an  im- 
portant role  in  PDGF-stimulated  proliferation. 
Other  investigators  have  found  that  PI3  kinase  is 
also  important  in  the  cell  transformation  caused 


by  some  mammalian  oncogenic  viruses.  Using  the 
receptor  as  a  probe,  we  recently  purified  the  PI3 
kinase  and  cloned  the  gene  for  this  signaling  mol- 
ecule. We  hope  that  by  studying  the  interaction 
of  the  receptor  and  PI  3  kinase  we  can  understand 
an  important  set  of  reactions  that  are  involved  in 
regulating  cell  growth. 

We  recently  have  begun  to  study  the  fibroblast 
growth  factors  (FGF).  These  factors  appear  to 
play  important  roles  in  the  earliest  stages  of  em- 
bryogenesis  and  in  angiogenesis  (the  formation 
of  new  blood  vessels) .  The  development  of  new 
vessels  can  be  beneficial  (e.g.,  in  the  setting  of 
atherosclerotic  narrowing  of  blood  vessels  in  the 
heart)  or  deleterious  (e.g.,  in  the  formation  of 
new  vessels  that  supply  nutrients  to  tumors  or  in 
the  vascular  proliferation  that  occurs  in  the  eyes 
of  diabetic  patients) .  We  have  identified  some  of 
the  FGF  receptors  and  are  now  examining  their 
mechanisms  of  action. 

The  long-range  goal  of  these  studies  is  to  probe 
the  role  of  growth  factors  in  normal  embryonic 
development,  in  tissue  repair,  and  in  prolifera- 
tive diseases.  Using  the  tools  of  molecular  biol- 
ogy, cell  biology,  and  protein  chemistry,  we  and 
other  research  groups  are  identifying  the  factors, 
receptors,  and  regulatory  molecules  involved  in 
these  processes.  Studies  of  the  spatial  and  tem- 
poral distribution  of  the  growth  factors  and  re- 
ceptors in  normal  and  diseased  tissues  will  pro- 
vide insight  into  the  function  of  these  molecules. 
By  understanding  the  molecular  details  of  the 
protein-protein  interactions  involved  in  growth 
factor  action,  it  may  be  possible  to  devise  new 
therapeutic  strategies  to  treat  proliferative 
diseases. 
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GENE  replacement  therapy  is  being  consid- 
ered for  the  treatment  of  a  variety  of  ac- 
quired and  inherited  diseases.  These  novel  thera- 
peutic modalities  involve  the  transfer  of  genetic 
material  into  somatic  tissues  of  affected  individ- 
uals. A  major  focus  of  my  laboratory  is  the  devel- 
opment of  new  therapies  for  the  treatment  of  car- 
diopulmonary diseases  based  on  gene  transfer 
into  hepatocytes  and  airway  epithelial  cells. 

Familial  hypercholesterolemia  (FH)  is  an  auto- 
somal dominant  disorder  in  humans  that  is  an  ex- 
cellent model  for  developing  gene  replacement 
therapies  of  hyperlipidemic  states.  Patients  with 
the  homozygous-deficient  form  of  FH  have  severe 
hypercholesterolemia  and  suffer  life-threatening 
coronary  artery  disease  in  childhood  that  is  refrac- 
tory to  conventional  medical  therapies.  The  mo- 
lecular basis  of  this  disorder  is  a  systemic  defi- 
ciency in  the  receptor  that  degrades  low-density 
lipoprotein  (LDL),  the  primary  carrier  of  plasma 
cholesterol.  Hepatocytes  are  the  appropriate  tar- 
get for  gene  transfer  in  this  disease,  because  the 
liver  is  the  primary  organ  responsible  for  LDL  me- 
tabolism and  cholesterol  excretion. 

We  have  used  an  animal  model  for  FH — the 
Watanabe  heritable  hyperlipidemic  (WHHL)  rab- 
bit— to  develop  different  types  of  liver-directed 
gene  replacement  therapies  for  this  disease.  One 
approach  is  similar  in  concept  to  the  well- 
described  bone  marrow-directed  gene  therapies. 
This  ex  vivo  method  involves  isolating  hepato- 
cytes from  a  genetically  deficient  animal,  trans- 
ferring functional  genetic  material  into  the  hepa- 
tocyte  in  vitro,  and  transplanting  the  genetically 
modified  hepatocytes  back  into  the  affected  re- 
cipient. We  have  used  recombinant  retroviruses 
to  transfer  a  functional  human  LDL  receptor  gene 
stably  into  hepatocytes  derived  from  WHHL  rab- 
bits. Transplantation  of  these  cells  into  WHHL 
rabbits  leads  to  substantial  decreases  in  serum 
cholesterol  for  over  six  months. 

An  alternative  and  less-invasive  approach  is  to 
deliver  a  functional  LDL  receptor  gene  into  the 
hepatocytes  in  vivo.  We  have  collaborated  with 
George  and  Cathy  Wu  (University  of  Connecti- 
cut) to  develop  an  in  vivo  gene  delivery  system 


that  is  based  on  interactions  with  hepatocyte- 
specific  receptors.  Using  this  approach  we  can 
deliver  reporter  genes  specifically  to  hepatocytes 
in  vivo  and  obtain  high-level  expression  in  the 
liver  for  at  least  four  months  following  gene 
transfer.  Administration  of  a  gene  transfer  sub- 
strate capable  of  expressing  LDL  receptor  into  the 
circulation  of  WHHL  rabbits  led  to  significant  re- 
ductions in  the  level  of  serum  cholesterol.  He- 
patic overexpression  of  LDL  receptor  by  gene 
transfer  can  potentially  prevent  atherosclerotic 
disease  in  FH  and  other  hyperlipidemic  states. 

Cystic  fibrosis  (CF),  another  inherited  disease, 
is  characterized  by  abnormal  salt  and  water  trans- 
port that  leads  to  pathology  within  the  pancreas 
and  lungs.  CF  is  the  most  common  congenital  dis- 
ease among  Caucasians,  with  a  prevalence  of  1  in 
2,000  births.  The  recent  discovery  of  the  gene 
that  causes  CF  suggests  a  new  therapeutic  strategy 
in  which  normal  functioning  CF  genes  are  di- 
rectly introduced  into  pancreatic  or  respiratory 
cells  of  affected  patients. 

As  a  first  step  in  the  development  of  a  genetic 
cure  for  CF,  we  have  attempted  to  correct  the 
physiological  abnormality  in  cells  from  a  patient 
with  CF  by  introducing  into  them  a  normal  CF 
gene.  Replication-defective  viruses  were  used  to 
shuttle  a  normal  CF  gene  into  pancreatic  cells  of  a 
patient  with  CF.  Prior  to  gene  transfer,  the  cells 
manifested  the  typical  abnormalities  associated 
with  this  disease  (i.e.,  decreased  transport  of  salt 
across  the  membrane).  Following  gene  transfer, 
the  cells  regained  normal  regulation  of  salt  trans- 
port. This  demonstrates  the  feasibility  of  gene 
therapy  of  CF  at  a  cellular  level. 

We  have  now  entered  the  second  and  more  dif- 
ficult phase  of  the  development  of  CF  gene  thera- 
pies, in  which  we  are  attempting  to  deliver  nor- 
mal CF  genes  to  the  relevant  somatic  cells  of  the 
body,  the  cells  that  line  the  airways  of  the  lung. 
The  overall  strategy  is  to  develop  a  gene  transfer 
substrate  capable  of  delivering  a  functional  CF 
gene  to  airway  cells  in  vivo.  Direct  inhalation  of 
the  gene  transfer  substrate  could  provide  a  nonin- 
vasive way  for  delivering  the  genes  to  the  appro- 
priate cell. 
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OUR  ability  to  resist  a  wide  range  of  infectious 
agents  depends  on  the  normal  function  of 
the  immune  system.  The  humoral  portion  of  this 
system  is  responsible  for  the  production  of  spe- 
cific antibody  molecules  from  B  lymphocytes. 
Millions  of  B  lymphocytes  are  produced  daily  in 
the  fetal  liver  or  adult  bone  marrow  from  primi- 
tive stem  cells,  and  a  careful  balance  between 
growth  rate  and  development  must  be  main- 
tained for  the  stem  cells  and  their  progeny.  Too 
low  a  growth  rate  can  result  in  immunodefi- 
ciency; too  high  a  growth  rate,  in  various  types  of 
leukemia  or  lymphoma.  Our  laboratory  has  con- 
centrated on  defining  the  growth  control  mecha- 
nisms that  regulate  these  stem  cells  and  their 
production  of  B  lymphocytes  under  normal  cir- 
cumstances and  in  various  disease  states. 

The  ABL  Oncogene  in  Human  and 
Murine  Leukemias 

The  ABL  oncogene  was  first  isolated  as  the  ac- 
tive genetic  element  of  the  Abelson  murine  leu- 
kemia virus.  This  agent  is  capable  of  causing  a 
wide  range  of  leukemias  in  mice,  including  those 
of  immature  cell  types  within  the  B  cell  lineage. 
The  biological  properties  of  the  ABL  gene  prod- 
uct depend  on  its  activity  as  a  tyrosine-specific 
protein  kinase. 

The  human  homologue  of  the  ABL  gene  has 
now  been  strongly  implicated  in  the  pathogene- 
sis of  a  family  of  human  leukemias  that  harbor  a 
specific  cytogenetic  abnormality  called  the  Phila- 
delphia chromosome  or  Ph  1 .  This  chromosome 
translocation  uses  messenger  RNA  splicing  to  join 
portions  of  chromosome  22  sequences  encoding 
a  gene  called  BCR  to  a  portion  of  the  coding  se- 
quences of  the  ABL  gene  on  chromosome  9.  The 
tyrosine  kinase  activity  of  the  chimeric  BCR/ABL 
gene  product  is  evoked  under  these  circum- 
stances and  strongly  correlates  with  the  transfor- 
mation activity  of  the  protein. 

Two  different  forms  of  BCR/ABL  protein  can 
occur,  depending  on  the  precise  position  of  the 
chromosomal  breakpoints.  In  human  chronic  my- 
elogenous leukemia,  a  larger  protein  product 


called  P2 1 0  BCR/ABL  is  produced,  and  in  cases  of 
Phi -positive  acute  lymphocytic  leukemia,  a 
PI 85  BCR/ABL  protein  product  is  commonly 
found.  Surprisingly,  the  precise  contribution  of 
BCR  sequences  determines  the  efficiency  of  tyro- 
sine kinase  activity  of  the  ABL  segment  and  the 
malignant  potential  of  the  gene  product. 

Detailed  site-directed  mutagenesis  studies  on 
molecularly  cloned  copies  of  the  BCR/ABL  genes 
show  that  BCR  sequences  are  essential  to  activate 
the  ABL  tyrosine  kinase  activity  and  produce  a 
functional  oncogene.  The  precise  function  of 
BCR  in  the  normal  cell  is  not  known,  but  several 
lines  of  evidence,  including  sequence  homolo- 
gies, autophosphorylation  activity,  and  inactiva- 
tion  with  nucleotide  analogues,  suggest  that  BCR 
may  be  a  protein  kinase  itself.  We  are  evaluating 
the  possibility  that  the  enzymatic  action  of  BCR  is 
required  to  activate  the  enzyme  function  of  ABL 
in  the  chimeric  oncogene. 

The  normal  ABL  oncogene  products  are  ex- 
pressed in  many  cell  types,  but  their  role  in  mam- 
malian cell  physiology  is  unknown.  Gross  struc- 
tural changes  can  activate  their  oncogenic 
potential.  It  has  been  difficult  to  identify  more 
subtle  mutations  that  might  activate  ABL  in  other 
types  of  leukemias  because  the  normal  ABL  gene 
can  be  toxic  to  most  rodent  fibroblast  cell  types 
when  highly  expressed  via  gene  transfection  tech- 
niques. The  precise  mechanism  of  toxicity  is  not 
established,  but  probably  relates  to  a  cell  cycle 
blocking  effect. 

To  circumvent  this  problem,  we  have  used  a 
new  approach  to  preparing  retroviral  expression 
stocks  developed  by  Dan  Littman  and  his  col- 
leagues (HHMI,  University  of  California,  San  Fran- 
cisco). Full-length  cDNA  copies  of  the  cellular 
ABL  genes  are  cloned  into  a  retroviral  vector  that 
has  been  modified  to  allow  amplification  in  an 
acute  transfection  system.  Retroviral  particles  are 
produced  that  can  transmit  the  cellular  ABL  gene 
at  high  efficiency  to  a  wide  variety  of  cell  types. 
Using  this  system,  we  have  been  able  to  select  for 
transformed  clones  that  harbor  new  classes  of  ac- 
tivating mutations  in  ABL.  We  will  further  ana- 
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lyze  such  mutant  strains  for  their  biological  po- 
tential and  test  for  analogous  mutations  in  natu- 
rally occurring  human  leukemias. 

The  activation  of  the  cellular  ABL  gene  to  on- 
cogenic forms  correlates  with  the  stable  in  vivo 
modification  of  the  protein  with  phosphotyro- 
sine.  The  normal  ABL  proteins  are  not  detectably 
modified  with  phosphotyrosine  in  vivo,  al- 
though they  can  autophosphorylate  themselves 
in  vitro.  We  have  recently  demonstrated  that  a 
specific  intracellular  inhibitor  suppresses  the  ty- 
rosine kinase  activity  of  the  normal  cellular  ABL 
proteins,  but  does  not  effect  the  oncogenic 
forms.  If  normal  ABL  proteins  are  hyperex- 
pressed,  they  can  overcome  this  inhibitor  and 
modify  themselves  on  tyrosine  in  vivo.  In  further 
studies  on  this  inhibitor,  we  will  try  to  define  its 
mechanism  of  differential  action  on  the  normal 
and  oncogenic  forms  of  the  ABL  proteins. 

Stem  and  Progenitor  Cell  Growth  Regulation 

One  possible  mechanism  of  action  of  onco- 
genic forms  of  ABL  would  be  a  transcriptional 
activation  of  specific  growth  factor  genes,  lead- 
ing to  an  autocrine  stimulation  of  receptors  in 
different  cell  types.  We  have  tested  this  idea  in  a 
pre-B  cell  line  that  is  interleukin-7  (IL-7)  depen- 
dent. Although  all  oncogenic  forms  of  ABL  could 
relieve  the  cell  of  growth  factor  dependence  and 
lead  to  tumor  progression,  there  were  interesting 
differences  between  Abelson  virus  and  the  BCR/ 
ABL  forms.  A  subset  of  the  lines  derived  with 
Abelson  virus  showed  activation  of  a  low  level  of 
transcription  from  the  IL-7  locus,  but  none  of  the 
BCR/ABL  lines  showed  this  activation.  When 
forced  autocrine  expression  of  IL-7  was  engi- 
neered in  the  same  indicator  line,  it  could  render 
the  cells  growth  factor-independent,  but  the 


cells  did  not  progress  to  tumorigenic  behavior.  It 
is  likely  that  a  bypass  of  the  growth  factor  require- 
ments and  additional  signals  initiated  by  the  on- 
cogenic forms  of  ABL  determine  tumor  progres- 
sion for  immature  B  lineage  cell  types. 

In  collaboration  with  Naomi  Rosenberg  of 
Tufts  Medical  School,  we  have  developed  a  sys- 
tem for  retroviral  infection  of  murine  bone 
marrow  stem  cells  with  the  BCR/ABL  oncogene 
and  reimplantation  into  syngeneic  hosts.  This 
procedure  leads  to  tumors  with  the  characteris- 
tics of  human  chronic  myelogenous  leukemia. 
Interestingly,  animals  infected  with  the  PI 85 
BCR/ABL  forms  show  more  aggressive  tumors 
that  invade  nonhematopoietic  organs  like  liver  or 
kidney  and  show  shorter  latency  than  animals  in- 
fected with  the  P2 1 0  BCR/ABL  form  of  the  gene. 
This  system  should  be  valuable  for  analyzing  new 
therapies  directed  at  the  BCR/ABL  oncogene  and 
for  defining  the  growth  regulation  of  primitive 
hematopoietic  stem  cells. 

One  long-range  goal  of  our  group  is  to  develop 
effective  in  vitro  culture  techniques  for  the  prop- 
agation and  enrichment  of  stem  and  progenitor 
cells  for  different  lineages.  Previously  we  used 
the  grovvT:h  stimulatory  properties  of  the  BCR/ 
ABL  oncogene  in  concert  with  selected  bone 
marrow-adherent  stromal  lines  to  grow  clonal 
lines  of  B  lymphoid  progenitor  cells  that  could 
repopulate  the  B  cell  lineage  of  immunodefec- 
tive  (SCID)  mice.  By  modifying  the  culture  con- 
ditions, we  have  now  been  able  to  cultivate  such 
progenitor  cells  without  the  need  for  co-stimula- 
tion by  the  oncogene.  These  populations  are  very 
effective  in  reconstituting  the  B  cell  lineage  in 
vivo  and,  in  preliminary  analysis,  also  show  evi- 
dence of  reconstituting  other  lineages.  We  hope 
to  develop  clonal  lines  that  will  retain  these 
functional  properties. 
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CELLS  use  diverse  molecular  mechanisms  for 
regulating  the  synthesis  of  proteins  in  re- 
sponse to  environmental  changes.  Any  of  the 
many  steps  in  the  pathway  by  which  the  informa- 
tion stored  in  genes  becomes  converted  into  pro- 
teins can  be  subject  to  regulation.  My  laboratory 
is  particularly  interested  in  understanding  the 
process  by  which  messenger  RNA  (mRNA)  be- 
comes translated  into  proteins.  We  wish  to  un- 
derstand fundamental  aspects  of  this  process,  as 
well  as  how  translation  of  certain  proteins  can  be 
regulated  in  response  to  conditions  outside  the 
cell. 

The  information  in  mRNA  is  translated  into 
protein  by  a  large  RNA-protein  complex,  the  ri- 
bosome.  It  has  been  known  for  some  time  that  the 
movement  of  ribosomes  along  the  mRNA  is  not 
linear  with  time.  Rather,  they  pause  at  discrete 
places  along  the  way.  Why  they  do  so  is  not  well 
understood,  but  is  thought  to  be  due  either  to 
rare  codons  or  to  secondary  structures  in  the 
mRNA. 

As  a  postdoctoral  fellow  in  Peter  Walter's  labo- 
ratory, I  devised  a  new  method  to  determine  the 
distribution  of  ribosomes  on  an  mRNA  with 
single-nucleotide  precision.  Using  this  assay,  we 
have  been  able  to  detect  and  map  discrete  sites  of 
ribosome  pausing  during  translation  in  extracts. 
Two  of  these  rate-limiting  steps  correspond  to  ini- 
tiation and  termination  of  translation.  Others  oc- 
cur during  elongation  of  the  nascent  protein.  We 
are  using  this  method  to  identify  features  of 
mRNA  sequence  or  structure  that  result  in  ribo- 
some pausing. 

Surprisingly,  we  have  found  that  the  pausing 
causes  a  sort  of  traffic  jam,  in  that  ribosomes  pile 
up  behind  each  paused  one.  Under  certain  condi- 
tions, we  have  detected  up  to  nine  ribosomes 
piled  up  behind  the  leading  one.  As  a  result,  the 
ribosomes  become  tightly  packed,  such  that  their 


centers  are  only  27  nucleotides  apart.  This  tight 
packing  along  the  mRNA  forces  the  ribosomes  in 
each  cluster  to  orient  themselves  in  a  zigzag 
pattern. 

Although  we  have  only  detected  ribosome 
stacking  in  cell  extracts,  we  are  interested  in  de- 
termining if  this  phenomenon  occurs  in  intact 
cells.  "Zigzag"  or  "helical"  polyribosomes  have 
been  previously  seen  in  the  electron  micrographs 
of  certain  cells,  but  their  functional  significance 
has  been  unclear.  It  seems  likely  that  they  repre- 
sent densely  stacked  ribosome  clusters,  caused  by 
the  pausing  of  the  leading  ribosome. 

We  are  also  investigating  the  mechanism  by 
which  the  translation  of  a  secretory  protein,  insu- 
lin, is  regulated  in  response  to  conditions  outside 
the  cell.  Insulin  is  the  major  protein  involved  in 
regulating  glucose  metabolism  in  mammals.  It  is 
synthesized  by  the  (8  cells  of  the  pancreas,  which 
are  organized  into  clusters  known  as  islets.  When 
primary  cultures  of  rat  pancreatic  islets  are  incu- 
bated in  high  concentrations  of  glucose,  the  syn- 
thesis of  the  insulin  precursor  molecule,  proin- 
sulin,  increases  between  5-  and  10-fold  within 
minutes.  During  the  first  hour  of  glucose  adminis- 
tration, this  increased  synthesis  of  insulin  is 
largely  due  to  more  efficient  translation  of  preex- 
isting mRNA.  Although  the  phenomenon  has 
been  described  for  many  years,  little  is  known  of 
the  molecular  mechanisms  involved. 

We  are  using  the  above  assay,  in  conjunction 
with  biochemical  fractionation,  to  probe  the 
mechanism  by  which  the  translation  of  proinsu- 
lin  mRNA  is  regulated  in  pancreatic  islets.  Our 
initial  results  indicate  that  preproinsulin  transla- 
tion is  regulated  at  both  the  initiation  and  elonga- 
tion levels.  We  hope  that  a  better  understanding 
of  this  phenomenon  will  be  helpful  in  explaining 
how,  in  certain  patients  with  diabetes,  normal  in- 
sulin production  is  impaired. 
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A major  focus  in  my  laboratory  has  been  the 
analysis  of  human  metabolic  disorders  at  the 
molecular  level.  Phenylketonuria  (PKU),  the  dis- 
ease under  investigation,  is  an  inborn  error  in 
amino  acid  metabolism  that  causes  severe  and 
permanent  mental  retardation  in  untreated  chil- 
dren. The  condition  is  caused  by  defects  in  the 
liver  enzyme  phenylalanine  hydroxylase  (PAH) 
and  is  transmitted  from  the  parents  to  both  boys 
and  girls.  In  the  United  States  it  affects  300-400 
new^borns  a  year,  and  1  of  every  50  individuals  is 
an  asymptomatic  carrier  of  the  disease  trait. 

Prenatal  Diagnosis 

Our  laboratory  has  isolated  the  human  PAH 
gene  by  molecular  cloning  and  used  the  cloned 
gene  to  analyze  cellular  DNA  of  normal  and  af- 
fected individuals.  Extensive  benign  variations  in 
this  gene  can  be  used  to  trace  the  transmission  of 
individual  PAH  genes  from  the  parents  to  the 
children  in  PKU  families.  A  fetus  ^Nho  has  in- 
herited the  same  PAH  genes  as  an  affected  sibling 
will  have  PKU.  This  has  led  to  a  prenatal  diagnosis 
procedure  for  PKU  in  families  w^ith  previously 
affected  children,  and  the  procedure  has  been 
adopted  in  the  United  States  and  many  western 
European  countries. 

Carrier  Detection 

The  above  procedure  is  applicable  only  to  fami- 
lies with  affected  children.  To  prevent  PKU  from 
occurring  in  the  first  place,  it  is  necessary  to 
identify  the  asymptomatic  carriers,  which  re- 
quires extensive  knowledge  of  the  gene  defects. 
To  determine  the  molecular  basis  of  PKU  among 
Caucasians,  patient  samples  were  obtained  from 
various  European  countries  and  molecularly  ana- 
lyzed. The  defects  in  the  10  most  common  mu- 
tant PAH  genes  and  a  number  of  less  prevalent 
ones  have  been  characterized. 

The  1 0  prevalent  mutations  together  represent 
about  two-thirds  of  all  PKU  genes  in  Caucasians. 
If  carrier  screening  is  carried  out  today  and  fol- 
lowed by  prenatal  diagnosis  in  couples  who  are 
both  carriers,  it  is  theoretically  possible  to  re- 
duce the  incidence  of  PKU  by  ninefold.  Thus  the 


number  of  PKU  patients  in  this  country  could  po- 
tentially be  reduced  from  300-400  a  year  to  27- 
36  a  year.  It  is  important  to  continue  with  the 
molecular  analysis:  if  90  percent  of  PKU  genes 
could  be  detected  throughout  the  Caucasian  pop- 
ulation, it  would  be  possible  to  reduce  the  num- 
ber of  PKU  patients  to  only  3-4  a  year. 

Population  Dynamics 

A  number  of  these  PKU  genes  have  distinctive 
patterns  of  distribution  on  the  European  conti- 
nent. One  is  very  prevalent  in  eastern  Europe,  and 
the  frequency  decreases  in  a  gradual  fashion  from 
east  to  west.  These  results  suggested  that  the  mu- 
tation occurred  in  eastern  Europe  some  time  ago 
and  was  then  spread  throughout  the  European 
continent  by  migration  of  the  people  in  prehis- 
toric times.  Similarly,  another  PKU  gene  is  very 
frequent  in  northern  Europe  but  less  frequent  in 
the  neighboring  countries.  These  results  sug- 
gested that  multiple  PKU  mutations  occurred  in- 
dependently in  various  parts  of  Europe  and  then 
spread  throughout  the  Caucasian  race. 

Somatic  Gene  Therapy 

The  other  major  goal  of  our  laboratory  is  to  ex- 
plore the  potential  for  somatic  gene  therapy  of 
the  genetic  disorder.  The  PAH  gene  has  been  in- 
serted into  the  genome  of  an  incapacitated  virus. 
The  recombinant  viruses  are  able  to  infect  mam- 
malian cells  and  incorporate  themselves  into  the 
genome  of  the  infected  cells,  but  they  are  no 
longer  able  to  produce  new  virus  to  continue  the 
infection  process.  The  recombinant  viruses  were 
used  to  infect  cultured  rodent  hepatoma  cells, 
thereby  conferring  on  them  an  ability  to  synthe- 
size the  corresponding  human  enzyme.  More  re- 
cently we  have  demonstrated  that  normal  hepatic 
cells  from  mice  can  be  cultured  and  efficiently 
infected  by  the  recombinant  virus. 

These  results  have  led  to  experimentation  with 
hepatocyte  transplantation  in  laboratory  animals. 
A  variety  of  inert  substances  were  used  as  support 
for  the  mouse  hepatic  cells  prior  to  transplanta- 
tion. Unfortunately,  the  transplanted  hepatocytes 
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lived  for  only  a  few  weeks  in  the  recipient  ani- 
mals. However,  when  the  hepatocytes  were  re- 
turned to  the  liver  of  mice  by  direct  injection  into 
the  portal  vein  or  the  spleen,  the  cells  incorpo- 
rated themselves  into  the  liver  parenchyma  and 
lived  fpr  as  long  as  the  mice  did.  They  also  con- 
tinued to  function  as  hepatocytes  in  the  trans- 
planted animals.  These  exciting  results  should 
form  the  foundation  for  the  curing  of  liver  dis- 
orders in  humans  by  gene  therapy  in  the  future. 

The  investigation  of  the  feasibility  of  gene  ther- 
apy for  PKU  has  been  hampered  by  the  lack  of  an 


animal  model.  Recently,  the  laboratory  of  Wil- 
liam Dove  at  the  University  of  Wisconsin  has  re- 
ported the  creation  by  chemical  mutagenesis  of  a 
mouse  that  is  deficient  in  PAH  activity  in  the 
liver.  Our  initial  experiments  with  these  mice 
have  shown  that  the  PAH  gene  can  indeed  be 
transferred  into  their  hepatocytes  in  culture  and 
reconstitute  fully  the  missing  enzyme  activity. 
This  mouse  model  will  be  critically  important  for 
us  to  test  the  efficacy  of  our  hepatic  gene  transfer 
and  hepatocyte  transplantation  protocols  for  cor- 
rection of  the  hepatic  deficiency. 
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VISION  begins  in  the  rods  and  cones  of  the 
retina,  where  light  is  absorbed  and  trans- 
duced into  a  neural  signal  consisting  of  an  elec- 
trical hyperpolarization  at  the  photoreceptor 
membrane.  This  signal  is  relayed  to  second-order 
neurons  in  the  retina  through  a  modulation  of  the 
release  of  synaptic  transmitter  at  the  photorecep- 
tor's terminal.  In  darkness  the  transmitter  is  re- 
leased at  a  high  rate,  and  in  light  the  membrane 
hyperpolarization  reduces  the  release  in  a  graded 
fashion.  This  modulation  of  synaptic  transmitter 
release  can  lead  to  a  hyperpolarizing  or  depolar- 
izing response  to  light  in  a  second-order  neuron, 
depending  on  the  polarity  of  a  given  synapse. 

The  phototransduction  process — the  way  the 
hyperpolarizing  response  to  light  is  generated  in 
the  receptors — is  as  follows.  In  darkness  an  ionic 
conductance  in  the  plasma  membrane  of  the  re- 
ceptor's outer  segment  (the  part  of  the  cell  that 
contains  the  visual  pigment)  is  kept  open  by  the 
cyclic  nucleotide  guanosine  3':5'-cyclic  mono- 
phosphate (cGMP),  letting  both  Na^  and  Ca^^ 
into  the  cell.  This  "dark"  current  depolarizes  the 
cell  and  causes  the  steady  release  of  synaptic 
transmitter  described  above. 

Light  activates  the  following  reaction  cascade: 
light  -*■  photoisomerization  of  visual  pigment 
G  protein  activation  cGMP  phosphodiesterase 
stimulation  cGMP  hydrolysis.  As  a  result,  the 
cGMP  level  falls  in  the  outer  segment,  causing 
the  ionic  conductance  to  close  and  leading  se- 
quentially to  the  membrane  hyperpolarization 
and  the  reduction  of  synaptic  transmitter  release. 
This  phototransduction  scheme  applies  to  both 
rods  and  cones,  with  only  quantitative  differ- 
ences between  the  two  types  of  receptors. 

One  consequence  of  the  conductance  closure 
in  the  light  is  that  the  Ca^^  influx  stops.  The  re- 
sulting imbalance  between  influx  and  efflux 
leads  to  a  decrease  in  the  intracellular  free  Ca^"^ 
concentration.  This  Ca^"^  decrease  reduces  a  tonic 
inhibition  exerted  by  Ca^^  on  the  cGMP-synthe- 
sizing  enzyme  guanylate  cyclase  and  causes  an 
increase  in  the  synthesis  of  cGMP  in  the  light. 
Thus  Ca^^  mediates  a  negative  feedback  on  the 
light-activated  cGMP  hydrolysis,  and  this  feed- 


back should  be  a  candidate  mechanism  underly- 
ing the  well-known  phenomenon  of  background 
light  adaptation  in  photoreceptors.  Indeed,  we 
have  found  that  this  adaptation  essentially  disap- 
pears upon  removing  the  feedback  experimen- 
tally by  eliminating  the  Ca^^  influx  and  efflux. 

We  have  now  gone  on  to  ask  the  complemen- 
tary question — whether  we  can  quantitatively 
predict  the  background  adaptation  in  photore- 
ceptors from  the  known  properties  of  the  Ca^"*^ 
feedback.  First,  we  derived  the  time  course  of 
light-induced  phosphodiesterase  activity  (which 
hydrolyzes  cGMP)  from  a  cell's  response  to  a  dim 
flash  in  the  condition  of  no  feedback.  Next,  we 
introduced  the  equations  that  describe  the  Ca^^ 
feedback,  the  components  of  which  include  a 
Ca^^  influx  through  the  cGMP-gated  conduc- 
tance, a  Ca^"^  efflux  through  a  Na^-Ca^^  exchange 
mechanism,  and  the  Ca^"^  modulation  on  the 
guanylate  cyclase.  All  of  these  processes  are 
known  in  quantitative  detail  from  previous  work. 

Computations  showed  that  such  a  simple 
model  of  the  Ca^"^  feedback  could  indeed  predict 
quite  well  a  cell's  physiological  response  to  a 
dim  flash.  Furthermore,  the  model  led  to  a 
steady-state  response-intensity  relation  for  steps 
of  light  that  also  fitted  well  the  experimental  re- 
lation. The  predicted  response-intensity  relation 
is  consistent  with  Weber's  law,  which  constitutes 
the  classical  description  of  background  light  ad- 
aptation. In  summary,  there  is  both  experimental 
and  theoretical  support  for  the  Ca^"^  feedback  to 
be  the  predominant  mechanism  underlying  back- 
ground adaptation  in  photoreceptors. 

Another  problem  we  are  working  on  is  a  molec- 
ular characterization  of  the  cGMP-activated  con- 
ductance mediating  phototransduction.  This 
conductance  now  appears  to  belong  to  a  family  of 
cyclic  nucleotide-gated  channels  that  includes  a 
channel  in  olfactory  cilia  supposedly  mediating 
olfactory  transduction.  The  known  channels  in 
this  family  so  far  are  all  slightly  diff^erent  from  one 
another  in  their  functional  characteristics.  For  ex- 
ample, the  rod  and  cone  channels  show  similar 
activation  by  cGMP  but  differ  in  the  current- 
voltage  relation.  The  olfactory  channel,  on  the 
other  hand,  diff'ers  from  both  the  rod  and  cone 
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channels  by  having  a  much  higher  affinity  for 
cGMP  (and  also  adenosine  3':5'-cyclic  monophos- 
phate, or  cAMP,  another  important  cyclic 
nucleotide) . 

Recently  the  rod  channel  in  bovine  retina  was 
purified  and  cloned  by  others.  On  the  basis  of 
structural  homology  to  the  rod  channel,  Randall 
Reed,  another  HHMI  investigator  at  Johns  Hop- 
kins, has  also  obtained  a  cDNA  clone  for  the  rat 
olfactory  channel.  We  examined  the  physiology 
of  this  channel,  after  expression  in  a  mammalian 
cell  line,  and  found  that  it  has  a  current-voltage 
relation  remarkably  similar  to  that  of  the  rod 


channel,  except  that  cGMP  and  cAMP  activate 
this  channel  about  50  times  more  effectively  than 
they  do  the  rod  channel.  The  next  step  to  be  taken 
is  to  generate  hybrids  between  the  two  channels, 
in  order  to  identify  the  part  of  the  molecules  that 
determines  affinities  for  cyclic  nucleotides. 

In  addition,  with  Dr.  Reed's  help,  we  have  iso- 
lated, again  by  structural  homology,  several 
clones  from  a  human  retinal  cDNA  library  that 
may  encode  for  both  the  rod  and  cone  channels 
in  the  human  retina.  Experiments  are  now  un- 
der way  to  test  these  clones  for  functional 
expression. 


490 


Molecular  Mechanisms  of  Ion  Channel  Function 


Gary  Yellen,  Ph.D. — Assistant  Investigator 

Dr.  Yellen  is  also  Assistant  Professor  of  Neuroscience  and  Biophysics  at  the  Johns  Hopkins  University 
School  of  Medicine.  He  received  his  undergraduate  degree  in  biochemical  sciences  from  Harvard  College 
and  his  Ph.D.  degree  in  physiology  from  Yale  University,  where  he  studied  with  Charles  Stevens.  Dr.  Yellen 
did  his  postdoctoral  research  on  ion  channel  physiology  as  a  Life  Sciences  Research  Foundation  postdoc- 
toral fellow  at  Brandeis  University,  where  he  worked  with  Christopher  Miller. 


ALL  electrical  signaling  in  the  nervous  system 
is  controlled  by  ion  channels,  a  class  of 
membrane  proteins  that  form  pores  through  the 
membrane.  Charged  ions  such  as  sodium,  potas- 
sium, and  calcium  pass  through  ion  channels  and 
carry  an  electrical  current.  The  channels  them- 
selves are  regulated,  so  that  the  pores  are  only 
open  when  the  proper  chemical  or  electrical  sig- 
nal is  present,  and  only  certain  ions  can  pass 
through  a  particular  kind  of  channel.  By  under- 
standing how^  channels  open  and  close  and  how 
they  are  regulated,  we  define  the  repertoire  of 
molecular  changes  used  by  neurons  when  they 
signal,  sense,  and  learn. 

Ion  channels,  like  other  membrane  proteins, 
have  resisted  standard  biochemical  and  structural 
analysis.  Their  structure  has  only  recently  begun 
to  be  elucidated  by  a  combination  of  protein 
chemistry  and  molecular  biology.  On  the  other 
hand,  we  have  detailed  knowledge  of  the  func- 
tioning of  ion  channels.  Because  each  ion  chan- 
nel catalyzes  the  transport  of  millions  of  ions  per 
second,  we  can  measure  electrically  the  current 
carried  by  just  a  single-channel  protein  molecule. 
This  technique  of  single-channel  recording  has 
allowed  us  to  make  a  detailed  model  for  the  con- 
formational changes  between  open  and  closed 
states  induced  by  chemical  ligands  and  changes 
in  voltage,  but  we  still  have  no  knowledge  of  the 
protein  structures  that  underlie  these  conforma- 
tional changes. 

My  laboratory  is  using  two  complementary  ap- 
proaches to  relate  ion  channel  function  to  struc- 
ture. Site-directed  mutagenesis  allows  us  to  mod- 
ify any  amino  acid  in  a  protein  for  which  we  have 
the  cloned  genetic  material.  Rather  than  modify- 
ing the  protein  directly,  we  change  the  DNA  se- 
quence and  then  inject  the  modified  messenger 
RNA  into  immature  frog  eggs  (oocytes),  which 
manufacture  the  modified  protein.  This  method 
allows  us  to  test  specific  theories  about  which 
parts  of  the  channel  protein  are  important  for  spe- 
cific functional  features. 

We  have  applied  this  first  strategy  to  voltage- 


activated  potassium  channels,  which  participate 
in  electrical  signaling.  By  systematic  mutagene- 
sis, we  have  identified  the  region  of  the  potas- 
sium channel  protein  that  lines  the  pore  through 
which  ions  cross  the  membrane.  We  found  spe- 
cific amino  acid  residues  in  the  protein  sequence 
that  control  the  sensitivity  of  the  channel  to  tet- 
raethylammonium,  an  organic  ion  that  can  block 
current  through  the  channel.  Natural  variation  of 
one  of  these  amino  acids  explains  the  differences 
in  drug  sensitivity  between  potassium  channels 
in  different  organs  or  species.  Amino  acids  in  this 
region  of  the  protein  can  also  affect  the  rate  at 
which  ions  are  conducted  through  the  pore. 
These  discoveries  put  us  in  a  position  to  discover 
the  basic  mechanisms  of  ion  selectivity  and  chan- 
nel gating  at  the  level  of  individual  amino  acids. 

We  have  also  used  this  strategy  to  study  acetyl- 
choline-activated  cation  channels,  which  convert 
neurochemical  signals  into  electrical  signals  at 
synapses.  We  have  changed  amino  acids  in  the 
region  of  the  protein  that  binds  acetylcholine  and 
identified  specific  residues  that  play  a  critical 
role  in  binding  and  signal  transduction  by  acetyl- 
choline. These  studies  are  teaching  us  more 
about  the  molecular  basis  of  drug  recognition 
and  signal  transduction  in  this  protein. 

A  second  approach  for  finding  parts  of  the  pro- 
tein that  are  important  for  specific  functions  in- 
volves mutant  selection.  We  are  developing  a  sys- 
tem in  which  we  can  randomly  mutagenize  the 
gene  for  a  channel  and  then  select  the  interesting 
mutants.  This  strategy  is  almost  the  reverse  of 
site-directed  mutagenesis.  Instead  of  choosing  to 
change  a  particular  part  of  the  protein  and  look- 
ing for  the  resultant  changes  in  function,  we  se- 
lect for  a  change  in  function  and  then  look  at  the 
underlying  structural  changes.  We  have  ex- 
pressed the  genes  for  the  nicotinic  acetylcholine 
receptor  in  tissue-cultured  mammalian  cell  lines 
in  preparation  for  this  mutant  selection  strategy. 
We  have  also  found  a  method  for  selecting  certain 
channel  mutants  by  using  a  fluorescently  tagged 
toxin  molecule  that  binds  differently  to  these 
mutant  channels. 
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OF  the  billions  of  cells  composing  the  mam- 
malian brain,  a  few  thousand,  located  in  the 
hypothalamus,  function  together  to  form  a  biolog- 
ical clock.  This  clock  can  control  the  timing  of 
daily  behaviors  such  as  sleep  and  wakefulness 
with  an  accuracy  of  minutes.  Although  chemical 
and  electrical  rhythms  have  been  detected  in 
these  mammalian  neural  pacemakers,  little  is 
known  about  the  underlying  biochemistry  used 
by  these  cells  to  compute  time. 

In  a  simpler  model  organism,  the  fruit  fly  Dro- 
sophila, genes  and  proteins  central  to  biological 
timing  are  beginning  to  be  recovered  and  charac- 
terized. Mutations  in  several  genes  alter  the  fruit 
fly's  circadian  (daily)  rhythms  of  locomotor  activ- 
ity, which  can  be  compared  to  wake/sleep  cycles 
in  mammals.  The  same  mutations  also  alter  non- 
circadian,  high-frequency  rhythms.  Normal  fruit 
flies  produce  a  courtship  song  composed  of 
wing-beating  cycles  with  a  periodicity  of  about  a 
minute.  At  the  cellular  level,  changes  in  chemical 
and  electrical  conductances  are  measured  in 
some  of  the  mutants,  possibly  linking  measure- 
ment of  time  to  controlled  transmission  of  signals 
between  cells. 

The  best-studied  gene  in  the  Drosophila  clock 
system  is  the  per  {period)  locus.  Three  mutant 
forms  of  per  have  been  recovered  that  affect  the 
pace  of  the  insect's  clock.  In  the  per^  mutant, 
circadian  rhythms  have  a  long  period  of  30  rather 
than  24  hours.  For  the  mutant  per^,  daily  cycles 
have  a  shortened,  19-hour  period.  The  pet*"  mu- 
tants have  no  daily  rhythms.  Corresponding 
changes  are  found  in  the  courtship  song;  an 
80-second  song  for  per^,  40  seconds  for per^,  and 
song  arrhythmicity  for  per^.  For  at  least  some  tis- 
sues, per*^  and  per^  mutants  exhibit  lower  than 
normal  levels  of  intercellular  communication, 
while  per^  mutants  show  hypernormal  communi- 
cation. These  eff^ects  appear  to  be  mediated  by 
changes  in  conductance  of  specialized  channels, 
or  gap  junctions,  between  cells. 

The  molecular  changes  associated  with  the 
mutations  have  been  established.  Although  per*" 
cannot  make  a  full-length  protein,  per^  and  per' 
each  make  a  per  protein  that  is  altered  by  substi- 


tution of  a  diff'erent,  single  amino  acid.  Compara- 
ble changes  in  cycle  time  can  also  be  produced 
by  altering  the  amount  of  per  protein  the  fly  pro- 
duces. For  example,  Drosophila  that  have  re- 
ceived by  microinjection  a  new  gene  that  under- 
produces the  per  protein  20-fold  have  40-hour 
daily  rhythms. 

A  variety  of  experiments  have  shown  us  that  the 
per  protein  functions  in  the  nervous  system  to 
control  daily  and  circaminuten  rhythms.  Re- 
cently we  have  become  interested  in  tracking  the 
development  of  the  fly's  clock,  in  an  eff^ort  to 
identify  the  first  cells  making  the  per  protein.  A 
Drosophila  embryo  receiving  a  flash  of  light  only 
hours  after  its  conception  will  develop  into  an 
adult  fly  showing  a  daily  routine  of  activity  in 
phase  with  the  timing  of  the  embryonic  light  sig- 
nal. The  fact  that  an  adult  fly  can  "remember" 
this  timing  must  mean  that  a  clock  was  running 
when  the  light  signal  was  received  and  has  been 
running  ever  since.  These  experiments  show  that 
the  embryo's  circadian  clock  starts  running 
within  1 2  hours  of  fertilization  of  the  egg,  a  time 
at  which  as  few  as  50  cells  may  be  found  making 
the  per  protein. 

Until  recently  only  the  per  locus  was  known  to 
be  essential  for  production  of  biological  rhythms 
in  the  fruit  fly.  Genetic  screens  for  rhythm  muta- 
tions have  led  to  the  discovery  of  other  indispens- 
able genes.  Of  special  interest  is  the  soiree  gene. 
The  soiree  mutants  are  arrhythmic  in  constant 
darkness  but  show  nocturnal  behavior  in  the  pres- 
ence of  a  light/dark  cycle.  The  per^  mutants  are 
diurnal  in  the  same  light  cycles.  In  behavioral 
tests  of  per^  soiree  double  mutants,  the  nocturnal 
behavior  is  expressed,  indicating  a  more  central 
role  for  the  new  gene. 

Development  of  Skin,  Muscle,  and  Nerve 

The  nervous  system  and  skin  are  derived  from  a 
common  set  of  cells  in  the  embryo,  the  ectoderm. 
Each  cell  within  the  ectoderm  must  choose  a  fate, 
and  in  certain  mutants  of  Drosophila  the  choice 
goes  awry.  Neurogenic  mutants  have  lost  the  ca- 
pacity to  choose  skin,  so  that  only  nerve  is 
formed.  We  believe  these  mutants  have  lost  the 
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ability  to  form  a  set  of  early  acting  developmental 
signals,  so  we  are  using  the  mutations  to  mark, 
isolate,  and  characterize  the  products  of  these 
genes  and  putative  developmental  signals.  Neu- 
rogenic mutations  have  been  recovered  at  seven 
genetic  loci:  Notch,  big  brain,  mastermind, 
neuralized.  Delta,  almondex,  and  Enhancer  of 
split. 

Notch  has  been  characterized  more  completely 
than  any  other  gene  in  the  neurogenic  group.  The 
gene  produces  a  very  large  protein  (2,700  amino 
acids)  that  is  predominantly  composed  of  an  un- 
interrupted array  of  36  copies  of  a  hormone-like 
molecule,  a  relative  of  epidermal  growth  factor. 
The  Notch  protein  spans  the  cellular  membrane 
with  the  36  hormone  copies  exposed  to  its  neigh- 
bors. We  suspect  that  the  neighbors  read  that  sig- 
nal and  send  signals  to  the  TVo^c^-bearing  cell  in 
return  to  come  up  with  the  correct  allocations  of 
skin  and  nerve.  Biochemical  experiments  have 
shown  that  the  entire  string  of  hormone  copies  is 
tethered  to  the  cell's  surface;  thus  signaling  be- 
tween cells  must  be  intimate.  Only  cells  that  can 
touch  each  other  could  communicate  through 
such  a  protein. 

From  work  with  temperature-sensitive  muta- 
tions we  know  that  Notch  proteins  are  used  to 
instruct  development  throughout  embryonic, 
larval,  and  pupal  life.  Mutations  altering  the 
structures  of  individual  hormone-like  elements 
of  the  Notch  protein  have  been  examined  to  un- 
derstand the  role  each  plays  in  early  and  late  de- 
velopment. Of  the  36  hormone  repeats,  no  two 
are  identical  and,  importantly,  changes  in  differ- 
ent hormone  elements  produce  different  develop- 
mental abnormalities.  Thus  alternate  parts  of  the 
Notch  hormone  string  must  be  read  as  develop- 
ment unfolds.  In  part  these  specificities  could  al- 


low a  cell  to  talk  to  changing  neighbors  from  the 
time  of  cell  birth  to  differentiation  to  form  adult 
tissue. 

What  do  signals  from  these  genes  tell  a  cell  to 
do?  For  several  years  it  has  been  argued  that  gene 
action  at  Notch,  Delta,  big  brain,  almondex, 
neuralized,  mastermind,  and  Enhancer  of  split 
stimulates  an  undifferentiated  ectodermal  cell  to 
develop  as  an  epidermal  cell;  the  genes  are  ex- 
pected to  provide  a  series  of  epidermalizing  sig- 
nals during  cell  differentiation,  with  loss  of  func- 
tion generating  a  nerve  cell.  New  work  shows  that 
this  simple  picture  cannot  be  accurate. 

Notch  proteins  have  now  been  found  in  cells 
giving  rise  to  embryonic  muscle.  In  Notch  mu- 
tants, strong  effects  on  muscle  development  are 
seen,  with  increased  numbers  of  some  muscle 
cell  types  generated,  probably  at  the  expense  of 
others.  Thus  parallel  changes  in  muscle,  skin,  and 
nerve  development  take  place  in  Notch  mutants. 
Of  most  significance,  comparable  effects  on  mus- 
cle development  are  seen  with  mutations  of 
Delta,  big  brain,  mastermind,  almondex,  En- 
hancer of  split,  and  neuralized. 

We  have  learned  three  things  from  these  stud- 
ies. 1)  The  developmental  fates  of  many  cell 
types  are  switched  in  neurogenic  mutants.  2)  The 
genes  must  provide  differentiation  signals  that 
can  be  read  by  cells  composing  different  germ 
layers,  with  no  apparent  overlap  in  the  cells'  final 
developmental  fates.  3)  The  genes  defined  by  the 
neurogenic  mutations  probably  work  together  to 
form  a  single  developmental  pathway,  which  gen- 
erates a  common  differentiation  signal  in  all  cells 
affected  in  the  mutants.  This  last  conclusion  fol- 
lows from  our  observation  that  any  developmen- 
tal anomaly  caused  by  loss  of  one  gene  in  the 
group  predicts  a  comparable  developmental 
change  upon  loss  of  any  other  gene  in  the  series. 
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THE  remarkable  process  of  development  re- 
quires that  a  fertilized  egg  undergo  many 
rounds  of  cell  division  with  accompanying  differ- 
entiation to  generate  specialized  cell  types  that 
ultimately  form  the  mature  organism.  For  devel- 
opment to  proceed  normally,  just  the  right  num- 
ber and  types  of  cells  must  be  available  at  each 
stage.  It  follows  that  any  cell's  decision  to  divide 
and/or  differentiate  must  be  carefully  regulated. 
These  two  critical  processes,  proliferation  and 
differentiation,  are  also  controlled  in  the  adult — 
for  example,  during  the  maintenance  of  tissues 
and  in  wound  healing. 

The  decision  of  a  cell  to  divide  or  to  express  a 
specialized  function  is  often  determined  by  sig- 
nals from  its  environment.  Prominent  among  the 
agents  that  convey  such  signals  are  the  growth 
factors,  which  are  polypeptides  synthesized  and 
secreted  by  cells.  Our  laboratory  studies  the  mo- 
lecular mechanisms  by  which  growth  factors  and 
other  transmembrane  signaling  agents  exert  their 
effects  on  cell  proliferation  and  differentiation. 
The  transforming  genes  of  DNA  tumor  viruses 
such  as  adenovirus  often  modify  these  programs 
and  are  therefore  useful  for  dissecting  the  growth 
regulatory  pathways. 

Growth  factors  transmit  signals  to  cells  by  bind- 
ing to  specific  receptor  proteins,  which  span  the 
cell's  plasma  membrane  and  induce  second  mes- 
senger signals  in  the  cytoplasm.  The  latter  signals 
have  a  multitude  of  targets,  some  in  the  cyto- 
plasm and  some  in  the  nucleus.  Although  individ- 
ual growth  factors  may  exert  profound  changes 
on  cells,  some  effects,  such  as  the  induction  of 
cell  proliferation,  may  require  the  combined  ac- 
tions of  more  than  one  growth  factor.  When  the 
signal  pathways  are  inappropriately  activated, 
cells  may  lose  control  of  growth  and  a  tumor  may 
arise.  When  the  pathways  are  blocked,  essential 
cell  types  may  degenerate  and  die.  It  follows  that 
errors  in  signaling  can  result  in  diverse  sorts  of 
diseases. 

In  the  case  of  the  nervous  system,  transmem- 
brane signals  induced  by  neurotransmitters  regu- 
late the  properties  of  neurons.  These  small  mole- 


cules, released  by  neurons  at  synapses,  bind  to 
receptors  on  postsynaptic  target  cells.  Neuro- 
transmitter stimulation  of  target  neurons  is  a  criti- 
cal step  in  the  rapid  transmission  of  nerve  im- 
pulses, but  it  can  also  more  slowly  regulate  the 
activities  of  specific  genes,  allowing  the  nervous 
system  to  modify  its  properties  in  response  to  its 
environment.  Such  modification  may  underlie 
the  processes  of  neural  adaptation  and  memory. 

Our  laboratory  has  identified  a  group  of  imme- 
diate early-response  genes  that  are  rapidly  in- 
duced by  growth  factor  stimulation  and  appear  to 
be  primary  targets  in  the  nucleus  for  the  growth 
factor-induced  signals.  Our  work  focuses  on  the 
c-fos  gene,  which  is  very  rapidly  induced  by  a 
wide  range  of  transmembrane  signals.  This  gene 
encodes  a  protein,  c-Fos,  which  is  a  member  of  a 
family  of  transcription  factors  that  bind  to  spe- 
cific sites  in  the  regulatory  regions  of  other  genes 
and  thereby  control  their  activity.  In  this  manner 
c-Fos  acts  as  an  intermediary  for  the  conversion  of 
short-term  transmembrane  signals  into  longer- 
term  changes  in  the  cell. 

We  have  shown  that  a  protein  kinase,  Raf, 
which  phosphorylates  proteins  on  serine  and 
threonine  residues  and  which  resides  primarily  in 
the  cytoplasm,  has  a  role  in  transmitting  these 
activating  signals  to  the  c-fos  promoter  in  the  nu- 
cleus. Mutations  that  deregulate  Raf  can  induce 
human  tumors,  and  our  work  suggests  that  this 
may  follow  from  aberrant  activation  of  the  ex- 
pression of  early  response  genes  such  as  c-fos. 

We  have  also  identified  novel  transcription  fac- 
tor proteins  that  bind  to  regulatory  regions  of  the 
c-fos  promoter  and  activate  the  gene  in  response 
to  second  messenger  signals.  One  factor  is  re- 
sponsive to  cAMP  and  moves  from  the  cytoplasm 
to  the  nucleus  upon  activation.  It  apparently  co- 
operates with  other  factors  to  enable  a  wide  range 
of  transmembrane  signals  to  activate  the  gene. 

We  are  particularly  concerned  with  the  role  of 
c-fos  in  programs  of  neuronal  differentiation  in- 
duced by  nerve  growth  factor  (NGF).  In  vivo, 
NGF  is  required  for  the  differentiation  and  main- 
tenance of  peripheral  neurons.  Expression  of 
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c-fos  appears  to  be  a  first  step  in  the  activation  of 
a  multistage  gene  expression  program  induced  by 
NGF  that  can  culminate  in  cell  division  or  the 
induction  of  terminally  differentiated  functions. 

In  our  model  neuronal  cells,  we  have  shown 
that  neurotransmitters  as  well  as  growth  factors 
can  also  induce  the  expression  of  c-Fos.  It  is  ap- 
parent from  this  and  other  studies  that  c-Fos  has  a 
critical  role  in  the  adult  nervous  system,  not  just 
in  neural  development.  We  have  shown  that  c-Fos 
induced  by  NGF  can  cooperate  with  other  factors 
to  induce  the  gene  for  tyrosine  hydroxylase  (TH) , 
an  enzyme  that  catalyzes  a  critical  step  in  the  pro- 
duction of  neurotransmitters  in  the  catechol- 
amine family.  This  participation  of  c-Fos  in  TH 
control  may  allow  a  neuron  to  coordinate  the  pro- 
duction of  catecholamine  neurotransmitters  with 
the  activity  of  the  neuron  during  its  function  in 
the  nervous  system. 

We  have  gone  on  to  show  that  NGF  induces 
other  genes  as  well.  One  of  these  encodes  peri- 
pherin,  a  neuron-specific  intermediate  filament 
protein  that  is  present  in  the  axons  of  peripheral 
neurons  as  a  component  of  the  neuronal  cytoskel- 
eton.  Our  studies  of  the  developing  rat  nervous 
system  indicate  that  peripherin  expression  coin- 
cides with  the  final  steps  of  neuronal  maturation 
and  acquisition  of  function.  Thus  the  mecha- 
nisms that  control  peripherin  expression  appear 
to  be  quite  distinct  from  those  that  control  c-Fos 
or  TH. 


Study  of  these  mechanisms  may  give  a  clue  to 
how  cells  permanently  exit  from  the  cell  cycle 
and  induce  the  expression  of  genes  that  they  em- 
ploy after  losing  the  capacity  to  proliferate.  One 
event  that  may  block  exit  from  the  cell  cycle  is 
the  uncontrolled  expression  of  c-myc,  another 
growth  factor-induced  gene.  It  encodes  a  pro- 
tein, c-Myc,  that  is  distantly  related  to  c-Fos  and 
has  a  specialized  but  poorly  understood  function 
in  inducing  cell  proliferation.  It  is  expressed  at 
abnormally  high  levels  in  many  tumors.  We  have 
identified  a  DNA  nucleotide  sequence  to  which 
c-Myc  can  bind  and  a  protein  partner  of  c-Myc 
that  can  stimulate  its  DNA  binding.  Studies  of  the 
complexes  of  c-Myc  with  DNA  may  reveal  the  role 
of  c-Myc  in  normal  cell  proliferation  and  in 
tumori  genesis. 

We  find  that  c-Myc  expression  is  elevated  in  the 
naturally  occurring  childhood  brain  tumor  me- 
dulloblastoma,  where  it  is  accompanied  by  a 
block  to  differentiation.  We  also  find  that  the 
transforming  gene  of  adenovirus,  Ela,  like  c- 
myc,  can  block  the  differentiation  of  our  model 
neurons,  mimicking  the  state  of  the  tumor. 

We  seek  to  understand  how  cells  respond  to 
growth  factor  signals  by  proliferating  and  how 
cells  coordinate  their  final  differentiation  with 
their  exit  from  the  cell  cycle.  Future  studies  will 
emphasize  the  roles  of  transmembrane  signaling 
in  adult  nervous  system  function. 
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THE  nervous  system  provides  a  dramatic  exam- 
ple of  the  capabilities  of  cells  to  signal  each 
other  and  to  respond  to  environmental  cues.  An 
understanding  of  signal  transduction  mechanisms 
is  essential  for  elucidating  the  cellular  and  molec- 
ular basis  of  information  processing  in  biological 
systems.  Phototransduction,  the  neuronal  excita- 
tion process  triggered  by  light  and  resulting  in 
the  generation  of  a  receptor  potential,  provides 
an  ideal  model  system  in  which  to  study  signal 
transduction  in  response  to  an  environmental 
stimulus.  The  aim  of  our  research  program  is  to 
elucidate  mechanisms  used  for  signal  transduc- 
tion in  the  visual  system,  using  a  combined  mo- 
lecular, genetic,  and  physiological  approach. 

Drosophila  is  unique  as  an  experimental  organ- 
ism in  that  it  allows  a  multidisciplinary  approach 
to  biological  questions  that  will  yield  not  only 
much  new  information,  but  may  well  provide  a 
type  of  information  not  otherwise  obtainable. 
First,  the  system  is  amenable  to  classical  genetics: 
over  the  past  decade  several  groups  have  isolated 
a  large  number  of  Drosophila  mutants  with  spe- 
cific defects  in  behavior  mediated  by  visual  in- 
put. Many  of  these  have  been  shown  to  define 
genes  important  for  phototransduction.  Second, 
since  the  eye  is  fully  dispensable,  it  can  be  manip- 
ulated without  affecting  viability.  Third,  genes 
can  be  readily  introduced  into  the  Drosophila 
germline.  Thus  genes  and  the  proteins  they  en- 
code can  be  experimentally  manipulated  in  vitro 
and  their  functions  studied  in  vivo  in  their  nor- 
mal cellular  and  organismal  environment. 

Experimental  Strategy 

Over  the  past  few  years  my  colleagues  and  I 


have  been  working  on  the  isolation  and  character- 
ization of  genes  important  for  photoreceptor  cell 
function.  Recently  our  efforts  have  focused  on 
the  characterization  of  genes  encoding  products 
involved  in  the  intermediate  events  of  the  visual 
transduction  cascade. 

An  example  of  the  power  and  utility  of  a  ge- 
netic approach  to  identify  molecules  with  unex- 
pected roles  is  the  ninaA  gene.  ninaA  mutants 
have  a  10-  to  20-fold  reduction  of  rhodopsin  lev- 
els. Characterization  of  the  ninaA  gene  has 
shown  that  it  encodes  a  visual  system-specific 
polypeptide  with  high-amino  acid  sequence 
identity  to  the  human  cyclosporin  A-binding 
protein,  cyclophilin.  Cyclosporin  A  is  an  immu- 
nosuppressing  drug  widely  used  to  prevent  graft 
rejection  following  transplant  surgery.  Cyclophi- 
lins  are  peptidyl-prolyl  cis-trans  isomerases  that 
catalyze  isomerization  about  the  peptide  bond  be- 
tween a  proline  residue  (Pro)  and  its  amino 
neighbor  (Xaa).  A  model  for  ninaA  function  is 
that  opsin  is  a  substrate  for  the  ninaA  protein; 
isomerization  about  one  or  several  Xaa-Pro  pep- 
tide bonds  may  be  crucial  for  proper  membrane 
intercalation,  folding,  or  stability  of  opsins. 

In  addition  to  ninaA,  we  are  also  studying  pro- 
tein kinases  and  two  arrestins  involved  in  regu- 
lating this  signaling  process.  We  are  using  molec- 
ular, genetic,  and  physiological  approaches  to 
dissect  the  role  of  these  proteins  in  vivo.  The 
results  of  these  studies  should  help  us  understand 
the  molecular  basis  of  sensory  reception  and  in- 
formation processing  and  may  help  us  understand 
abnormalities  in  the  human  visual  and  nervous 
systems. 
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